Body Sort
This was the first sort I created on my own.
Background
This sort is designed to do as much processing as possible in a small working memory data area.
It then uses a "horse sort" to combine the working data sets.
Purpose
This algorithm is designed for cases where the working memory is limited and the external or regular memory is much slower.
It is also good for cases were the data is being added slowly or is not all ready at the same time.
Basic Design
This sort has some important similarities with the horse sort. One part is how it uses a very similar system of breaking the list down into horses that are runs from the bigger list. The merging the the list takes a surprising small amount of the overall time. Most of the time is from making the runs. In this case we use a modified priority queue to help us make the runs.
This sort uses a priority queue to make short "runs" of in-order sub-lists. The sublists are then combined latter.
The rules of the queue (At least in this case) are as follows.
The queue can have zero or more numbers. (it is not required to be full sized)
The queue has a relatively strict limit on size. Once that size has been reached it needs to start outputting numbers
When creating one of the runs it will consume as much of the unsorted list as possible to provide best chance to find the next number in the run.
Observed Result
This sort is based on pivot sort. We just allow multiple processors to help.
It works with several observations:
1. We can statistically gather a sample and then approximate what an ideal pivot would be.
2. The sooner we can get an additional core to help the better. We do not want only one core to act as the dispatcher.
3. The fewer Joins the better. We want each core to be able to complete it's work without interruption.
When a core gets the job of sorting a range in the array it will immediately verify it has permission to spit the work by looking to see if its magnitude is greater than zero and that the work is big enough to justify being split.
If it does have permission to split the work it will collect a sample and find an approximate middle pivot and then split the work into two sets. It will then give one of the sets to another thread along with all relevant information. If it still has permission to split the work further it will continue doing so.
When the work is done it will wait for all the threads it asked for help from to return and then return itself.
This sort has similar advantages to Quick Sort but is much faster on machines with multiple cores.
Notes on Implimintation and speed
// to replace later
Debuging information:
did split in 0: 0 524431 999999
made new thread 2
did split in 0: 0 258968 524431
made new thread 1
did split in 2: 524433 773525 999999
made new thread 3
doing quicksort in 2: 524433 773525
doing quicksort in 0: 0 258968
doing quicksort in 1: 258970 524431
doing quicksort in 3: 773527 999999
Conclusion
While experimental this multi threaded quick-sort was able to outperform it's single threaded comparison version.
Implementation
/*
Sorting. This performs multithreaded quicksort
Copyright (C) 2021 Karl Kendall
This program is free software; you can redistribute it and/or
modify it under the terms of the GNU General Public License
as published by the Free Software Foundation; either version 2
of the License.
This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.
You should have received a copy of the GNU General Public License
along with this program; if not, write to the Free Software
Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301, USA.
*/
5/11/2022