Sorting Algorithm - Quick Sort
QuickSort is also a Divide And Conquer algorithm with a performance of $O(n * log(n))$, however in the worst case this becomes $O(n^2)$.
How
We start by picking an element which is called the pivot, then we will rearrange the array so that every element smaller than the pivot will be on the left of the pivot and all elements larger than the pivot will be on the right. this operation is called the partitioning.
Now we just recursively apply the above steps to the sub array of elements.
Take the following array:
9 0 8 3 5 4 2 1 6 7
We pick our pivot (the middle element, don't forget the rounding of course), the brackets show the index points.
[9] 0 8 3 (5) 4 2 1 6 [7]
We start from index 0 and index arraySize - 1, we compare the values and swap if needed.
So here we have 9 and 7, 9 > 5 < 7 so we increase the index by 1 for the right side, because 7 is in it's correct spot.
[9] 0 8 3 (5) 4 2 1 [6] 7
Again 9 > 5 < 6, so we increment the right side index
[9] 0 8 3 (5) 4 2 [1] 6 7
We now see that 9 > 5 > 1, which means we should swap 1 and 9. We can also increment the left and the right index.
1 [0] 8 3 (5) 4 [2] 9 6 7
0 < 5 > 2, increment left side.
1 0 [8] 3 (5) 4 [2] 9 6 7
8 > 5 < 2, swap
1 0 2 [3] (5) [4] 8 9 6 7
3 < 5 > 4, increment the left index. This will now become our pivot! so this is the last step.
1 0 2 3 [(5)] [4] 8 9 6 7
5 > 4, swap
1 0 2 3 4 (5) 8 9 6 7
We now have 2 sub arrays, a left one and a right one, run this method recursively on these 2 sub arrays.
On the end we will have our sorted array:
0 1 2 3 4 5 6 7 8 9
Advantages and Disadvantages
Advantages
- Fast overall
Disadvantages
- Not stable
- $O(lg(n))$ extra space
- $\Theta(n^2))$ performance in the worst case
- Not adaptive
Performance
|Worst Case|Average Case|Best Case| |-|-|-| |O(n2|O(n log n)|O(n log n)|
Note that the performance goes towards $O(n)$ using a three-way partition and equal keys.
Best Case
- Both parts are the same size
$T(n) = cn + 2T(\frac{n}{2})$
which translates into:
$T(n) = O(n*lg(n))$
Average Case
- Happens when pivot is in the middle of the elements
The average case is something that is really tricky for quick sort. If we take the previous point about the pivot being in the middle of the elements into consideration, then we get that the execution time becomes:
$$T(n) = cn + \frac{1}{n}\sum^n_{i = 1}(T(i - 1) + T(n - i))$$
Because the most terms appear twice we get:
$$T(n) = cn + \frac{2}{n}\sum^{n - 1}_{i = 0}T(i)$$
Hard sum, but by using full history recurrence we are able to remove it by subtracting it with the same formula off $T(n - 1)$. And multiplying it with n.
$$nT(n) - (n - 1)T(n - 1) = c(2n - 1) + 2T(n - 1)$$
Remove the c:
$$nT(n) = (n + 1)T(n - 1) + 2cn$$
Now we perform the classic technique telescoping after we divided both terms by $n(n + 1)$:
$$\frac{T(n)}{n + 1} = \frac{T(n - 1)}{n} + \frac{2c}{n + 1}$$
$$\frac{T(n - 1)}{n} = \frac{T(n - 2)}{n - 1} + \frac{2c}{n}$$
$$\frac{T(2)}{3} = \frac{T(1)}{2} + \frac{2c}{3}$$
$$\frac{T(n)}{n + 1} = \frac{T(1)}{2} + 2c\sum^n{i = 2}\frac{1}{i + 1} < 2c\sum^n{i = 1} = 2cH_n$$
The last number is the harmonic number, this has the value $ln(n) + O(1)$. This results in $O(n*lg(n))$.
Worst Case
- One of the 2 parts is just one element
$$T(n) = cn + T(1) + T(n - 1)$$ $$T(n - 1) = c(n - 1) + T(1) + T(n - 2)$$ $$T(n - 2) = c(n - 2) + T(1) + T(n - 3)$$ $$T(2) = c(2) + T(1) + T(1)$$
Count everything together: $$T(n) = c(n + (n - 1) + … + 3 + 2) + nT(1)$$
This results into our performance becoming $O(n^2)$ which is just as bad as insertion sort. We can evade this problem if our pivot has been placed correctly.
Implementation
Quick Sort has 2 main steps, the partition step and the recursion step.
In the partition step we will put every element smaller than the pivot at the left, and every element larger than the pivot at the right.
Then we will recursively call our Quick Sort method till we got a sorted array.
C++ Code
void quick_sort(vector &v, int leftIdx, int rightIdx) const {
// Set indexes that we can move
int i = leftIdx;
int j = rightIdx;
// Find pivot
int pivot = v[(leftIdx + rightIdx) / 2];
// Swap container
int tmp;
// PARTITION
while (i <= j) {
// Move indexes till we got a swap point.
while (v[i] < pivot) i++; while (v[j] > pivot)
j--;
// Swap
if (i <= j) {
tmp = v[i];
v[i] = v[j];
v[j] = tmp;
// We swapped so move the indexes
i++;
j--;
}
}
// RECURSION as long as we are not at the end
if (leftIdx < j) quick_sort(v, leftIdx, j); if (rightIdx > i) {
quick_sort(v, i, rightIdx);
}
}
Benchmark
| | 100 | 1.000 | 10.000 | 100.000 | 1.000.000 |-|-|-|-|-|-| |Random Elements|8e-06|0.000105|0.001301|0.015583|0.179599 |Ascending Elements|3e-06|2.6e-05|0.000329|0.004379|0.043418 |Descending Elements|3e-06|2.7e-05|0.000354|0.004304|0.046504
Conclusion
QuickSort is fun to implement and is really fast when we do not hit the worst case scenario, because we are able to prevent hitting this scenario this is a commonly used sorting algorithm. It is so common that the STL library has implemented this as qsort
.
There is also a very famous variant on QuickSort called Three Way Pivot QuickSort, this algorithm is faster since 2009 and is also implemented in Java 7.
Member discussion