This page explains and implements selection sort, bubble sort, merge sort, quick sort, insertion sort, and shell sort.
7. The Shell Sort
The shell sort, sometimes called the "diminishing increment sort," improves on the insertion sort by breaking the original vector into a number of smaller subvectors, each of which is sorted using an insertion sort. The unique way
that these subvectors are chosen is the key to the shell sort. Instead of breaking the vector into subvectors of contiguous items, the shell sort uses an increment i
, sometimes called the gap, to create a subvector
by choosing all items that are i
items apart.
This can be seen in Figure 6. This vector has nine items. If we use an increment of three, there are three subvectors, each of which can be sorted by an insertion sort. After completing these sorts, we get the vector shown in Figure 7. Although this vector is not completely sorted, something very interesting has happened. By sorting the subvectors, we have moved the items closer to where they actually belong.
Figure 8 shows a final insertion sort using an increment of one; in other words, a standard insertion sort. Note that by performing the earlier subvector sorts, we have now reduced the total number of shifting operations necessary to put the vector in its final order. For this case, we need only four more shifts to complete the process.
We said earlier that the way in which the increments are chosen is the unique feature of the shell sort. The function shown in ActiveCode 1 uses a different set of increments. In this case, we begin with subvectors. On the next pass, subvectors are sorted. Eventually, a single vector is sorted with the basic insertion sort. Figure 9 shows the first subvectors for our example using this increment.
The following visualization shows the "gap" attribute in the form of brown, vertical bars. There are marker arrows that portray the "current values" being compared during the sort. It finishes by performing a full insertion sort on the full set of bars.
At first glance you may think that a shell sort cannot be better than an insertion sort, since it does a complete insertion sort as the last step. It turns out, however, that this final insertion sort does not need to do very many comparisons (or shifts) since the list has been pre-sorted by earlier incremental insertion sorts, as described above. In other words, each pass produces a list that is "more sorted" than the previous one. This makes the final pass very efficient.
Although a general analysis of the shell sort is well beyond the scope of this text, we can say that it tends to fall somewhere between and , based on the behavior described above. For the increments shown in Listing 5, the performance is . By changing the increment, for example using (1, 3, 7, 15, 31, and so on), a shell sort can perform at .