Heap's algorithm

From HandWiki
Short description: Method of generating all permutations of n objects
A map of the 24 permutations and the 23 swaps used in Heap's algorithm permuting the four letters A (amber), B (blue), C (cyan) and D (dark red)
Wheel diagram of all permutations of length [math]\displaystyle{ n=4 }[/math] generated by Heap's algorithm, where each permutation is color-coded (1=blue, 2=green, 3=yellow, 4=red).

Heap's algorithm generates all possible permutations of n objects. It was first proposed by B. R. Heap in 1963.[1] The algorithm minimizes movement: it generates each permutation from the previous one by interchanging a single pair of elements; the other n−2 elements are not disturbed. In a 1977 review of permutation-generating algorithms, Robert Sedgewick concluded that it was at that time the most effective algorithm for generating permutations by computer.[2]

The sequence of permutations of n objects generated by Heap's algorithm is the beginning of the sequence of permutations of n+1 objects. So there is one infinite sequence of permutations generated by Heap's algorithm (sequence A280318 in the OEIS).

Details of the algorithm

For a collection [math]\displaystyle{ C }[/math] containing n different elements, Heap found a systematic method for choosing at each step a pair of elements to switch in order to produce every possible permutation of these elements exactly once.

Described recursively as a decrease and conquer method, Heap's algorithm operates at each step on the [math]\displaystyle{ k }[/math] initial elements of the collection. Initially [math]\displaystyle{ k=n }[/math] and thereafter [math]\displaystyle{ k\lt n }[/math]. Each step generates the [math]\displaystyle{ k! }[/math] permutations that end with the same [math]\displaystyle{ n-k }[/math] final elements. It does this by calling itself once with the [math]\displaystyle{ k\text{th} }[/math] element unaltered and then [math]\displaystyle{ k-1 }[/math] times with the ([math]\displaystyle{ k\text{th} }[/math]) element exchanged for each of the initial [math]\displaystyle{ k-1 }[/math] elements. The recursive calls modify the initial [math]\displaystyle{ k-1 }[/math] elements and a rule is needed at each iteration to select which will be exchanged with the last. Heap's method says that this choice can be made by the parity of the number of elements operated on at this step. If [math]\displaystyle{ k }[/math] is even, then the final element is iteratively exchanged with each element index. If [math]\displaystyle{ k }[/math] is odd, the final element is always exchanged with the first.

procedure generate(k : integer, A : array of any):
    if k = 1 then
        output(A)
    else
        // Generate permutations with k-th unaltered
        // Initially k = length(A)
        generate(k - 1, A)

        // Generate permutations for k-th swapped with each k-1 initial
        for i := 0; i < k-1; i += 1 do
            // Swap choice dependent on parity of k (even or odd)
            if k is even then
                swap(A[i], A[k-1]) // zero-indexed, the k-th is at k-1
            else
                swap(A[0], A[k-1])
            end if
            generate(k - 1, A)
        end for
    end if

One can also write the algorithm in a non-recursive format.[3]

procedure generate(n : integer, A : array of any):
    // c is an encoding of the stack state. c[k] encodes the for-loop counter for when generate(k - 1, A) is called
    c : array of int

    for i := 0; i < n; i += 1 do
        c[i] := 0
    end for

    output(A)
    
    // i acts similarly to a stack pointer
    i := 1;
    while i < n do
        if  c[i] < i then
            if i is even then
                swap(A[0], A[i])
            else
                swap(A[c[i]], A[i])
            end if
            output(A)
            // Swap has occurred ending the for-loop. Simulate the increment of the for-loop counter
            c[i] += 1
            // Simulate recursive call reaching the base case by bringing the pointer to the base case analog in the array
            i := 1
        else
            // Calling generate(i+1, A) has ended as the for-loop terminated. Reset the state and simulate popping the stack by incrementing the pointer.
            c[i] := 0
            i += 1
        end if
    end while

Proof

In this proof, we'll use the implementation below as Heap's Algorithm. While it is not optimal (it does not minimize moves, which is described in the section below), the implementation is nevertheless still correct and will produce all permutations. The reason for using the below implementation is that the analysis is easier, and certain patterns can be easily illustrated.

procedure generate(k : integer, A : array of any):
    if k = 1 then
        output(A)
    else
        for i := 0; i < k; i += 1 do
            generate(k - 1, A)
            if k is even then
                swap(A[i], A[k-1])
            else
                swap(A[0], A[k-1])
            end if

        end for
    end if

Claim: If array A has length n, then performing Heap's algorithm will either result in A being "rotated" to the right by 1 (i.e. each element is shifted to the right with the last element occupying the first position) or result in A being unaltered, depending if n is even or odd, respectively.

Basis: The claim above trivially holds true for [math]\displaystyle{ n=1 }[/math] as Heap's algorithm will simply return A unaltered in order.

Induction: Assume the claim holds true for some [math]\displaystyle{ i \geq 1 }[/math]. We will then need to handle two cases for [math]\displaystyle{ i+1 }[/math]: [math]\displaystyle{ i+1 }[/math] is even or odd.

If, for A, [math]\displaystyle{ n=i+1 }[/math] is even, then the subset of the first i elements will remain unaltered after performing Heap's Algorithm on the subarray, as assumed by the induction hypothesis. By performing Heap's Algorithm on the subarray and then performing the swapping operation, in the kth iteration of the for-loop, where [math]\displaystyle{ k \leq i+1 }[/math], the kth element in A will be swapped into the last position of A which can be thought as a kind of "buffer". By swapping the 1st and last element, then swapping 2nd and last, all the way until the nth and last elements are swapped, the array will at last experience a rotation. To illustrate the above, look below for the case [math]\displaystyle{ n = 4 }[/math]

1,2,3,4 ... Original Array
1,2,3,4 ... 1st iteration (Permute subset)
4,2,3,1 ... 1st iteration (Swap 1st element into "buffer")
4,2,3,1 ... 2nd iteration (Permute subset)
4,1,3,2 ... 2nd iteration (Swap 2nd element into "buffer")
4,1,3,2 ... 3rd iteration (Permute subset)
4,1,2,3 ... 3rd iteration (Swap 3rd element into "buffer")
4,1,2,3 ... 4th iteration (Permute subset)
4,1,2,3 ... 4th iteration (Swap 4th element into "buffer") ... The altered array is a rotated version of the original

If, for A, [math]\displaystyle{ n=i+1 }[/math] is odd, then the subset of the first i elements will be rotated after performing Heap's Algorithm on the first i elements. Notice that, after 1 iteration of the for-loop, when performing Heap's Algorithm on A, A is rotated to the right by 1. By the induction hypothesis, it is assumed that the first i elements will rotate. After this rotation, the first element of A will be swapped into the buffer which, when combined with the previous rotation operation, will in essence perform a rotation on the array. Perform this rotation operation n times, and the array will revert to its original state. This is illustrated below for the case [math]\displaystyle{ n = 5 }[/math].

1,2,3,4,5 ... Original Array
4,1,2,3,5 ... 1st iteration (Permute subset/Rotate subset)
5,1,2,3,4 ... 1st iteration (Swap)
3,5,1,2,4 ... 2nd iteration (Permute subset/Rotate subset)
4,5,1,2,3 ... 2nd iteration (Swap)
2,4,5,1,3 ... 3rd iteration (Permute subset/Rotate subset)
3,4,5,1,2 ... 3rd iteration (Swap)
1,3,4,5,2 ... 4th iteration (Permute subset/Rotate subset)
2,3,4,5,1 ... 4th iteration (Swap)
5,2,3,4,1 ... 5th iteration (Permute subset/Rotate subset)
1,2,3,4,5 ... 5th iteration (Swap) ... The final state of the array is in the same order as the original

The induction proof for the claim is now complete, which will now lead to why Heap's Algorithm creates all permutations of array A. Once again we will prove by induction the correctness of Heap's Algorithm.

Basis: Heap's Algorithm trivially permutes an array A of size 1 as outputting A is the one and only permutation of A.

Induction: Assume Heap's Algorithm permutes an array of size i. Using the results from the previous proof, every element of A will be in the "buffer" once when the first i elements are permuted. Because permutations of an array can be made by altering some array A through the removal of an element x from A then tacking on x to each permutation of the altered array, it follows that Heap's Algorithm permutes an array of size [math]\displaystyle{ i+1 }[/math], for the "buffer" in essence holds the removed element, being tacked onto the permutations of the subarray of size i. Because each iteration of Heap's Algorithm has a different element of A occupying the buffer when the subarray is permuted, every permutation is generated as each element of A has a chance to be tacked onto the permutations of the array A without the buffer element.

Frequent mis-implementations

It is tempting to simplify the recursive version given above by reducing the instances of recursive calls. For example, as:

procedure generate(k : integer, A : array of any):
    if k = 1 then
        output(A)
    else

        // Recursively call once for each k
        for i := 0; i < k; i += 1 do
            generate(k - 1, A)
            // swap choice dependent on parity of k (even or odd)
            if k is even then
                // no-op when i == k-1
                swap(A[i], A[k-1])
            else
                // XXX incorrect additional swap when i==k-1
                swap(A[0], A[k-1]) 
            end if

        end for
    end if

This implementation will succeed in producing all permutations but does not minimize movement. As the recursive call-stacks unwind, it results in additional swaps at each level. Half of these will be no-ops of [math]\displaystyle{ A[i] }[/math] and [math]\displaystyle{ A[k-1] }[/math] where [math]\displaystyle{ i==k-1 }[/math] but when [math]\displaystyle{ k }[/math] is odd, it results in additional swaps of the [math]\displaystyle{ kth }[/math] with the [math]\displaystyle{ 0th }[/math] element.

[math]\displaystyle{ n }[/math] [math]\displaystyle{ n!-1 }[/math] swaps additional = swaps [math]\displaystyle{ - (n!-1) }[/math]
1 0 0 0
2 1 1 0
3 5 6 1
4 23 27 4
5 119 140 21
6 719 845 126
7 5039 5922 883
8 40319 47383 7064
9 362879 426456 63577

These additional swaps significantly alter the order of the [math]\displaystyle{ k-1 }[/math] prefix elements.

The additional swaps can be avoided by either adding an additional recursive call before the loop and looping [math]\displaystyle{ k-1 }[/math] times (as above) or looping [math]\displaystyle{ k }[/math] times and checking that [math]\displaystyle{ i }[/math] is less than [math]\displaystyle{ k-1 }[/math] as in:

procedure generate(k : integer, A : array of any):
    if k = 1 then
        output(A)
    else

        // Recursively call once for each k
        for i := 0; i < k; i += 1 do
            generate(k - 1, A)
            // avoid swap when i==k-1
            if (i < k - 1)
                // swap choice dependent on parity of k
                if k is even then
                    swap(A[i], A[k-1])
                else
                    swap(A[0], A[k-1])
                end if
            end if
        end for
    end if

The choice is primarily aesthetic but the latter results in checking the value of [math]\displaystyle{ i }[/math] twice as often.

See also

References

  1. Heap, B. R. (1963). "Permutations by Interchanges". The Computer Journal 6 (3): 293–4. doi:10.1093/comjnl/6.3.293. 
  2. Sedgewick, R. (1977). "Permutation Generation Methods". ACM Computing Surveys 9 (2): 137–164. doi:10.1145/356689.356692. 
  3. Sedgewick, Robert (4 June 2020). "a talk on Permutation Generation Algorithms". https://sedgewick.io/wp-content/uploads/2022/03/2002PermGeneration.pdf.