Sieve of Pritchard

Short description: An algorithm for generating prime numbers
Sieve of Pritchard: algorithm steps for primes up to 150

In mathematics, the sieve of Pritchard is an algorithm for finding all prime numbers up to a specified bound. Like the ancient sieve of Eratosthenes, it has a simple conceptual basis in number theory.[1] It is especially suited to quick hand computation for small bounds.

Whereas the sieve of Eratosthenes marks off each non-prime for each of its prime factors, the sieve of Pritchard avoids considering almost all non-prime numbers by building progressively larger wheels, which represent the pattern of numbers not divisible by any of the primes processed thus far. It thereby achieves a better asymptotic complexity, and was the first sieve with a running time sublinear in the specified bound. Its asymptotic running-time has not been improved on, and it deletes fewer composites than any other known sieve. It was created in 1979 by Paul Pritchard.[2]

Since Pritchard has created a number of other sieve algorithms for finding prime numbers,[3][4][5] the sieve of Pritchard is sometimes singled out by being called the wheel sieve (by Pritchard himself[1]) or the dynamic wheel sieve.[6]

Overview

A prime number is a natural number that has no natural number divisors other than the number [math]\displaystyle{ 1 }[/math] and itself.

To find all the prime numbers less than or equal to a given integer [math]\displaystyle{ N }[/math], a sieve algorithm examines a set of candidates in the range [math]\displaystyle{ 2,3,...,N }[/math], and eliminates those that are not prime, leaving the primes at the end. The sieve of Eratosthenes examines all of the range, first removing all multiples of the first prime [math]\displaystyle{ 2 }[/math], then of the next prime [math]\displaystyle{ 3 }[/math], and so on. The sieve of Pritchard instead examines a subset of the range consisting of numbers that occur on successive wheels, which represent the pattern of numbers left after each successive prime is processed by the sieve of Eratosthenes.

For [math]\displaystyle{ i\gt 0 }[/math] the [math]\displaystyle{ i }[/math]th wheel [math]\displaystyle{ W_i }[/math] represents this pattern. It is the set of numbers between [math]\displaystyle{ 1 }[/math] and the product [math]\displaystyle{ P_i=p_1*p_2*...*p_i }[/math] of the first [math]\displaystyle{ i }[/math] prime numbers that are not divisible by any of these prime numbers (and is said to have an associated length [math]\displaystyle{ P_i }[/math]). The pattern repeats with period [math]\displaystyle{ P_i }[/math] because adding [math]\displaystyle{ P_i }[/math] to a number doesn't change whether or not it is divisible by one of the first [math]\displaystyle{ i }[/math] prime numbers, since the remainder on division by any one of these primes is unchanged.

So [math]\displaystyle{ W_1=\{1\} }[/math] with length [math]\displaystyle{ P_1=2 }[/math] represents the pattern of odd numbers; [math]\displaystyle{ W_2=\{1,5\} }[/math] with length [math]\displaystyle{ P_2=6 }[/math] represents the pattern of numbers not divisible by [math]\displaystyle{ 2 }[/math] or [math]\displaystyle{ 3 }[/math]; etc. Wheels are so-called because [math]\displaystyle{ W_i }[/math] can be usefully visualized as a circle of circumference [math]\displaystyle{ P_i }[/math] with its members marked at their corresponding distances from an origin. Then rolling the wheel along the number line marks points corresponding to successive numbers not divisible by one of the first [math]\displaystyle{ i }[/math] prime numbers. The animation shows [math]\displaystyle{ W_2 }[/math] being rolled up to 30.

Rolling the 2nd wheel up to 30.

It's useful to define [math]\displaystyle{ W_i\rightarrow n }[/math] for [math]\displaystyle{ n\gt 0 }[/math] to be the result of rolling [math]\displaystyle{ W_i }[/math] up to [math]\displaystyle{ n }[/math]. Then the animation generates [math]\displaystyle{ W_2\rightarrow 30=\{1,5,7,11,13,17,19,23,25,29\} }[/math]. Note that up to [math]\displaystyle{ 5^2-1=24 }[/math], this consists only of [math]\displaystyle{ 1 }[/math] and the primes between [math]\displaystyle{ 5 }[/math] and [math]\displaystyle{ 25 }[/math].
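The rolling operation can be sketched in a few lines of Python. This is only an illustration: the helper names `wheel` and `roll` are assumptions made here, and the wheel is built by brute force with `gcd` rather than by the sieve itself.

```python
from math import gcd

def wheel(primes):
    """Return (members, length) of the wheel for the given list of primes."""
    length = 1
    for p in primes:
        length *= p
    # Members are the numbers in 1..length sharing no factor with length.
    members = [w for w in range(1, length + 1) if gcd(w, length) == 1]
    return members, length

def roll(members, length, n):
    """Roll the wheel up to n: every w + k*length <= n for w in the wheel."""
    result = []
    base = 0
    while base < n:
        for w in members:
            if base + w > n:
                return result
            result.append(base + w)
        base += length
    return result

w2, p2 = wheel([2, 3])
print(w2, p2)            # [1, 5] 6
print(roll(w2, p2, 30))  # [1, 5, 7, 11, 13, 17, 19, 23, 25, 29]
```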

The sieve of Pritchard is derived from the observation[1] that this holds generally: for all [math]\displaystyle{ i\gt 0 }[/math], the values in [math]\displaystyle{ W_i\rightarrow {(p_{i+1}^2-1)} }[/math] are [math]\displaystyle{ 1 }[/math] and the primes between [math]\displaystyle{ p_{i+1} }[/math] and [math]\displaystyle{ p_{i+1}^2 }[/math]. It even holds for [math]\displaystyle{ i=0 }[/math], where the wheel has length [math]\displaystyle{ 1 }[/math] and contains just [math]\displaystyle{ 1 }[/math] (representing all the natural numbers). So the sieve of Pritchard starts with the trivial wheel [math]\displaystyle{ W_0 }[/math] and builds successive wheels until the square of the wheel's first member after [math]\displaystyle{ 1 }[/math] exceeds [math]\displaystyle{ N }[/math]. Wheels grow very quickly, but only their values up to [math]\displaystyle{ N }[/math] are needed and generated.
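The observation can be checked numerically for the first few wheels. The sketch below is a brute-force check, not part of the sieve; `rolled_wheel`, `is_prime` and `observation_holds` are hypothetical helper names introduced for the illustration.

```python
def is_prime(n):
    """Trial division, adequate for the small values checked here."""
    if n < 2:
        return False
    d = 2
    while d * d <= n:
        if n % d == 0:
            return False
        d += 1
    return True

def rolled_wheel(primes, n):
    """Numbers in 1..n not divisible by any of the given primes (the wheel rolled up to n)."""
    return [x for x in range(1, n + 1) if all(x % p for p in primes)]

def observation_holds(first_primes):
    """Check that rolling wheel i up to p^2 - 1 (p the next prime) gives 1 and the primes from p."""
    for i, p_next in enumerate(first_primes):
        bound = p_next ** 2 - 1
        rolled = rolled_wheel(first_primes[:i], bound)
        expected = [1] + [x for x in range(p_next, bound + 1) if is_prime(x)]
        if rolled != expected:
            return False
    return True

print(observation_holds([2, 3, 5, 7, 11, 13]))  # True
```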

It remains to find a method for generating the next wheel. Note in the animation that [math]\displaystyle{ W_3=\{1,5,7,11,13,17,19,23,25,29\}-\{5*1,5*5\} }[/math] can be obtained by rolling [math]\displaystyle{ W_2 }[/math] up to [math]\displaystyle{ 30 }[/math] and then removing [math]\displaystyle{ 5 }[/math] times each member of [math]\displaystyle{ W_2 }[/math]. This also holds generally: for all [math]\displaystyle{ i\geq 0 }[/math], [math]\displaystyle{ W_{i+1} = (W_i\rightarrow P_{i+1}) - \{p_{i+1}*w|w\in W_i\} }[/math].[1] Rolling [math]\displaystyle{ W_i }[/math] past [math]\displaystyle{ P_i }[/math] just adds values to [math]\displaystyle{ W_i }[/math], so the current wheel is first extended by getting each successive member starting with [math]\displaystyle{ w=1 }[/math], adding [math]\displaystyle{ P_i }[/math] to it, and inserting the result in the set. Then the multiples of [math]\displaystyle{ p_{i+1} }[/math] are deleted. Care must be taken to avoid a number being deleted that itself needs to be multiplied by [math]\displaystyle{ p_{i+1} }[/math]. The sieve of Pritchard as originally presented[2] does so by first skipping past successive members until finding the maximum one needed, and then doing the deletions in reverse order by working back through the set. This is the method used in the first animation above. A simpler approach is just to gather the multiples of [math]\displaystyle{ p_{i+1} }[/math] in a list, and then delete them.[7] Another approach is given by Gries and Misra.[8]
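The recurrence for the next wheel can be sketched directly with Python lists. This version follows the simpler "gather the multiples, then delete them" approach rather than the reverse-order deletion of the original presentation; the function name `next_wheel` is an assumption made for the example.

```python
def next_wheel(members, length, p):
    """Build the next wheel from the sorted list `members` (wheel of length `length`)
    and p, the next prime: roll up to p*length, then drop p times each old member."""
    new_length = p * length
    # Rolling up to p*length yields p shifted copies of the current wheel.
    rolled = [k * length + w for k in range(p) for w in members]
    # Gather the multiples first so no needed member is deleted too early.
    multiples = {p * w for w in members}
    return [x for x in rolled if x not in multiples], new_length

w, length = [1], 1                 # wheel 0
for p in (2, 3, 5):
    w, length = next_wheel(w, length, p)
    print(length, w)
# 2 [1]
# 6 [1, 5]
# 30 [1, 7, 11, 13, 17, 19, 23, 29]
```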

If the main loop terminates with a wheel whose length is less than [math]\displaystyle{ N }[/math], it is extended up to [math]\displaystyle{ N }[/math] to generate the remaining primes.

The algorithm, for finding all primes up to N, is therefore as follows:

  1. Start with a set W={1} and length=1 representing wheel 0, and prime p=2.
  2. As long as p^2 <= N, do the following
    1. if length < N then
      1. extend W by repeatedly getting successive members w of W starting with 1 and inserting length+w into W as long as it doesn't exceed p*length or N;
      2. increase length to the minimum of p*length and N.
    2. repeatedly delete p times each member of W by first finding the largest <= length and then working backwards.
    3. note the prime p, then set p to the next member of W after 1 (or 3 if p was 2).
  3. if length < N then extend W to N by repeatedly getting successive members w of W starting with 1 and inserting length+w into W as long as it doesn't exceed N;
  4. On termination, the rest of the primes up to N are the members of W after 1.
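The steps above can be sketched in Python as follows. This is a readable illustration, not an efficient implementation: W is kept as a plain sorted list and the multiples are gathered in a set before deletion, so the sketch does not achieve the sublinear running time. The name `pritchard_primes` is an assumption.

```python
def pritchard_primes(n):
    """Primes up to n, following the numbered steps with W as a sorted list."""
    if n < 2:
        return []
    wheel, length, p = [1], 1, 2          # step 1: wheel 0 and first prime
    primes = []
    while p * p <= n:                     # step 2
        if length < n:
            new_length = min(p * length, n)
            # Step 2.1.1: add `length` to successive members, including new ones.
            i = 0
            while wheel[i] + length <= new_length:
                wheel.append(wheel[i] + length)
                i += 1
            length = new_length           # step 2.1.2
        # Step 2.2: delete p times each member (gathered first, then removed).
        multiples = {p * w for w in wheel if p * w <= length}
        wheel = [w for w in wheel if w not in multiples]
        primes.append(p)                  # step 2.3: note p, pick the next one
        p = wheel[1] if len(wheel) > 1 else 3
    if length < n:                        # step 3: final extension up to n
        i = 0
        while wheel[i] + length <= n:
            wheel.append(wheel[i] + length)
            i += 1
    return primes + wheel[1:]             # step 4: members of W after 1

print(pritchard_primes(30))  # [2, 3, 5, 7, 11, 13, 17, 19, 23, 29]
```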

Example

To find all the prime numbers less than or equal to 150, proceed as follows.

Start with wheel 0 with length 1, representing all natural numbers 1, 2, 3...:

  1

The first number after 1 for wheel 0 (when rolled) is 2; note it as a prime. Now form wheel 1 with length 2x1=2 by first extending wheel 0 up to 2 and then deleting 2 times each number in wheel 0, to get:

  1 2 (2 deleted)

The first number after 1 for wheel 1 (when rolled) is 3; note it as a prime. Now form wheel 2 with length 3x2=6 by first extending wheel 1 up to 6 and then deleting 3 times each number in wheel 1, to get

  1 2 3 5 (2 and 3 deleted)

The first number after 1 for wheel 2 is 5; note it as a prime. Now form wheel 3 with length 5x6=30 by first extending wheel 2 up to 30 and then deleting 5 times each number in wheel 2 (in reverse order!), to get

  1 2 3 5 7 11 13 17 19 23 25 29 (2, 3, 5 and 25 deleted)

The first number after 1 for wheel 3 is 7; note it as a prime. Now wheel 4 has length 7x30=210, so we only extend wheel 3 up to our limit 150. (No further extending will be done now that the limit has been reached.) We then delete 7 times each number in wheel 3 until we exceed our limit 150, to get the elements in wheel 4 up to 150:

  1 2 3 5 7 11 13 17 19 23 25 29 31 37 41 43 47 49 53 59 61 67 71 73 77 79 83 89 91 97 101 103 107 109 113 119 121 127 131 133 137 139 143 149 (2, 3, 5, 7, 25, 49, 77, 91, 119 and 133 deleted)

The first number after 1 for this partial wheel 4 is 11; note it as a prime. Since we've finished with rolling, we delete 11 times each number in the partial wheel 4 until we exceed our limit 150, to get the elements in wheel 5 up to 150:

  1 2 3 5 7 11 13 17 19 23 25 29 31 37 41 43 47 49 53 59 61 67 71 73 77 79 83 89 91 97 101 103 107 109 113 119 121 127 131 133 137 139 143 149 (2, 3, 5, 7, 11, 25, 49, 77, 91, 119, 121, 133 and 143 deleted)

The first number after 1 for this partial wheel 5 is 13. Since 13 squared is greater than our limit 150, we stop. The remaining numbers (other than 1) are the rest of the primes up to our limit 150.

Just 8 composite numbers are removed, once each. The rest of the numbers considered (other than 1) are prime. In comparison, the natural version of the sieve of Eratosthenes (stopping at the same point) removes composite numbers 184 times.
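Both counts can be reproduced with a short Python sketch. It assumes that the "natural version" of the sieve of Eratosthenes crosses off 2p, 3p, ... for each prime p with p^2 <= N; the function names are hypothetical, and the wheel is held in a plain list for clarity rather than speed.

```python
def pritchard_composite_removals(n):
    """Count composites deleted by the wheel sieve for bound n."""
    wheel, length, p = [1], 1, 2
    removed = 0
    while p * p <= n:
        if length < n:
            new_length = min(p * length, n)
            i = 0
            while wheel[i] + length <= new_length:
                wheel.append(wheel[i] + length)
                i += 1
            length = new_length
        multiples = {p * w for w in wheel if p * w <= length}
        removed += len(multiples) - 1     # p itself is deleted too, but is prime
        wheel = [w for w in wheel if w not in multiples]
        p = wheel[1] if len(wheel) > 1 else 3
    return removed

def eratosthenes_removals(n):
    """Count cross-offs (with repeats) made by the natural sieve of Eratosthenes."""
    is_candidate = [True] * (n + 1)
    removals = 0
    p = 2
    while p * p <= n:
        if is_candidate[p]:
            for m in range(2 * p, n + 1, p):
                is_candidate[m] = False
                removals += 1
        p += 1
    return removals

print(pritchard_composite_removals(150), eratosthenes_removals(150))  # 8 184
```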

Pseudocode

The sieve of Pritchard can be expressed in pseudocode, as follows:[1]

algorithm Sieve of Pritchard is
    input: an integer N >= 2.
    output: the set of prime numbers in {1,2,...,N}.

    let W and Pr be sets of integer values, and all other variables integer values.
    k, W, length, p, Pr := 1, {1}, 2, 3, {2};
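    {invariant: p = [math]\displaystyle{ p_{k+1} }[/math] and W = [math]\displaystyle{ W_k \cap \{1,2,...,N\} }[/math] and length = minimum of [math]\displaystyle{ P_k }[/math], N and Pr = the primes up to [math]\displaystyle{ p_k }[/math]}
    while p^2 <= N do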
        if (length < N) then
            Extend W,length to minimum of p*length,N; 
        Delete multiples of p from W; 
        Insert p into Pr; 
        k, p := k+1, next(W, 1) 
    if (length < N) then
        Extend W,length to N;

    return Pr [math]\displaystyle{ \cup }[/math] W - {1};

where next(W, w) is the next value in the ordered set W after w.

procedure Extend W,length to n is
    {in: W = [math]\displaystyle{ W_k }[/math] and length = [math]\displaystyle{ P_k }[/math] and n > length}
    {out: W = [math]\displaystyle{ W_k\rightarrow n }[/math] and length = n}

    integer w, x;
    w, x := 1, length+1;
    while x <= n do
        Insert x into W;
        w := next(W,w);
        x := length + w;
    length := n;

procedure Delete multiples of p from W,length is
    integer w;
    w := p;
    while p*w <= length do
        w := next(W,w);
    while w > 1 do
        w := prev(W,w);
        Remove p*w from W;

where prev(W, w) is the previous value in the ordered set W before w. The algorithm can be initialized with [math]\displaystyle{ W_0 }[/math] instead of [math]\displaystyle{ W_1 }[/math] at the cost of the minor complication of making next(W,1) a special case when k = 0.

This abstract algorithm uses ordered sets supporting the operations of insertion of a value greater than the maximum, deletion of a member, getting the next value after a member, and getting the previous value before a member. Using one of Mertens' theorems (the third) it can be shown to use [math]\displaystyle{ O(N/\log\log N) }[/math] of these operations and additions and multiplications.[2]

Implementation

An array-based doubly-linked list s can be used to implement the ordered set W, with s[w] storing next(W,w) and s[w-1] storing prev(W,w). This permits each abstract operation to be implemented in a small number of operations. (The array can also be used to store the set Pr "for free".) Therefore the time complexity of the sieve of Pritchard to calculate the primes up to [math]\displaystyle{ N }[/math] in the random access machine model is [math]\displaystyle{ O(N/\log\log N) }[/math] operations on words of size [math]\displaystyle{ O(\log N) }[/math]. Pritchard also shows how multiplications can be eliminated by using very small multiplication tables,[2] so the bit complexity is [math]\displaystyle{ O(N\log N/\log\log N) }[/math] bit operations.
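A minimal Python sketch of this array-based representation is given below, assuming the same starting point as the pseudocode (wheel 1 with the prime 2 already noted). The variable `max_w` and the helper names are assumptions of this sketch; it is meant to illustrate the data structure, not to reproduce Pritchard's published code or its multiplication-free refinements.

```python
def sieve_of_pritchard(n):
    """Primes up to n, with W stored in an array-based doubly-linked list."""
    if n < 2:
        return []
    s = [0] * (n + 2)     # s[w] = next(W, w) and s[w - 1] = prev(W, w)
    max_w = 1             # current largest member of W
    length = 2
    p = 3
    primes = [2]

    def extend(target):
        nonlocal max_w, length
        w, x = 1, length + 1
        while x <= target:
            s[max_w] = x              # append x after the current maximum
            s[x - 1] = max_w
            max_w = x
            w = s[w]                  # next member of W
            x = length + w
        length = target

    def delete_multiples(p):
        nonlocal max_w
        w = p
        while p * w <= length:        # skip forward to the first w with p*w > length
            w = s[w]
        while w > 1:                  # then delete p*w working backwards
            w = s[w - 1]
            x = p * w
            prev_x, next_x = s[x - 1], s[x]
            if x == max_w:
                max_w = prev_x        # removing the tail of the list
            else:
                s[prev_x] = next_x
                s[next_x - 1] = prev_x

    while p * p <= n:
        if length < n:
            extend(min(p * length, n))
        delete_multiples(p)
        primes.append(p)
        p = s[1]                      # next(W, 1)
    if length < n:
        extend(n)
    w = 1
    while w != max_w:                 # remaining members of W after 1 are prime
        w = s[w]
        primes.append(w)
    return primes

print(sieve_of_pritchard(50))
```

The two pointers never collide because every member of W other than 1 is odd, so the slot `s[w - 1]` belongs to an even number that is never a member.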

In the same model, the space complexity is [math]\displaystyle{ O(N) }[/math] words, i.e., [math]\displaystyle{ O(N\log N) }[/math] bits. The sieve of Eratosthenes requires only 1 bit for each candidate in the range 2 through [math]\displaystyle{ N }[/math], so its space complexity is lower at [math]\displaystyle{ O(N) }[/math] bits. Note that space needed for the primes is not counted, since they can be printed or written to external storage as they are found. Pritchard[2] presented a variant of his sieve that requires only [math]\displaystyle{ O(N/\log\log N) }[/math] bits without compromising the sublinear time complexity, making it asymptotically superior to the natural version of the sieve of Eratosthenes in both time and space.

However, the sieve of Eratosthenes can be optimized to require much less memory by operating on successive segments of the natural numbers.[9] Its space complexity can be reduced to [math]\displaystyle{ O(\sqrt N) }[/math] bits without increasing its time complexity.[3] This means that in practice it can be used for much larger limits [math]\displaystyle{ N }[/math] than would otherwise fit in memory, and also take advantage of fast cache memory. For maximum speed it is also optimized using a small wheel to avoid sieving with the first few primes (although this does not change its asymptotic time complexity). Therefore the sieve of Pritchard is not competitive as a practical sieve over sufficiently large ranges.
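For comparison, a segmented sieve of Eratosthenes can be sketched as follows. This is a generic textbook-style illustration, not the specific algorithm of the cited papers; the function name and the default segment size of about the square root of N are assumptions, and a byte per candidate is used instead of a bit for simplicity.

```python
from math import isqrt

def segmented_sieve(n, segment_size=None):
    """Primes up to n, sieving one segment of candidates at a time."""
    if n < 2:
        return []
    limit = isqrt(n)
    # Ordinary sieve for the base primes up to sqrt(n).
    base = bytearray([1]) * (limit + 1)
    base[0:2] = b"\x00\x00"
    for i in range(2, isqrt(limit) + 1):
        if base[i]:
            for m in range(i * i, limit + 1, i):
                base[m] = 0
    base_primes = [i for i in range(2, limit + 1) if base[i]]

    primes = list(base_primes)
    size = segment_size or max(limit, 1)
    low = limit + 1
    while low <= n:
        high = min(low + size - 1, n)
        segment = bytearray([1]) * (high - low + 1)
        for p in base_primes:
            # First multiple of p inside the segment, but never below p*p.
            start = max(p * p, ((low + p - 1) // p) * p)
            for m in range(start, high + 1, p):
                segment[m - low] = 0
        primes.extend(low + i for i, flag in enumerate(segment) if flag)
        low = high + 1
    return primes

print(segmented_sieve(60, segment_size=10))
```

Only the base primes and one segment are held in memory at a time, which is what allows the working set to stay within fast cache.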

Geometric model

Generating successive wheels up to [math]\displaystyle{ W_3 }[/math]

At the heart of the sieve of Pritchard is an algorithm for building successive wheels. It has a simple geometric model as follows:

  1. Start with a circle of circumference 1 with a mark at 1
  2. To generate the next wheel:
    1. Go around the wheel and find (the distance to) the first mark after 1; call it p
    2. Create a new circle with p times the circumference of the current wheel
    3. Roll the current wheel around the new circle, marking it where a mark touches it
    4. Magnify the current wheel by p and remove the marks that coincide

Note that for the first 2 iterations it is necessary to continue round the circle until 1 is reached again.

The first circle represents [math]\displaystyle{ W_0=\{1\} }[/math], and successive circles represent wheels [math]\displaystyle{ W_1, W_2,... }[/math]. The animation on the right shows this model in action up to [math]\displaystyle{ W_3 }[/math].

It is apparent from the model that wheels are symmetric. This is because [math]\displaystyle{ P_k-w }[/math] is not divisible by one of the first [math]\displaystyle{ k }[/math] primes if and only if [math]\displaystyle{ w }[/math] is not so divisible. It is possible to exploit this to avoid processing some composites, but at the cost of a more complex algorithm.
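The symmetry is easy to confirm by brute force for small wheels. The sketch below builds each wheel with `gcd` and checks that subtracting a member from the wheel's length gives another member; `wheel_members` and `is_symmetric` are hypothetical helper names.

```python
from math import gcd

def wheel_members(length):
    """Members of the wheel whose length is the given product of initial primes."""
    return [w for w in range(1, length + 1) if gcd(w, length) == 1]

def is_symmetric(length):
    """True if length - w is a member whenever w is."""
    members = set(wheel_members(length))
    return all(length - w in members for w in members)

for length in (2, 6, 30, 210, 2310):
    print(length, is_symmetric(length))   # True for each length
```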

Related sieves

Once the wheel in the sieve of Pritchard reaches its maximum size, the remaining operations are equivalent to those performed by Euler's sieve.

The sieve of Pritchard is unique in conflating the set of prime candidates with a dynamic wheel used to speed up the sifting process. But a separate static wheel (as frequently used to speed up the sieve of Eratosthenes) can give an [math]\displaystyle{ O(\log\log N) }[/math] speedup to the latter, or to linear sieves, provided it is large enough (as a function of [math]\displaystyle{ N }[/math]). Examples are the use of the largest wheel of length not exceeding [math]\displaystyle{ \sqrt{N}/\log^{2}N }[/math] to get a version of the sieve of Eratosthenes that takes [math]\displaystyle{ O(N) }[/math] additions and requires only [math]\displaystyle{ O(\sqrt N/\log\log N) }[/math] bits,[3] and the speedup of the naturally linear sieve of Atkin to get a sublinear optimized version.
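As a small illustration of a static wheel, the following sketch applies the fixed wheel of length 30 to the sieve of Eratosthenes. The wheel size is fixed here purely for readability (the asymptotic speedup mentioned above needs a wheel that grows with N), and the function name is an assumption.

```python
from math import isqrt

def eratosthenes_with_wheel30(n):
    """Sieve of Eratosthenes restricted to candidates coprime to 2, 3 and 5."""
    if n < 2:
        return []
    residues = (1, 7, 11, 13, 17, 19, 23, 29)    # the wheel of length 30
    is_candidate = bytearray(n + 1)
    # Roll the static wheel along 1..n to mark the only candidates considered.
    for base in range(0, n + 1, 30):
        for r in residues:
            if base + r <= n:
                is_candidate[base + r] = 1
    is_candidate[1] = 0
    # Sieve only with primes from 7 upwards; even multiples are never candidates.
    for p in range(7, isqrt(n) + 1):
        if is_candidate[p]:
            for m in range(p * p, n + 1, 2 * p):
                is_candidate[m] = 0
    small = [q for q in (2, 3, 5) if q <= n]
    return small + [i for i in range(7, n + 1) if is_candidate[i]]

print(eratosthenes_with_wheel30(100))
```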

Bengelloun found a linear smoothly incremental sieve,[10] i.e., one that (in theory) can run indefinitely and takes a bounded number of operations to increment the current bound [math]\displaystyle{ N }[/math]. He also showed how to make it sublinear by adapting the sieve of Pritchard to incrementally build the next dynamic wheel while the current one is being used. Pritchard[5] showed how to avoid multiplications, thereby obtaining the same asymptotic bit-complexity as the sieve of Pritchard.

Runciman provides a functional algorithm[11] inspired by the sieve of Pritchard.

References

  1. Pritchard, Paul (1982). "Explaining the Wheel Sieve". Acta Informatica 17 (4): 477–485. doi:10.1007/BF00264164.
  2. Pritchard, Paul (1981). "A Sublinear Additive Sieve for Finding Prime Numbers". Communications of the ACM 24 (1): 18–23. doi:10.1145/358527.358540.
  3. Pritchard, Paul (1983). "Fast Compact Prime Number Sieves (Among Others)". Journal of Algorithms 4 (4): 332–344. doi:10.1016/0196-6774(83)90014-7.
  4. Pritchard, Paul (1987). "Linear prime-number sieves: A family tree". Science of Computer Programming 9 (1): 17–35. doi:10.1016/0167-6423(87)90024-4. 
  5. Pritchard, Paul (1980). "On the prime example of programming". Language Design and Programming Methodology. Lecture Notes in Computer Science. 877. pp. 280–288. doi:10.1007/3-540-09745-7_5. ISBN 978-3-540-09745-7.
  6. Dunten, Brian; Jones, Julie; Sorenson, Jonathan (1996). "A Space-Efficient Fast Prime Number Sieve". Information Processing Letters 59 (2): 79–84. doi:10.1016/0020-0190(96)00099-3. 
  7. Mairson, Harry G. (1977). "Some new upper bounds on the generation of prime numbers". Communications of the ACM 20 (9): 664–669. doi:10.1145/359810.359838. 
  8. Gries, David; Misra, Jayadev (1978). "A linear sieve algorithm for finding prime numbers". Communications of the ACM 21 (12): 999–1003. doi:10.1145/359657.359660. 
  9. Bays, Carter; Hudson, Richard H. (1977). "The segmented sieve of Eratosthenes and primes in arithmetic progressions to 10^12". BIT 17 (2): 121–127. doi:10.1007/BF01932283.
  10. Bengelloun, S. A. (2004). "An incremental primal sieve". Acta Informatica 23 (2): 119–125. doi:10.1007/BF00289493. 
  11. Runciman, C. (1997). "Lazy Wheel Sieves and Spirals of Primes". Journal of Functional Programming 7 (2): 219–225. doi:10.1017/S0956796897002670. https://eprints.whiterose.ac.uk/3784/1/runcimanc1.pdf.