RANDU

From HandWiki
Three-dimensional plot of 100,000 values generated with RANDU. Each point represents 3 consecutive pseudorandom values. It is clearly seen that the points fall in 15 two-dimensional planes.

RANDU[1] is a linear congruential pseudorandom number generator (LCG) of the Park–Miller type, which was used primarily in the 1960s and 1970s.[2] It is defined by the recurrence:

[math]\displaystyle{ V_{j+1} = 65539\cdot V_j\, \bmod\, 2^{31}\, }[/math]

with the initial seed number, [math]\displaystyle{ V_0 }[/math] as an odd number. It generates pseudorandom integers [math]\displaystyle{ V_j }[/math] which are uniformly distributed in the interval [1, 231 − 1], but in practical applications are often mapped into pseudorandom rationals [math]\displaystyle{ X_j }[/math] in the interval (0, 1), by the formula:

[math]\displaystyle{ X_j = \frac{V_{j}}{2^{31}} }[/math].

IBM's RANDU is widely considered to be one of the most ill-conceived random number generators ever designed,[3] and was described as "truly horrible" by Donald Knuth.[4] It fails the spectral test badly for dimensions greater than 2 as will be seen below.

The reason for choosing these particular values for the multiplier and modulus had been that with a 32-bit-integer word size, the arithmetic of mod 231 and [math]\displaystyle{ 65539 }[/math] [math]\displaystyle{ (\text{i.e., }2^{16} + 3) }[/math] calculations could be done quickly, using bitwise operators in hardware, but the values were chosen for computational convenience, not statistical quality.

Problems with multiplier and modulus

For any linear congruential generator with modulus m used to generate points in n- dimensional space, the points fall in no more than [math]\displaystyle{ (n!\times m)^{1/n} }[/math] parallel hyperplanes. [5] This indicates that low-modulus LCGs are unsuited to high-dimensional Monte Carlo simulation. For m = 2^31 and n = 3, an LCG could have up to 2344 planes, theoretical maximum. A much tighter upper bound is proved in the same Marsaglia paper to be the sum of the absolute values of all the coefficients of the hyperplanes in standard form. That is, if the hyperplanes are of the form Ax1 + Bx2 + Cx3 = some integer such as 0, 1, 2 etc, then the maximum number of planes is |A|+|B|+|C|.[5]

Now we examine the values of multiplier 65539 and modulus 231 chosen for RANDU. Consider the following calculation where every term should be taken mod 231. Start by writing the recursive relation as:

[math]\displaystyle{ x_{k+2}=(2^{16}+3) x_{k+1}=(2^{16}+3 )^2 x_{k}\, }[/math]

which becomes, after expanding the quadratic factor:

[math]\displaystyle{ x_{k+2}=(2^{32}+6 \cdot2^{16} +9 )x_{k}=[6 \cdot (2^{16}+3)-9]x_{k}\, }[/math]
because 232 mod 231 = 0

and allows us to show the correlation between three points as:

[math]\displaystyle{ x_{k+2}=6x_{k+1}-9x_{k}\, }[/math]

Summing the absolute values of the coefficients, we get no more than 16 planes in 3D, becoming only 15 planes on closer examination as shown in the diagram above. Even by the standards of LCGs, this shows that RANDU is terrible: using RANDU for sampling a unit cube will only sample 15 parallel planes, not even close to the upper limit of [math]\displaystyle{ floor((2^{31}\times 3!)^{1/3}) = 2344 }[/math] planes.

As a result of the wide use of RANDU in the early 1970s, many results from that time are seen as suspicious.[6] This misbehavior was already detected in 1963[7] on a 36-bit computer, and carefully reimplemented[clarification needed] on the 32-bit IBM System/360. It was believed to have been widely purged by the early 1990s[8] but there were still FORTRAN compilers using it as late as 1999.[1]

Sample output

The start of the RANDU's output period for the initial seed [math]\displaystyle{ V_0 = 1 }[/math] is:

1, 65539, 393225, 1769499, 7077969, 26542323, … (sequence A096555 in the OEIS)

References

  1. 1.0 1.1 Compaq Fortran Language Reference Manual (Order Number: AA-Q66SD-TK) September 1999 (formerly DIGITAL Fortran and DEC Fortran 90)
  2. Entacher, Karl (June 2000). "A collection of classical pseudorandom number generators with linear structures - advanced version". http://random.mat.sbg.ac.at/results/karl/server/server.html. 
  3. Knuth D.E. The Art of Computer Programming, Volume 2: Seminumerical Algorithms, Second Edition. Addison-Wesley, 1981. ISBN:0-201-03822-6. Section 3.3.4, p. 104. "its very name RANDU is enough to bring dismay into the eyes and stomachs of many computer scientists!" [Extensive coverage of statistical tests for non-randomness.]
  4. Knuth (1998), p. 188
  5. 5.0 5.1 Marsaglia, George (1968). "Random Numbers Fall Mainly in the Planes". Proc. Natl. Acad. Sci. U.S.A. 61 (1): 25–28. doi:10.1073/pnas.61.1.25. PMID 16591687. Bibcode1968PNAS...61...25M. 
  6. Press, William H. (1992). Numerical Recipes in Fortran 77: The Art of Scientific Computing (2nd ed.). ISBN 0-521-43064-X. 
  7. ref. 7 of http://portal.acm.org/citation.cfm?id=363827
  8. Interview with Donald Knuth

External links