Computable number

From HandWiki
Short description: Real number that can be computed within arbitrary precision
π can be computed to arbitrary precision, while almost every real number is not computable.

In mathematics, computable numbers are the real numbers that can be computed to within any desired precision by a finite, terminating algorithm. They are also known as the recursive numbers, effective numbers[1] or the computable reals or recursive reals.[citation needed] The concept of a computable real number was introduced by Emile Borel in 1912, using the intuitive notion of computability available at the time.[2]

Equivalent definitions can be given using μ-recursive functions, Turing machines, or λ-calculus as the formal representation of algorithms. The computable numbers form a real closed field and can be used in the place of real numbers for many, but not all, mathematical purposes.

Informal definition using a Turing machine as example

In the following, Marvin Minsky defines the numbers to be computed in a manner similar to those defined by Alan Turing in 1936;[3] i.e., as "sequences of digits interpreted as decimal fractions" between 0 and 1:[4]

A computable number [is] one for which there is a Turing machine which, given n on its initial tape, terminates with the nth digit of that number [encoded on its tape].

The key notions in the definition are (1) that some n is specified at the start, (2) for any n the computation only takes a finite number of steps, after which the machine produces the desired output and terminates.

An alternate form of (2) – the machine successively prints all n of the digits on its tape, halting after printing the nth – emphasizes Minsky's observation: (3) That by use of a Turing machine, a finite definition – in the form of the machine's state table – is being used to define what is a potentially infinite string of decimal digits.

This is however not the modern definition which only requires the result be accurate to within any given accuracy. The informal definition above is subject to a rounding problem called the table-maker's dilemma whereas the modern definition is not.

Formal definition

A real number a is computable if it can be approximated by some computable function [math]\displaystyle{ f:\mathbb{N}\to\mathbb{Z} }[/math] in the following manner: given any positive integer n, the function produces an integer f(n) such that:

[math]\displaystyle{ {f(n)-1\over n} \leq a \leq {f(n)+1\over n}. }[/math]

There are two similar definitions that are equivalent:

  • There exists a computable function which, given any positive rational error bound [math]\displaystyle{ \varepsilon }[/math], produces a rational number r such that [math]\displaystyle{ |r - a| \leq \varepsilon. }[/math]
  • There is a computable sequence of rational numbers [math]\displaystyle{ q_i }[/math] converging to [math]\displaystyle{ a }[/math] such that [math]\displaystyle{ |q_i - q_{i+1}| \lt 2^{-i}\, }[/math] for each i.

There is another equivalent definition of computable numbers via computable Dedekind cuts. A computable Dedekind cut is a computable function [math]\displaystyle{ D\; }[/math] which when provided with a rational number [math]\displaystyle{ r }[/math] as input returns [math]\displaystyle{ D(r)=\mathrm{true}\; }[/math] or [math]\displaystyle{ D(r)=\mathrm{false}\; }[/math], satisfying the following conditions:

[math]\displaystyle{ \exists r D(r)=\mathrm{true}\; }[/math]
[math]\displaystyle{ \exists r D(r)=\mathrm{false}\; }[/math]
[math]\displaystyle{ (D(r)=\mathrm{true}) \wedge (D(s)=\mathrm{false}) \Rightarrow r\lt s\; }[/math]
[math]\displaystyle{ D(r)=\mathrm{true} \Rightarrow \exist s\gt r, D(s)=\mathrm{true}.\; }[/math]

An example is given by a program D that defines the cube root of 3. Assuming [math]\displaystyle{ q\gt 0\; }[/math] this is defined by:

[math]\displaystyle{ p^3\lt 3 q^3 \Rightarrow D(p/q)=\mathrm{true}\; }[/math]
[math]\displaystyle{ p^3\gt 3 q^3 \Rightarrow D(p/q)=\mathrm{false}.\; }[/math]

A real number is computable if and only if there is a computable Dedekind cut D corresponding to it. The function D is unique for each computable number (although of course two different programs may provide the same function).

A complex number is called computable if its real and imaginary parts are computable.

Properties

Not computably enumerable

Assigning a Gödel number to each Turing machine definition produces a subset [math]\displaystyle{ S }[/math] of the natural numbers corresponding to the computable numbers and identifies a surjection from [math]\displaystyle{ S }[/math] to the computable numbers. There are only countably many Turing machines, showing that the computable numbers are subcountable. The set [math]\displaystyle{ S }[/math] of these Gödel numbers, however, is not computably enumerable (and consequently, neither are subsets of [math]\displaystyle{ S }[/math] that are defined in terms of it). This is because there is no algorithm to determine which Gödel numbers correspond to Turing machines that produce computable reals. In order to produce a computable real, a Turing machine must compute a total function, but the corresponding decision problem is in Turing degree 0′′. Consequently, there is no surjective computable function from the natural numbers to the set [math]\displaystyle{ S }[/math] of machines representing computable reals, and Cantor's diagonal argument cannot be used constructively to demonstrate uncountably many of them.

While the set of real numbers is uncountable, the set of computable numbers is classically countable and thus almost all real numbers are not computable. Here, for any given computable number [math]\displaystyle{ x, }[/math] the well ordering principle provides that there is a minimal element in [math]\displaystyle{ S }[/math] which corresponds to [math]\displaystyle{ x }[/math], and therefore there exists a subset consisting of the minimal elements, on which the map is a bijection. The inverse of this bijection is an injection into the natural numbers of the computable numbers, proving that they are countable. But, again, this subset is not computable, even though the computable reals are themselves ordered.

Properties as a field

The arithmetical operations on computable numbers are themselves computable in the sense that whenever real numbers a and b are computable then the following real numbers are also computable: a + b, a - b, ab, and a/b if b is nonzero. These operations are actually uniformly computable; for example, there is a Turing machine which on input (A,B,[math]\displaystyle{ \epsilon }[/math]) produces output r, where A is the description of a Turing machine approximating a, B is the description of a Turing machine approximating b, and r is an [math]\displaystyle{ \epsilon }[/math] approximation of a+b.

The fact that computable real numbers form a field was first proved by Henry Gordon Rice in 1954.[5]

Computable reals however do not form a computable field, because the definition of a computable field requires effective equality.

Non-computability of the ordering

The order relation on the computable numbers is not computable. Let A be the description of a Turing machine approximating the number [math]\displaystyle{ a }[/math]. Then there is no Turing machine which on input A outputs "YES" if [math]\displaystyle{ a \gt 0 }[/math] and "NO" if [math]\displaystyle{ a \le 0. }[/math] To see why, suppose the machine described by A keeps outputting 0 as [math]\displaystyle{ \epsilon }[/math] approximations. It is not clear how long to wait before deciding that the machine will never output an approximation which forces a to be positive. Thus the machine will eventually have to guess that the number will equal 0, in order to produce an output; the sequence may later become different from 0. This idea can be used to show that the machine is incorrect on some sequences if it computes a total function. A similar problem occurs when the computable reals are represented as Dedekind cuts. The same holds for the equality relation : the equality test is not computable.

While the full order relation is not computable, the restriction of it to pairs of unequal numbers is computable. That is, there is a program that takes as input two Turing machines A and B approximating numbers [math]\displaystyle{ a }[/math] and [math]\displaystyle{ b }[/math], where [math]\displaystyle{ a \ne b }[/math], and outputs whether [math]\displaystyle{ a \lt b }[/math] or [math]\displaystyle{ a \gt b. }[/math] It is sufficient to use [math]\displaystyle{ \epsilon }[/math]-approximations where [math]\displaystyle{ \epsilon \lt |b-a|/2, }[/math] so by taking increasingly small [math]\displaystyle{ \epsilon }[/math] (approaching 0), one eventually can decide whether [math]\displaystyle{ a \lt b }[/math] or [math]\displaystyle{ a \gt b. }[/math]

Other properties

The computable real numbers do not share all the properties of the real numbers used in analysis. For example, the least upper bound of a bounded increasing computable sequence of computable real numbers need not be a computable real number.[6] A sequence with this property is known as a Specker sequence, as the first construction is due to Ernst Specker in 1949.[7] Despite the existence of counterexamples such as these, parts of calculus and real analysis can be developed in the field of computable numbers, leading to the study of computable analysis.

Every computable number is arithmetically definable, but not vice versa. There are many arithmetically definable, noncomputable real numbers, including:

Both of these examples in fact define an infinite set of definable, uncomputable numbers, one for each Universal Turing machine. A real number is computable if and only if the set of natural numbers it represents (when written in binary and viewed as a characteristic function) is computable.

The set of computable real numbers (as well as every countable, densely ordered subset of computable reals without ends) is order-isomorphic to the set of rational numbers.

Digit strings and the Cantor and Baire spaces

Turing's original paper defined computable numbers as follows:

A real number is computable if its digit sequence can be produced by some algorithm or Turing machine. The algorithm takes an integer [math]\displaystyle{ n \ge 1 }[/math] as input and produces the [math]\displaystyle{ n }[/math]-th digit of the real number's decimal expansion as output.

(The decimal expansion of a only refers to the digits following the decimal point.)

Turing was aware that this definition is equivalent to the [math]\displaystyle{ \epsilon }[/math]-approximation definition given above. The argument proceeds as follows: if a number is computable in the Turing sense, then it is also computable in the [math]\displaystyle{ \epsilon }[/math] sense: if [math]\displaystyle{ n \gt \log_{10} (1/\epsilon) }[/math], then the first n digits of the decimal expansion for a provide an [math]\displaystyle{ \epsilon }[/math] approximation of a. For the converse, we pick an [math]\displaystyle{ \epsilon }[/math] computable real number a and generate increasingly precise approximations until the nth digit after the decimal point is certain. This always generates a decimal expansion equal to a but it may improperly end in an infinite sequence of 9's in which case it must have a finite (and thus computable) proper decimal expansion.

Unless certain topological properties of the real numbers are relevant, it is often more convenient to deal with elements of [math]\displaystyle{ 2^{\omega} }[/math] (total 0,1 valued functions) instead of reals numbers in [math]\displaystyle{ [0,1] }[/math]. The members of [math]\displaystyle{ 2^{\omega} }[/math] can be identified with binary decimal expansions, but since the decimal expansions [math]\displaystyle{ .d_1d_2\ldots d_n0111\ldots }[/math] and [math]\displaystyle{ .d_1d_2\ldots d_n10 }[/math] denote the same real number, the interval [math]\displaystyle{ [0,1] }[/math] can only be bijectively (and homeomorphically under the subset topology) identified with the subset of [math]\displaystyle{ 2^{\omega} }[/math] not ending in all 1's.

Note that this property of decimal expansions means that it is impossible to effectively identify the computable real numbers defined in terms of a decimal expansion and those defined in the [math]\displaystyle{ \epsilon }[/math] approximation sense. Hirst has shown that there is no algorithm which takes as input the description of a Turing machine which produces [math]\displaystyle{ \epsilon }[/math] approximations for the computable number a, and produces as output a Turing machine which enumerates the digits of a in the sense of Turing's definition.[8] Similarly, it means that the arithmetic operations on the computable reals are not effective on their decimal representations as when adding decimal numbers. In order to produce one digit, it may be necessary to look arbitrarily far to the right to determine if there is a carry to the current location. This lack of uniformity is one reason why the contemporary definition of computable numbers uses [math]\displaystyle{ \epsilon }[/math] approximations rather than decimal expansions.

However, from a computability theoretic or measure theoretic perspective, the two structures [math]\displaystyle{ 2^{\omega} }[/math] and [math]\displaystyle{ [0,1] }[/math] are essentially identical. Thus, computability theorists often refer to members of [math]\displaystyle{ 2^{\omega} }[/math] as reals. While [math]\displaystyle{ 2^{\omega} }[/math] is totally disconnected, for questions about [math]\displaystyle{ \Pi^0_1 }[/math] classes or randomness it is easier to work in [math]\displaystyle{ 2^{\omega} }[/math].

Elements of [math]\displaystyle{ \omega^{\omega} }[/math] are sometimes called reals as well and though containing a homeomorphic image of [math]\displaystyle{ \mathbb{R} }[/math], [math]\displaystyle{ \omega^{\omega} }[/math] isn't even locally compact (in addition to being totally disconnected). This leads to genuine differences in the computational properties. For instance the [math]\displaystyle{ x \in \mathbb{R} }[/math] satisfying [math]\displaystyle{ \forall(n \in \omega)\phi(x,n) }[/math], with [math]\displaystyle{ \phi(x,n) }[/math] quantifier free, must be computable while the unique [math]\displaystyle{ x \in \omega^{\omega} }[/math] satisfying a universal formula may have an arbitrarily high position in the hyperarithmetic hierarchy.

Use in place of the reals

The computable numbers include the specific real numbers which appear in practice, including all real algebraic numbers, as well as e, π, and many other transcendental numbers. Though the computable reals exhaust those reals we can calculate or approximate, the assumption that all reals are computable leads to substantially different conclusions about the real numbers. The question naturally arises of whether it is possible to dispose of the full set of reals and use computable numbers for all of mathematics. This idea is appealing from a constructivist point of view, and has been pursued by what Errett Bishop and Fred Richman call the Russian school of constructive mathematics.[citation needed] [9]

To actually develop analysis over computable numbers, some care must be taken. For example, if one uses the classical definition of a sequence, the set of computable numbers is not closed under the basic operation of taking the supremum of a bounded sequence (for example, consider a Specker sequence, see the section above). This difficulty is addressed by considering only sequences which have a computable modulus of convergence. The resulting mathematical theory is called computable analysis.

Implementations of exact arithmetic

Computer packages representing real numbers as programs computing approximations have been proposed as early as 1985, under the name "exact arithmetic".[10] Modern examples include the CoRN library (Coq),[11] and the RealLib package (C++).[12] A related line of work is based on taking a real RAM program and running it with rational or floating-point numbers of sufficient precision, such as the iRRAM package.[13]

See also

Notes

  1. van der Hoeven (2006).
  2. P. Odifreddi, Classical Recursion Theory (1989), p.8. North-Holland, 0-444-87295-7
  3. Turing (1936).
  4. Minsky (1967).
  5. Rice (1954).
  6. Bridges & Richman (1987), p. 58.
  7. Specker (1949).
  8. Hirst (2007).
  9. Zalta, Edward N., ed. (2022), "Russian School of Constructive Mathematics", Constructive Mathematics, Metaphysics Research Lab, Stanford University 
  10. Boehm, Hans-J.; Cartwright, Robert; Riggle, Mark; O'Donnell, Michael J. (8 August 1986). "Exact real arithmetic: A case study in higher order programming". Proceedings of the 1986 ACM conference on LISP and functional programming - LFP '86. pp. 162–173. doi:10.1145/319838.319860. ISBN 0897912004. http://fricas-wiki.math.uni.wroc.pl/public/refs/exact-real-p162-boehm.pdf. 
  11. O’Connor, Russell (2008). "Certified Exact Transcendental Real Number Computation in Coq". Theorem Proving in Higher Order Logics. Lecture Notes in Computer Science. 5170. pp. 246–261. doi:10.1007/978-3-540-71067-7_21. ISBN 978-3-540-71065-3. 
  12. Lambov (2015).
  13. Gowland, Paul; Lester, David (2001). "A Survey of Exact Arithmetic Implementations" (in en). Computability and Complexity in Analysis. Lecture Notes in Computer Science. 2064. Springer. pp. 30–47. doi:10.1007/3-540-45335-0_3. ISBN 978-3-540-42197-9. https://link.springer.com/content/pdf/10.1007%2F3-540-45335-0_3.pdf. 

References

Further reading