Kleene's recursion theorem
In computability theory, Kleene's recursion theorems are a pair of fundamental results about the application of computable functions to their own descriptions. The theorems were first proved by Stephen Kleene in 1938[1] and appear in his 1952 book Introduction to Metamathematics.[2] A related theorem, which constructs fixed points of a computable function, is known as Rogers's theorem and is due to Hartley Rogers, Jr.[3]
The recursion theorems can be applied to construct fixed points of certain operations on computable functions, to generate quines, and to construct functions defined via recursive definitions.
Notation
The statement of the theorems refers to an admissible numbering [math]\displaystyle{ \varphi }[/math] of the partial recursive functions, such that the function corresponding to index [math]\displaystyle{ e }[/math] is [math]\displaystyle{ \varphi_e }[/math].
If [math]\displaystyle{ F }[/math] and [math]\displaystyle{ G }[/math] are partial functions on the natural numbers, the notation [math]\displaystyle{ F \simeq G }[/math] indicates that, for each n, either [math]\displaystyle{ F(n) }[/math] and [math]\displaystyle{ G(n) }[/math] are both defined and are equal, or else [math]\displaystyle{ F(n) }[/math] and [math]\displaystyle{ G(n) }[/math] are both undefined.
Rogers's fixed-point theorem
Given a function [math]\displaystyle{ F }[/math], a fixed point of [math]\displaystyle{ F }[/math] is an index [math]\displaystyle{ e }[/math] such that [math]\displaystyle{ \varphi_e \simeq \varphi_{F(e)} }[/math]. Note that [math]\displaystyle{ e }[/math] is not required to equal [math]\displaystyle{ F(e) }[/math] as a number; the indices are compared in terms of the partial functions they describe, not their numerical values.
Rogers describes the following result as "a simpler version" of Kleene's (second) recursion theorem.[4]
Rogers's fixed-point theorem — If [math]\displaystyle{ F }[/math] is a total computable function, it has a fixed point in the above sense.
Proof of the fixed-point theorem
The proof uses a particular total computable function [math]\displaystyle{ h }[/math], defined as follows. Given a natural number [math]\displaystyle{ x }[/math], the function [math]\displaystyle{ h }[/math] outputs the index of the partial computable function that performs the following computation:
- Given an input [math]\displaystyle{ y }[/math], first attempt to compute [math]\displaystyle{ \varphi_{x}(x) }[/math]. If that computation returns an output [math]\displaystyle{ e }[/math], then compute [math]\displaystyle{ \varphi_e(y) }[/math] and return its value, if any.
Thus, for all indices [math]\displaystyle{ x }[/math] of partial computable functions, if [math]\displaystyle{ \varphi_x(x) }[/math] is defined, then [math]\displaystyle{ \varphi_{h(x)} \simeq \varphi_{\varphi_x(x)} }[/math]. If [math]\displaystyle{ \varphi_x(x) }[/math] is not defined, then [math]\displaystyle{ \varphi_{h(x)} }[/math] is a function that is nowhere defined. Writing [math]\displaystyle{ g(x,y) }[/math] for the two-argument partial computable function described above (so that [math]\displaystyle{ g(x,y) \simeq \varphi_{\varphi_x(x)}(y) }[/math] whenever [math]\displaystyle{ \varphi_x(x) }[/math] is defined), the function [math]\displaystyle{ h }[/math] can be constructed from [math]\displaystyle{ g }[/math] and the s-m-n theorem: for each [math]\displaystyle{ x }[/math], [math]\displaystyle{ h(x) }[/math] is the index of a program which computes the function [math]\displaystyle{ y \mapsto g(x,y) }[/math].
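One way to make this explicit (the notation [math]\displaystyle{ p_g }[/math] and [math]\displaystyle{ S^1_1 }[/math] is introduced here only for illustration): if [math]\displaystyle{ p_g }[/math] is an index of [math]\displaystyle{ g }[/math] and [math]\displaystyle{ S^1_1 }[/math] is the total computable function provided by the s-m-n theorem, one may take
- [math]\displaystyle{ h(x) = S^1_1(p_g, x), \qquad \text{so that} \qquad \varphi_{h(x)}(y) \simeq g(x,y) \text{ for all } x, y. }[/math]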
To complete the proof, let [math]\displaystyle{ F }[/math] be any total computable function, and construct [math]\displaystyle{ h }[/math] as above. Let [math]\displaystyle{ e }[/math] be an index of the composition [math]\displaystyle{ F \circ h }[/math], which is a total computable function. Then [math]\displaystyle{ \varphi_{h(e)} \simeq \varphi_{\varphi_e(e)} }[/math] by the definition of [math]\displaystyle{ h }[/math]. But, because [math]\displaystyle{ e }[/math] is an index of [math]\displaystyle{ F \circ h }[/math], [math]\displaystyle{ \varphi_e(e) = (F \circ h)(e) }[/math], and thus [math]\displaystyle{ \varphi_{\varphi_e(e)} \simeq \varphi_{F(h(e))} }[/math]. By the transitivity of [math]\displaystyle{ \simeq }[/math], this means [math]\displaystyle{ \varphi_{h(e)} \simeq \varphi_{F(h(e))} }[/math]. Hence [math]\displaystyle{ \varphi_n \simeq \varphi_{F(n)} }[/math] for [math]\displaystyle{ n = h(e) }[/math].
This proof is a construction of a partial recursive function which implements the Y combinator.
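The analogy can be made concrete in a programming language. The following Lisp sketch is an illustration added here (not taken from the cited sources); it assumes Common Lisp, handles only one-argument functions, and obtains recursion from self-application in the same way that [math]\displaystyle{ h }[/math] uses the self-application [math]\displaystyle{ \varphi_x(x) }[/math]:

(defun fix (f)
  ;; Call-by-value Y combinator: returns a fixed point of the functional f.
  ;; The self-application (funcall x x) plays the role of computing phi_x(x).
  (let ((self (lambda (x)
                (funcall f (lambda (n) (funcall (funcall x x) n))))))
    (funcall self self)))

;; Example: factorial obtained as a fixed point of a non-recursive functional.
(defvar *fact*
  (fix (lambda (rec)
         (lambda (n) (if (zerop n) 1 (* n (funcall rec (- n 1))))))))
;; (funcall *fact* 5) => 120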
Fixed-point-free functions
A function [math]\displaystyle{ F }[/math] such that [math]\displaystyle{ \varphi_e \not \simeq \varphi_{F(e)} }[/math] for all [math]\displaystyle{ e }[/math] is called fixed-point free. The fixed-point theorem shows that no total computable function is fixed-point free, but there are many non-computable fixed-point-free functions. Arslanov's completeness criterion states that the only recursively enumerable Turing degree that computes a fixed-point-free function is 0′, the degree of the halting problem.[5]
Kleene's second recursion theorem
The second recursion theorem is a generalization of Rogers's theorem with a second input in the function. One informal interpretation of the second recursion theorem is that it is possible to construct self-referential programs; see "Application to quines" below.
- The second recursion theorem. For any partial recursive function [math]\displaystyle{ Q(x,y) }[/math] there is an index [math]\displaystyle{ p }[/math] such that [math]\displaystyle{ \varphi_p \simeq \lambda y.Q(p,y) }[/math].
The theorem can be proved from Rogers's theorem by letting [math]\displaystyle{ F(p) }[/math] be a function such that [math]\displaystyle{ \varphi_{F(p)}(y) \simeq Q(p,y) }[/math] (a construction described by the s-m-n theorem). One can then verify that a fixed point of this [math]\displaystyle{ F }[/math] is an index [math]\displaystyle{ p }[/math] as required. The theorem is constructive in the sense that a fixed computable function maps an index for [math]\displaystyle{ Q }[/math] to the index [math]\displaystyle{ p }[/math].
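Explicitly, if [math]\displaystyle{ p }[/math] is a fixed point of this [math]\displaystyle{ F }[/math] as provided by Rogers's theorem, then
- [math]\displaystyle{ \varphi_p \simeq \varphi_{F(p)} \simeq \lambda y.Q(p,y), }[/math]
which is exactly the conclusion of the second recursion theorem.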
The theorem can also be stated for total computable functions acting on program indices:
Let [math]\displaystyle{ f: \mathbb{N} \rightarrow \mathbb{N} }[/math] be a total computable function. Then there exists a program [math]\displaystyle{ e_0 \in \mathbb{N} }[/math] such that [math]\displaystyle{ \varphi_{e_0} \simeq \varphi_{f(e_0)} }[/math].
Informally, this means that if an effective transformation is applied to programs (say, one that replaces instructions such as successor and jump, or removes lines), there is always some program whose behaviour is not changed by the transformation: the original program and the transformed program compute the same partial function.
The theorem can therefore be read as follows: given any effective procedure for transforming programs, there is always a program that, when modified by the procedure, does exactly what it did before; equivalently, it is impossible to write a program that changes the extensional behaviour of all other programs.
Comparison to Rogers's theorem
Kleene's second recursion theorem and Rogers's theorem can both be proved, rather simply, from each other.[6] However, a direct proof of Kleene's theorem[7] does not make use of a universal program, which means that the theorem holds for certain subrecursive programming systems that do not have a universal program.
Application to quines
A classic example using the second recursion theorem is the function [math]\displaystyle{ Q(x,y)=x }[/math]. The corresponding index [math]\displaystyle{ p }[/math] in this case yields a computable function that outputs its own index when applied to any value.[8] When expressed as computer programs, such indices are known as quines.
The following example in Lisp illustrates how the [math]\displaystyle{ p }[/math] in the corollary can be effectively produced from the function [math]\displaystyle{ Q }[/math]. The function s11 in the code is the function of that name produced by the s-m-n theorem. Q can be changed to any two-argument function.
(setq Q '(lambda (x y) x))                                   ; Q(x,y) = x
(setq s11 '(lambda (f x) (list 'lambda '(y) (list f x 'y)))) ; s-m-n: fixes the first argument of f
(setq n (list 'lambda '(x y) (list Q (list s11 'x 'x) 'y)))  ; n(x,y) = Q(s11(x,x), y)
(setq p (eval (list s11 n n)))                               ; p = s11(n,n)
The results of the following two expressions should be the same: [math]\displaystyle{ \varphi_p(\text{nil}) }[/math], computed by
(eval (list p nil))
and [math]\displaystyle{ Q(p, \text{nil}) }[/math], computed by
(eval (list Q p nil))
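As a quick sanity check, added here and not part of the original example (it assumes a dynamically scoped Lisp in which a quoted lambda list can be applied as a function, e.g. Emacs Lisp with dynamic binding), both expressions should evaluate to the list p itself:
(equal (eval (list p nil)) p)   ; expected: t
(equal (eval (list Q p nil)) p) ; expected: t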
Application to elimination of recursion
Suppose that [math]\displaystyle{ g }[/math] and [math]\displaystyle{ h }[/math] are total computable functions that are used in a recursive definition for a function [math]\displaystyle{ f }[/math]:
- [math]\displaystyle{ f(0,y) \simeq g(y), }[/math]
- [math]\displaystyle{ f(x+1,y) \simeq h(f(x,y),x,y), }[/math]
The second recursion theorem can be used to show that such equations define a computable function, where the notion of computability does not have to allow, prima facie, for recursive definitions (for example, it may be defined by μ-recursion, or by Turing machines). The recursive definition can be converted into a computable function [math]\displaystyle{ \varphi_{F}(e,x,y) }[/math] that takes an additional argument [math]\displaystyle{ e }[/math], treated as an index of the function being defined, in order to simulate recursion:
- [math]\displaystyle{ \varphi_{F}(e,0,y) \simeq g(y), }[/math]
- [math]\displaystyle{ \varphi_{F}(e,x+1,y) \simeq h(\varphi_e(x,y),x,y). }[/math]
The recursion theorem establishes the existence of a computable function [math]\displaystyle{ \varphi_f }[/math] such that [math]\displaystyle{ \varphi_f(x,y) \simeq \varphi_{F}(f,x,y) }[/math]. Thus [math]\displaystyle{ f }[/math] satisfies the given recursive definition.
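Indeed, substituting [math]\displaystyle{ e = f }[/math] in the equations for [math]\displaystyle{ \varphi_F }[/math] shows that [math]\displaystyle{ \varphi_f }[/math] satisfies the original definition:
- [math]\displaystyle{ \varphi_f(0,y) \simeq \varphi_{F}(f,0,y) \simeq g(y), \qquad \varphi_f(x+1,y) \simeq \varphi_{F}(f,x+1,y) \simeq h(\varphi_f(x,y),x,y). }[/math]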
Reflexive programming
Reflexive, or reflective, programming refers to the usage of self-reference in programs. Jones presents a view of the second recursion theorem based on a reflexive language.[9] It is shown that the reflexive language defined is not stronger than a language without reflection (because an interpreter for the reflexive language can be implemented without using reflection); then, it is shown that the recursion theorem is almost trivial in the reflexive language.
The first recursion theorem
While the second recursion theorem is about fixed points of computable functions, the first recursion theorem is related to fixed points determined by enumeration operators, which are a computable analogue of inductive definitions. An enumeration operator is a set of pairs (A,n) where A is a (code for a) finite set of numbers and n is a single natural number. Often, n will be viewed as a code for an ordered pair of natural numbers, particularly when functions are defined via enumeration operators. Enumeration operators are of central importance in the study of enumeration reducibility.
Each enumeration operator Φ determines a function from sets of naturals to sets of naturals given by
- [math]\displaystyle{ \Phi(X) = \{ n \mid \exists A \subseteq X [(A,n) \in \Phi]\}. }[/math]
A recursive operator is an enumeration operator that, when given the graph of a partial recursive function, always returns the graph of a partial recursive function.
A fixed point of an enumeration operator Φ is a set F such that Φ(F) = F. The first recursion theorem shows that fixed points can be effectively obtained if the enumeration operator itself is computable.
- First recursion theorem. The following statements hold.
- For any computable enumeration operator Φ there is a recursively enumerable set F such that Φ(F) = F and F is the smallest set with this property.
- For any recursive operator Ψ there is a partial computable function φ such that Ψ(φ) = φ and φ is the smallest partial computable function with this property.
The first recursion theorem is also called the fixed point theorem (of recursion theory).[10] It can also be stated for recursive functionals, as follows:
Let [math]\displaystyle{ \Phi: \mathbb{F}(\mathbb{N}^k) \rightarrow \mathbb{F}(\mathbb{N}^k) }[/math] be a recursive functional, where [math]\displaystyle{ \mathbb{F}(\mathbb{N}^k) }[/math] denotes the set of partial functions from [math]\displaystyle{ \mathbb{N}^k }[/math] to [math]\displaystyle{ \mathbb{N} }[/math]. Then [math]\displaystyle{ \Phi }[/math] has a least fixed point [math]\displaystyle{ f_{\Phi}: \mathbb{N}^k \rightarrow \mathbb{N} }[/math] which is computable, i.e.
1) [math]\displaystyle{ \Phi(f_{\Phi})=f_{\Phi} }[/math]
2) for all [math]\displaystyle{ g \in \mathbb{F}(\mathbb{N}^k) }[/math] such that [math]\displaystyle{ \Phi(g)=g }[/math], it holds that [math]\displaystyle{ f_{\Phi}\subseteq g }[/math]
3) [math]\displaystyle{ f_{\Phi} }[/math] is computable
Example
Like the second recursion theorem, the first recursion theorem can be used to obtain functions satisfying systems of recursion equations. To apply the first recursion theorem, the recursion equations must first be recast as a recursive operator.
Consider the recursion equations for the factorial function f:[math]\displaystyle{ \begin{align} &f(0) = 1 \\ &f(n+1) = (n + 1) \cdot f(n) \end{align} }[/math]The corresponding recursive operator Φ will have information that tells how to get to the next value of f from the previous value. However, the recursive operator will actually define the graph of f. First, Φ will contain the pair [math]\displaystyle{ ( \varnothing, (0, 1)) }[/math]. This indicates that f(0) is unequivocally 1, and thus the pair (0,1) is in the graph of f.
Next, for each n and m, Φ will contain the pair [math]\displaystyle{ ( \{ (n, m) \}, (n+1, (n+1)\cdot m)) }[/math]. This indicates that, if f(n) is m, then f(n + 1) is (n + 1)m, so that the pair (n + 1, (n + 1)m) is in the graph of f. Unlike the base case f(0) = 1, the recursive operator requires some information about f(n) before it defines a value of f(n + 1).
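Putting the two kinds of pairs together, the operator of this example can be written explicitly as
- [math]\displaystyle{ \Phi = \{(\varnothing,(0,1))\} \cup \{(\{(n,m)\},\,(n+1,(n+1)\cdot m)) \mid n, m \in \mathbb{N}\}. }[/math]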
The first recursion theorem (in particular, part 1) states that there is a set F such that Φ(F) = F. The set F will consist entirely of ordered pairs of natural numbers, and will be the graph of the factorial function f, as desired.
The restriction to recursion equations that can be recast as recursive operators ensures that the recursion equations actually define a least fixed point. For example, consider the set of recursion equations:[math]\displaystyle{ \begin{align} &g(0) = 1\\ &g(n + 1) = 1\\ &g(2n) = 0 \end{align} }[/math]There is no function g satisfying these equations, because they imply g(2) = 1 and also imply g(2) = 0. Thus there is no fixed point g satisfying these recursion equations. It is possible to make an enumeration operator corresponding to these equations, but it will not be a recursive operator.
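For instance, an enumeration operator corresponding to these equations (written here for illustration) is
- [math]\displaystyle{ \{(\varnothing,(0,1))\} \cup \{(\varnothing,(n+1,1)) \mid n \in \mathbb{N}\} \cup \{(\varnothing,(2n,0)) \mid n \in \mathbb{N}\}. }[/math]
Applied to the empty set, which is the graph of the nowhere-defined partial function, it already enumerates both (2, 1) and (2, 0), so its output is not the graph of a partial function; hence it is not a recursive operator.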
Proof sketch for the first recursion theorem
The proof of part 1 of the first recursion theorem is obtained by iterating the enumeration operator Φ beginning with the empty set. First, a sequence Fk is constructed, for [math]\displaystyle{ k = 0, 1, \ldots }[/math]. Let F0 be the empty set. Proceeding inductively, for each k, let Fk + 1 be [math]\displaystyle{ F_k \cup \Phi(F_k) }[/math]. Finally, F is taken to be [math]\displaystyle{ \bigcup F_k }[/math]. The remainder of the proof consists of a verification that F is recursively enumerable and is the least fixed point of Φ. The sequence Fk used in this proof corresponds to the Kleene chain in the proof of the Kleene fixed-point theorem.
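For the factorial operator Φ of the example above, this iteration can be simulated directly. The following Lisp sketch is an illustration added here (not taken from the cited sources); it represents a finite part of the graph of f as a list of dotted pairs (n . m), each meaning f(n) = m:

(defun phi-fact (x)
  ;; Apply the factorial enumeration operator to a finite approximation X:
  ;; always assert (0 . 1), and from each (n . m) in X derive (n+1 . (n+1)*m).
  (cons '(0 . 1)
        (mapcar (lambda (pair)
                  (let ((n (car pair)) (m (cdr pair)))
                    (cons (+ n 1) (* (+ n 1) m))))
                x)))

(defun iterate-phi (k)
  ;; Compute F_k by k applications of the operator, starting from the empty set.
  (let ((f '()))
    (dotimes (i k f)
      (setq f (union f (phi-fact f) :test #'equal)))))

;; (iterate-phi 5) contains (0 . 1) (1 . 1) (2 . 2) (3 . 6) (4 . 24):
;; the graph of the factorial function up to argument 4.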
The second part of the first recursion theorem follows from the first part. The assumption that Φ is a recursive operator is used to show that the fixed point of Φ is the graph of a partial function. The key point is that if the fixed point F is not the graph of a function, then there is some k such that Fk is not the graph of a function.
Comparison to the second recursion theorem
Compared to the second recursion theorem, the first recursion theorem produces a stronger conclusion but only when narrower hypotheses are satisfied. Rogers uses the term weak recursion theorem for the first recursion theorem and strong recursion theorem for the second recursion theorem.[3]
One difference between the first and second recursion theorems is that the fixed points obtained by the first recursion theorem are guaranteed to be least fixed points, while those obtained from the second recursion theorem may not be least fixed points.
A second difference is that the first recursion theorem only applies to systems of equations that can be recast as recursive operators. This restriction is similar to the restriction to continuous operators in the Kleene fixed-point theorem of order theory. The second recursion theorem can be applied to any total recursive function.
Generalized theorem
In the context of his theory of numberings, Ershov showed that Kleene's recursion theorem holds for any precomplete numbering.[11] A Gödel numbering is a precomplete numbering on the set of computable functions so the generalized theorem yields the Kleene recursion theorem as a special case.[12]
Given a precomplete numbering [math]\displaystyle{ \nu }[/math], for any partial computable function [math]\displaystyle{ f }[/math] of two parameters there exists a total computable function [math]\displaystyle{ t }[/math] of one parameter such that
- [math]\displaystyle{ \forall n \in \mathbb{N} : \nu \circ f(n,t(n)) = \nu \circ t(n). }[/math]
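For instance, taking [math]\displaystyle{ \nu }[/math] to be the Gödel numbering [math]\displaystyle{ \varphi }[/math] and [math]\displaystyle{ f(n,m) = F(m) }[/math] for a total computable function [math]\displaystyle{ F }[/math], the conclusion becomes
- [math]\displaystyle{ \varphi_{F(t(n))} \simeq \varphi_{t(n)} \quad \text{for all } n, }[/math]
so every value of [math]\displaystyle{ t }[/math] is a fixed point of [math]\displaystyle{ F }[/math] in the sense of Rogers's fixed-point theorem.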
See also
- Denotational semantics, where another least fixed point theorem is used for the same purpose as the first recursion theorem.
- Fixed-point combinators, which are used in lambda calculus for the same purpose as the first recursion theorem.
- Diagonal lemma, a closely related result in mathematical logic.
References
- Ershov, Yuri L. (1999). "Part 4: Mathematics and Computability Theory. 14. Theory of numbering". in Griffor, Edward R.. Handbook of Computability Theory. Studies in logic and the foundations of mathematics. 140. Amsterdam: Elsevier. pp. 473–503. ISBN 9780444898821. OCLC 162130533. https://books.google.com/books?id=KqeXZ4pPd5QC. Retrieved 6 May 2020.
- Jones, Neil D. (1997). Computability and complexity: From a Programming Perspective. Cambridge, Massachusetts: MIT Press. ISBN 9780262100649. OCLC 981293265.
- Kleene, Stephen C. (1952). Introduction to Metamathematics. Bibliotheca Mathematica. North-Holland Publishing. ISBN 9780720421033. OCLC 459805591. https://archive.org/details/BubliothecaMathematicaStephenColeKleeneIntroductionToMetamathematicsWoltersNoordhoffPublishing1971. Retrieved 6 May 2020.
- Rogers, Hartley (1967). Theory of recursive functions and effective computability. Cambridge, Massachusetts: MIT Press. ISBN 9780262680523. OCLC 933975989. https://archive.org/details/theoryofrecursiv00roge. Retrieved 6 May 2020.
Footnotes
- ↑ Kleene, Stephen C. (1938). "On notation for ordinal numbers". Journal of Symbolic Logic 3 (4): 150–155. doi:10.2307/2267778. ISSN 0022-4812. http://www.thatmarcusfamily.org/philosophy/Course_Websites/Readings/Kleene%20-%20Ordinals.pdf. Retrieved 6 May 2020.
- ↑ Kleene 1952.
- ↑ 3.0 3.1 Rogers 1967.
- ↑ Rogers 1967, §11.2.
- ↑ Soare, R.I. (1987). Recursively Enumerable Sets and Degrees: A Study of Computable Functions and Computably Generated Sets. Perspectives in Mathematical Logic. Berlin and New York City: Springer-Verlag. p. 88. ISBN 9780387152998. OCLC 318368332.
- ↑ Jones 1997, pp. 229-30.
- ↑ Kleene 1952, pp. 352-3.
- ↑ Cutland, Nigel J. (1980). Computability: An Introduction to Recursive Function Theory. Cambridge University Press. p. 204. doi:10.1017/cbo9781139171496. ISBN 9781139935609. OCLC 488175597. https://books.google.com/books?id=wAstOUE36kcC. Retrieved 6 May 2020.
- ↑ Jones 1997.
- ↑ Cutland 1980.
- ↑ Barendregt, Henk; Terwijn, Sebastiaan A. (2019). "Fixed point theorems for precomplete numberings". Annals of Pure and Applied Logic 170 (10): 1151–1161. doi:10.1016/j.apal.2019.04.013. ISSN 0168-0072. http://www.sciencedirect.com/science/article/pii/S016800721930048X. Retrieved 6 May 2020. p. 1151.
- ↑ See Ershov 1999, §4.14 for a survey in English.
Further reading
- Jockusch, C. G.; Lerman, M.; Soare, R.I.; Solovay, R.M. (1989). "Recursively enumerable sets modulo iterated jumps and extensions of Arslanov's completeness criterion". The Journal of Symbolic Logic 54 (4): 1288–1323. doi:10.1017/S0022481200041104. ISSN 0022-4812.
External links
- Recursive Functions entry by Piergiorgio Odifreddi in the Stanford Encyclopedia of Philosophy, 2012.
Original source: https://en.wikipedia.org/wiki/Kleene's recursion theorem.