Diagonal lemma
In mathematical logic, the diagonal lemma (also known as diagonalization lemma, self-reference lemma[1] or fixed point theorem) establishes the existence of self-referential sentences in certain formal theories of the natural numbers—specifically those theories that are strong enough to represent all computable functions. The sentences whose existence is secured by the diagonal lemma can then, in turn, be used to prove fundamental limitative results such as Gödel's incompleteness theorems and Tarski's undefinability theorem.[2]
Background
Let [math]\displaystyle{ \mathbb{N} }[/math] be the set of natural numbers. A first-order theory [math]\displaystyle{ T }[/math] in the language of arithmetic represents[3] the computable function [math]\displaystyle{ f: \mathbb{N}\rightarrow\mathbb{N} }[/math] if there exists a "graph" formula [math]\displaystyle{ \mathcal{G}_f(x, y) }[/math] in the language of [math]\displaystyle{ T }[/math] such that for each [math]\displaystyle{ n \in \mathbb{N} }[/math]
- [math]\displaystyle{ \vdash_{T}\,(\forall y)[(^\circ f(n)=y) \Leftrightarrow \mathcal{G}_f(^\circ n,\,y)] }[/math]
Here [math]\displaystyle{ {}^\circ n }[/math] is the numeral corresponding to the natural number [math]\displaystyle{ n }[/math], which is defined to be the [math]\displaystyle{ n }[/math]th successor of presumed first numeral [math]\displaystyle{ 0 }[/math] in [math]\displaystyle{ T }[/math].
The diagonal lemma also requires a systematic way of assigning to every formula [math]\displaystyle{ \mathcal{A} }[/math] a natural number [math]\displaystyle{ \#(\mathcal{A}) }[/math] (also written as [math]\displaystyle{ \#_{\mathcal{A}} }[/math]) called its Gödel number. Formulas can then be represented within [math]\displaystyle{ T }[/math] by the numerals corresponding to their Gödel numbers. For example, [math]\displaystyle{ \mathcal{A} }[/math] is represented by [math]\displaystyle{ ^{\circ}\#_{\mathcal{A}} }[/math]
The diagonal lemma applies to theories capable of representing all primitive recursive functions. Such theories include first-order Peano arithmetic and the weaker Robinson arithmetic, and even to a much weaker theory known as R. A common statement of the lemma (as given below) makes the stronger assumption that the theory can represent all computable functions, but all the theories mentioned have that capacity, as well.
Statement of the lemma
Lemma[4] — Let [math]\displaystyle{ T }[/math] be a first-order theory in the language of arithmetic and capable of representing all computable functions, and [math]\displaystyle{ \mathcal{F}(y) }[/math] be a formula in [math]\displaystyle{ T }[/math] with one free variable. Then there exists a sentence [math]\displaystyle{ \mathcal{C} }[/math] such that
- [math]\displaystyle{ \vdash_T\,\mathcal{C}\Leftrightarrow\mathcal{F}({}^{\circ}\#_{\mathcal{C}}) }[/math]
Intuitively, [math]\displaystyle{ \mathcal{C} }[/math] is a self-referential sentence: [math]\displaystyle{ \mathcal{C} }[/math] says that [math]\displaystyle{ \mathcal{C} }[/math] has the property [math]\displaystyle{ \mathcal{F} }[/math]. The sentence [math]\displaystyle{ \mathcal{C} }[/math] can also be viewed as a fixed point of the operation that assigns, to the equivalence class of a given sentence [math]\displaystyle{ \mathcal{A} }[/math], the equivalence class of the sentence [math]\displaystyle{ \mathcal{F}(^\circ \#_{\mathcal{A}}) }[/math] (a sentence's equivalence class is the set of all sentences to which it is provably equivalent in the theory [math]\displaystyle{ T }[/math]). The sentence [math]\displaystyle{ \mathcal{C} }[/math] constructed in the proof is not literally the same as [math]\displaystyle{ F(^\circ \#_{\mathcal{C}}) }[/math], but is provably equivalent to it in the theory [math]\displaystyle{ T }[/math].
Proof
Let [math]\displaystyle{ f:\mathbb{N}\to\mathbb{N} }[/math] be the function defined by:
- [math]\displaystyle{ f(\#_{\mathcal{A}}) = \#[\mathcal{A}(^{\circ}\#_{\mathcal{A}})] }[/math]
for each formula [math]\displaystyle{ \mathcal{A}(x) }[/math] with only one free variable [math]\displaystyle{ x }[/math] in theory [math]\displaystyle{ T }[/math], and [math]\displaystyle{ f(n)=0 }[/math] otherwise. Here [math]\displaystyle{ \#_{\mathcal{A}}=\#(\mathcal{A}(x)) }[/math] denotes the Gödel number of formula [math]\displaystyle{ \mathcal{A}(x) }[/math]. The function [math]\displaystyle{ f }[/math] is computable (which is ultimately an assumption about the Gödel numbering scheme), so there is a formula [math]\displaystyle{ \mathcal{G}_f(x,\,y) }[/math] representing [math]\displaystyle{ f }[/math] in [math]\displaystyle{ T }[/math]. Namely
- [math]\displaystyle{ \vdash_T\,(\forall y)\{\mathcal{G}_f(^{\circ}\#_{\mathcal{A}},\,y) \Leftrightarrow [y = {}^{\circ}f(\#_{\mathcal{A}})]\} }[/math]
which is to say
- [math]\displaystyle{ \vdash_T\,(\forall y)\{\mathcal{G}_f(^{\circ}\#_{\mathcal{A}},\,y) \Leftrightarrow [y = {}^{\circ}\#(\mathcal{A}(^{\circ}\#_{\mathcal{A}}))]\} }[/math]
Now, given an arbitrary formula [math]\displaystyle{ \mathcal{F}(y) }[/math] with one free variable [math]\displaystyle{ y }[/math], define the formula [math]\displaystyle{ \mathcal{B}(z) }[/math] as:
- [math]\displaystyle{ \mathcal{B}(z) := (\forall y) [\mathcal{G}_f(z,\,y)\Rightarrow \mathcal{F}(y)] }[/math]
Then, for all formulas [math]\displaystyle{ \mathcal{A}(x) }[/math] with one free variable:
- [math]\displaystyle{ \vdash_T\,\mathcal{B}(^{\circ}\#_{\mathcal{A}}) \Leftrightarrow (\forall y)\{[ y = {}^{\circ}\#(\mathcal{A}(^{\circ}\#_{\mathcal{A}}))] \Rightarrow \mathcal{F}(y)\} }[/math]
which is to say
- [math]\displaystyle{ \vdash_T\,\mathcal{B}(^{\circ}\#_{\mathcal{A}}) \Leftrightarrow \mathcal{F}(^{\circ}\#[\mathcal{A}(^{\circ}\#_{\mathcal{A}})]) }[/math]
Now substitute [math]\displaystyle{ \mathcal{A} }[/math] with [math]\displaystyle{ \mathcal{B} }[/math], and define the sentence [math]\displaystyle{ \mathcal{C} }[/math] as:
- [math]\displaystyle{ \mathcal{C}:= \mathcal{B}(^{\circ}\#_{\mathcal{B}}) }[/math]
Then the previous line can be rewritten as
- [math]\displaystyle{ \vdash_T\,\mathcal{C}\Leftrightarrow\mathcal{F}(^{\circ}\#_{\mathcal{C}}) }[/math]
which is the desired result.
(The same argument in different terms is given in [Raatikainen (2015a)].)
History
The lemma is called "diagonal" because it bears some resemblance to Cantor's diagonal argument.[5] The terms "diagonal lemma" or "fixed point" do not appear in Kurt Gödel's 1931 article or in Alfred Tarski's 1936 article.
Rudolf Carnap (1934) was the first to prove the general self-referential lemma,[6] which says that for any formula F in a theory T satisfying certain conditions, there exists a formula ψ such that ψ ↔ F(°#(ψ)) is provable in T. Carnap's work was phrased in alternate language, as the concept of computable functions was not yet developed in 1934. Mendelson (1997, p. 204) believes that Carnap was the first to state that something like the diagonal lemma was implicit in Gödel's reasoning. Gödel was aware of Carnap's work by 1937.[7]
The diagonal lemma is closely related to Kleene's recursion theorem in computability theory, and their respective proofs are similar.
See also
- Indirect self-reference
- List of fixed point theorems
- Primitive recursive arithmetic
- Self-reference
- Self-referential paradoxes
Notes
- ↑ Hájek, Petr; Pudlák, Pavel (1998). Metamathematics of First-Order Arithmetic. Perspectives in Mathematical Logic (1st ed.). Springer. ISBN 3-540-63648-X. "In modern texts these results are proved using the well-known diagonalization (or self-reference) lemma, which is already implicit in Gödel's proof."
- ↑ See Boolos and Jeffrey (2002, sec. 15) and Mendelson (1997, Prop. 3.37 and Cor. 3.44 ).
- ↑ For details on representability, see Hinman 2005, p. 316
- ↑ Smullyan (1991, 1994) are standard specialized references. The lemma is Prop. 3.34 in Mendelson (1997), and is covered in many texts on basic mathematical logic, such as Boolos and Jeffrey (1989, sec. 15) and Hinman (2005).
- ↑ See, for example, Gaifman (2006).
- ↑ Kurt Gödel, Collected Works, Volume I: Publications 1929–1936, Oxford University Press, 1986, p. 339.
- ↑ See Gödel's Collected Works, Vol. 1, Oxford University Press, 1986, p. 363, fn 23.
References
- George Boolos and Richard Jeffrey, 1989. Computability and Logic, 3rd ed. Cambridge University Press. ISBN:0-521-38026-X ISBN:0-521-38923-2
- Rudolf Carnap, 1934. Logische Syntax der Sprache. (English translation: 2003. The Logical Syntax of Language. Open Court Publishing.)
- Haim Gaifman, 2006. 'Naming and Diagonalization: From Cantor to Gödel to Kleene'. Logic Journal of the IGPL, 14: 709–728.
- Hinman, Peter, 2005. Fundamentals of Mathematical Logic. A K Peters. ISBN:1-56881-262-0
- Mendelson, Elliott, 1997. Introduction to Mathematical Logic, 4th ed. Chapman & Hall.
- Panu Raatikainen, 2015a. The Diagonalization Lemma. In Stanford Encyclopedia of Philosophy, ed. Zalta. Supplement to Raatikainen (2015b).
- Panu Raatikainen, 2015b. Gödel's Incompleteness Theorems. In Stanford Encyclopedia of Philosophy, ed. Zalta.
- Raymond Smullyan, 1991. Gödel's Incompleteness Theorems. Oxford Univ. Press.
- Raymond Smullyan, 1994. Diagonalization and Self-Reference. Oxford Univ. Press.
- Alfred Tarski (1936). "Der Wahrheitsbegriff in den formalisierten Sprachen". Studia Philosophica 1: 261–405. Archived from the original on 9 January 2014. https://web.archive.org/web/20140109135345/http://www.ifispan.waw.pl/studialogica/s-p-f/volumina_i-iv/I-07-Tarski-small.pdf. Retrieved 26 June 2013.
- Alfred Tarski, tr. J. H. Woodger, 1983. "The Concept of Truth in Formalized Languages". English translation of Tarski's 1936 article. In A. Tarski, ed. J. Corcoran, 1983, Logic, Semantics, Metamathematics, Hackett.
Original source: https://en.wikipedia.org/wiki/Diagonal lemma.
Read more |