Post's theorem
In computability theory, Post's theorem, named after Emil Post, describes the connection between the arithmetical hierarchy and the Turing degrees.
Background
The statement of Post's theorem uses several concepts relating to definability and recursion theory. This section gives a brief overview of these concepts, which are covered in depth in their respective articles.
The arithmetical hierarchy classifies certain sets of natural numbers that are definable in the language of Peano arithmetic. A formula is said to be [math]\displaystyle{ \Sigma^{0}_m }[/math] if it is an existential statement in prenex normal form (all quantifiers at the front) with [math]\displaystyle{ m }[/math] alternating blocks of existential and universal quantifiers applied to a formula with bounded quantifiers only. Formally, a formula [math]\displaystyle{ \phi(x_1,\ldots,x_k) }[/math] in the language of Peano arithmetic is a [math]\displaystyle{ \Sigma^{0}_m }[/math] formula if it is of the form
- [math]\displaystyle{ \left(\exists n^1_1\exists n^1_2\cdots\exists n^1_{j_1}\right)\left(\forall n^2_1 \cdots \forall n^2_{j_2}\right)\left(\exists n^3_1\cdots\right)\cdots\left(Q n^m_1 \cdots \right)\rho(n^1_1,\ldots n^m_{j_m},x_1,\ldots,x_k) }[/math]
where [math]\displaystyle{ \rho }[/math] contains only bounded quantifiers and Q is [math]\displaystyle{ \forall }[/math] if m is even and [math]\displaystyle{ \exists }[/math] if m is odd.
A set of natural numbers [math]\displaystyle{ A }[/math] is said to be [math]\displaystyle{ \Sigma^0_m }[/math] if it is definable by a [math]\displaystyle{ \Sigma^0_m }[/math] formula, that is, if there is a [math]\displaystyle{ \Sigma^0_m }[/math] formula [math]\displaystyle{ \phi(s) }[/math] such that each number [math]\displaystyle{ n }[/math] is in [math]\displaystyle{ A }[/math] if and only if [math]\displaystyle{ \phi(n) }[/math] holds. It is known that if a set is [math]\displaystyle{ \Sigma^0_m }[/math] then it is [math]\displaystyle{ \Sigma^0_n }[/math] for any [math]\displaystyle{ n \gt m }[/math], but for each m there is a [math]\displaystyle{ \Sigma^0_{m+1} }[/math] set that is not [math]\displaystyle{ \Sigma^0_m }[/math]. Thus the number of quantifier alternations required to define a set gives a measure of the complexity of the set.
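To make the shape of the definition concrete, the following is a small worked illustration in the notation above. The predicate "x is even" and the use of a single variable per quantifier block are choices made only for this example; \rho stands for any formula with bounded quantifiers only.

```latex
% Bounded quantifiers only, hence \Sigma^0_0 (the predicate "x is even"):
\exists y \le x \; (x = y + y)

% m = 1: a single existential block
\exists n^1_1 \; \rho(n^1_1, x)

% m = 2: an existential block, then a universal block (Q = \forall, m even)
\exists n^1_1 \, \forall n^2_1 \; \rho(n^1_1, n^2_1, x)

% m = 3: the innermost block is existential again (Q = \exists, m odd)
\exists n^1_1 \, \forall n^2_1 \, \exists n^3_1 \; \rho(n^1_1, n^2_1, n^3_1, x)
```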
Post's theorem uses the relativized arithmetical hierarchy as well as the unrelativized hierarchy just defined. A set [math]\displaystyle{ A }[/math] of natural numbers is said to be [math]\displaystyle{ \Sigma^0_m }[/math] relative to a set [math]\displaystyle{ B }[/math], written [math]\displaystyle{ \Sigma^{0,B}_m }[/math], if [math]\displaystyle{ A }[/math] is definable by a [math]\displaystyle{ \Sigma^0_m }[/math] formula in an extended language that includes a predicate for membership in [math]\displaystyle{ B }[/math].
While the arithmetical hierarchy measures definability of sets of natural numbers, Turing degrees measure the level of uncomputability of sets of natural numbers. A set [math]\displaystyle{ A }[/math] is said to be Turing reducible to a set [math]\displaystyle{ B }[/math], written [math]\displaystyle{ A \leq_T B }[/math], if there is an oracle Turing machine that, given an oracle for [math]\displaystyle{ B }[/math], computes the characteristic function of [math]\displaystyle{ A }[/math]. The Turing jump of a set [math]\displaystyle{ A }[/math] is a form of the Halting problem relative to [math]\displaystyle{ A }[/math]. Given a set [math]\displaystyle{ A }[/math], the Turing jump [math]\displaystyle{ A' }[/math] is the set of indices of oracle Turing machines that halt on input [math]\displaystyle{ 0 }[/math] when run with oracle [math]\displaystyle{ A }[/math]. It is known that every set [math]\displaystyle{ A }[/math] is Turing reducible to its Turing jump, but the Turing jump of a set is never Turing reducible to the original set.
Post's theorem uses finitely iterated Turing jumps. For any set [math]\displaystyle{ A }[/math] of natural numbers, the notation [math]\displaystyle{ A^{(n)} }[/math] indicates the [math]\displaystyle{ n }[/math]–fold iterated Turing jump of [math]\displaystyle{ A }[/math]. Thus [math]\displaystyle{ A^{(0)} }[/math] is just [math]\displaystyle{ A }[/math], and [math]\displaystyle{ A^{(n+1)} }[/math] is the Turing jump of [math]\displaystyle{ A^{(n)} }[/math].
Post's theorem and corollaries
Post's theorem establishes a close connection between the arithmetical hierarchy and the Turing degrees of the form [math]\displaystyle{ \emptyset^{(n)} }[/math], that is, finitely iterated Turing jumps of the empty set. (The empty set could be replaced with any other computable set without changing the truth of the theorem.)
Post's theorem states:
- A set [math]\displaystyle{ B }[/math] is [math]\displaystyle{ \Sigma^0_{n+1} }[/math] if and only if [math]\displaystyle{ B }[/math] is recursively enumerable by an oracle Turing machine with an oracle for [math]\displaystyle{ \emptyset^{(n)} }[/math], that is, if and only if [math]\displaystyle{ B }[/math] is [math]\displaystyle{ \Sigma^{0,\emptyset^{(n)}}_1 }[/math].
- The set [math]\displaystyle{ \emptyset^{(n)} }[/math] is [math]\displaystyle{ \Sigma^0_n }[/math]-complete for every [math]\displaystyle{ n \gt 0 }[/math]. This means that [math]\displaystyle{ \emptyset^{(n)} }[/math] is itself [math]\displaystyle{ \Sigma^0_n }[/math] and that every [math]\displaystyle{ \Sigma^0_n }[/math] set is many-one reducible to [math]\displaystyle{ \emptyset^{(n)} }[/math].
Post's theorem has many corollaries that expose additional relationships between the arithmetical hierarchy and the Turing degrees. These include:
- Fix a set [math]\displaystyle{ C }[/math]. A set [math]\displaystyle{ B }[/math] is [math]\displaystyle{ \Sigma^{0,C}_{n+1} }[/math] if and only if [math]\displaystyle{ B }[/math] is [math]\displaystyle{ \Sigma^{0,C^{(n)}}_1 }[/math]. This is the relativization of the first part of Post's theorem to the oracle [math]\displaystyle{ C }[/math].
- A set [math]\displaystyle{ B }[/math] is [math]\displaystyle{ \Delta^0_{n+1} }[/math] if and only if [math]\displaystyle{ B \leq_T \emptyset^{(n)} }[/math]. More generally, [math]\displaystyle{ B }[/math] is [math]\displaystyle{ \Delta^{0,C}_{n+1} }[/math] if and only if [math]\displaystyle{ B \leq_T C^{(n)} }[/math].
- A set is defined to be arithmetical if it is [math]\displaystyle{ \Sigma^0_n }[/math] for some [math]\displaystyle{ n }[/math]. Post's theorem shows that, equivalently, a set is arithmetical if and only if it is Turing reducible to [math]\displaystyle{ \emptyset^{(m)} }[/math] for some m.
Proof of Post's theorem
Formalization of Turing machines in first-order arithmetic
The operation of a Turing machine [math]\displaystyle{ T }[/math] on input [math]\displaystyle{ n }[/math] can be formalized logically in first-order arithmetic. For example, we may use symbols [math]\displaystyle{ A_k }[/math], [math]\displaystyle{ B_k }[/math], and [math]\displaystyle{ C_k }[/math] for the tape configuration, machine state and location along the tape after [math]\displaystyle{ k }[/math] steps, respectively. [math]\displaystyle{ T }[/math]'s transition system determines the relation between [math]\displaystyle{ (A_k,B_k,C_k) }[/math] and [math]\displaystyle{ (A_{k+1},B_{k+1},C_{k+1}) }[/math]; their initial values (for [math]\displaystyle{ k=0 }[/math]) are the input, the initial state and zero, respectively. The machine halts if and only if there is a number [math]\displaystyle{ k }[/math] such that [math]\displaystyle{ B_k }[/math] is the halting state.
The exact relation depends on the specific implementation of the notion of Turing machine (e.g. its alphabet, allowed mode of motion along the tape, etc.).
In case [math]\displaystyle{ T }[/math] halts at time [math]\displaystyle{ n_1 }[/math], the relation between [math]\displaystyle{ (A_k,B_k,C_k) }[/math] and [math]\displaystyle{ (A_{k+1},B_{k+1},C_{k+1}) }[/math] must be satisfied only for k bounded from above by [math]\displaystyle{ n_1 }[/math].
Thus there is a formula [math]\displaystyle{ \varphi(n,n_1) }[/math] in first-order arithmetic with no unbounded quantifiers, such that [math]\displaystyle{ T }[/math] halts on input [math]\displaystyle{ n }[/math] at time [math]\displaystyle{ n_1 }[/math] at most if and only if [math]\displaystyle{ \varphi(n,n_1) }[/math] is satisfied.
Implementation example
For example, for a prefix-free Turing machine with binary alphabet and no blank symbol, we may use the following notations:
- [math]\displaystyle{ A_k }[/math] is the 1-ary symbol for the configuration of the whole tape after [math]\displaystyle{ k }[/math] steps (which we may write as a number with LSB first, the value of the m-th location on the tape being its m-th least significant bit). In particular [math]\displaystyle{ A_0 }[/math] is the initial configuration of the tape, which corresponds to the input to the machine.
- [math]\displaystyle{ B_k }[/math] is the 1-ary symbol for the Turing machine state after [math]\displaystyle{ k }[/math] steps. In particular, [math]\displaystyle{ B_0=q_I }[/math], the initial state of the Turing machine.
- [math]\displaystyle{ C_k }[/math] is the 1-ary symbol for the Turing machine location on the tape after [math]\displaystyle{ k }[/math] steps. In particular [math]\displaystyle{ C_0=0 }[/math].
- [math]\displaystyle{ M(q,b) }[/math] is the transition function of the Turing machine, written as a function from a doublet (machine state, bit read by the machine) to a triplet (new machine state, bit written by the machine, +1 or -1 machine movement along the tape).
- [math]\displaystyle{ bit(j,m) }[/math] is the j-th bit of a number [math]\displaystyle{ m }[/math]. This can be written as a first-order arithmetic formula with no unbounded quantifiers.
For a prefix-free Turing machine we may use, for input [math]\displaystyle{ n }[/math], the initial tape configuration [math]\displaystyle{ t(n)= cat(2^{\lceil \log_2 n \rceil}-1,0,n) }[/math] where cat stands for concatenation; thus [math]\displaystyle{ t(n) }[/math] is a string of [math]\displaystyle{ \lceil \log_2 n \rceil }[/math] ones, followed by a [math]\displaystyle{ 0 }[/math] and then by [math]\displaystyle{ n }[/math].
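The idea behind this encoding (a unary length prefix, a separating 0, then the number itself) can be sketched in Python. This is only an illustration under stated assumptions: the names `encode` and `decode` are hypothetical, the prefix length is taken to be the number of binary digits of n (which differs from the ceiling of log_2 n when n is a power of two), and the tape is shown as a string read left to right rather than as an LSB-first number.

```python
from typing import Tuple


def encode(n: int) -> str:
    """Self-delimiting code: one '1' per binary digit of n, a '0', then the digits."""
    if n < 0:
        raise ValueError("n must be a natural number")
    digits = format(n, "b")              # binary digits of n (assumption: MSB first)
    return "1" * len(digits) + "0" + digits


def decode(tape: str) -> Tuple[int, str]:
    """Read one codeword from the front of the tape; return (n, rest of tape)."""
    length = tape.index("0")             # number of leading ones = number of digits
    start = length + 1
    return int(tape[start:start + length], 2), tape[start + length:]
```

Because the leading ones announce how many digits follow, a reader always knows where the input ends, so no codeword is a proper prefix of another.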
The operation of the Turing machine at the first [math]\displaystyle{ n_1 }[/math] steps can thus be written as the conjunction of the initial conditions and the following formulas, quantified over [math]\displaystyle{ k }[/math] for all [math]\displaystyle{ k\lt n_1 }[/math]:
- [math]\displaystyle{ (B_{k+1}, bit(C_k ,A_{k+1}), D) = M(B_k, bit(C_k ,A_k)) }[/math]. Since M has a finite domain, this can be replaced by a first-order quantifier-free arithmetic formula. The exact formula obviously depends on M.
- [math]\displaystyle{ C_{k+1} = C_k+D }[/math]
- [math]\displaystyle{ \forall j: j\ne C_k \rightarrow bit(j ,A_{k+1}) = bit(j ,A_k) }[/math]. Note that at the first [math]\displaystyle{ n_1 }[/math] steps, [math]\displaystyle{ T }[/math] never arrives at a location along the tape greater than [math]\displaystyle{ n_1 }[/math]. Thus the universal quantifier over [math]\displaystyle{ j }[/math] can be bounded by [math]\displaystyle{ n_1+1 }[/math], as bits beyond this location have no relevance for the machine's operation.
[math]\displaystyle{ T }[/math] halts on input [math]\displaystyle{ n }[/math] at time [math]\displaystyle{ n_1 }[/math] at most if and only if [math]\displaystyle{ \varphi(n,n_1) }[/math] is satisfied, where:
- [math]\displaystyle{ \begin{align}\varphi(n,n_1) =& (A_0=t(n)) \land (B_0=q_I) \land (C_0=0) \land (B_{n_1}=q_H)\\ &\land \forall k\lt n_1: \Big[\Big(\big((B_{k+1},bit(C_k,A_{k+1}),1) = M(B_k,bit(C_k,A_k)) \land C_{k+1}=C_k+1\big)\\ &\lor \big((B_{k+1},bit(C_k ,A_{k+1}),-1) = M(B_k,bit(C_k,A_k)) \land C_{k+1}=C_k-1\big)\Big)\\ &\land \forall j\lt n_1+1: \big(j\ne C_k \rightarrow bit(j,A_{k+1})=bit(j,A_k)\big)\Big] \end{align} }[/math]
This is a first-order arithmetic formula with no unbounded quantifiers, i.e. it is in [math]\displaystyle{ \Sigma^0_0 }[/math].
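The bounded check expressed by this formula can be mirrored by a step-limited simulation. The sketch below is a minimal illustration, not a formalization: the state names `"qI"` and `"qH"`, the table `SCAN`, and the function `halts_within` are hypothetical, and a machine that moves left of position 0 or has no applicable rule is simply treated as not halting.

```python
from typing import Dict, Tuple

# Transition table M: (state, bit read) -> (new state, bit written, move +1 or -1).
Transition = Dict[Tuple[str, int], Tuple[str, int, int]]


def bit(j: int, m: int) -> int:
    """The j-th least significant bit of m."""
    return (m >> j) & 1


def halts_within(M: Transition, tape: int, n1: int,
                 q_init: str = "qI", q_halt: str = "qH") -> bool:
    """Check whether the machine reaches q_halt in at most n1 steps."""
    A, B, C = tape, q_init, 0            # A_0, B_0, C_0
    for _ in range(n1):                  # bounded quantifier: k < n1
        if B == q_halt:
            return True
        if C < 0 or (B, bit(C, A)) not in M:
            return False                 # assumption: a stuck machine does not halt
        B_next, written, D = M[(B, bit(C, A))]
        # Only the bit at position C changes; every other bit of A is kept.
        A = (A & ~(1 << C)) | (written << C)
        B, C = B_next, C + D
    return B == q_halt


# Example machine (hypothetical): move right over ones, halt on the first zero.
SCAN: Transition = {("qI", 1): ("qI", 1, +1), ("qI", 0): ("qH", 0, +1)}
```

On the tape `0b111` (three ones, LSB first) the example machine needs four steps, so `halts_within(SCAN, 0b111, 3)` is false while `halts_within(SCAN, 0b111, 4)` is true. Every loop in the simulation is bounded by `n1`, which is the computational counterpart of the formula having no unbounded quantifiers.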
Recursively enumerable sets
Let [math]\displaystyle{ S }[/math] be a set that can be recursively enumerated by a Turing machine. Then there is a Turing machine [math]\displaystyle{ T }[/math] that halts on input [math]\displaystyle{ n }[/math] if and only if [math]\displaystyle{ n }[/math] is in [math]\displaystyle{ S }[/math].
This can be formalized by the first-order arithmetical formula presented above. The members of [math]\displaystyle{ S }[/math] are the numbers [math]\displaystyle{ n }[/math] satisfying the following formula:
[math]\displaystyle{ \exists n_1:\varphi(n,n_1) }[/math]
This formula is in [math]\displaystyle{ \Sigma^0_1 }[/math]. Therefore, [math]\displaystyle{ S }[/math] is in [math]\displaystyle{ \Sigma^0_1 }[/math]. Thus every recursively enumerable set is in [math]\displaystyle{ \Sigma^0_1 }[/math].
The converse is true as well: for every formula [math]\displaystyle{ \varphi(n) }[/math] in [math]\displaystyle{ \Sigma^0_1 }[/math] with [math]\displaystyle{ k }[/math] existential quantifiers, we may enumerate the [math]\displaystyle{ k }[/math]–tuples of natural numbers and run a Turing machine that goes through all of them until it finds a tuple satisfying the formula (the part of the formula with bounded quantifiers can be checked mechanically for each tuple). This Turing machine halts on precisely the set of natural numbers satisfying [math]\displaystyle{ \varphi(n) }[/math], and thus enumerates its corresponding set.
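The search described here can be sketched as follows. The names `tuples` and `search`, and the cut-off parameter `max_tuples`, are hypothetical and exist only for illustration; `rho` stands for any decidable check of the bounded-quantifier part of the formula.

```python
from itertools import count, product
from typing import Callable, Iterator, Optional, Tuple


def tuples(k: int) -> Iterator[Tuple[int, ...]]:
    """Enumerate every k-tuple of natural numbers exactly once."""
    for bound in count():
        for t in product(range(bound + 1), repeat=k):
            if max(t, default=0) == bound:   # only tuples new at this bound
                yield t


def search(rho: Callable[..., bool], n: int, k: int,
           max_tuples: Optional[int] = None) -> Optional[Tuple[int, ...]]:
    """Look for a witness (n_1, ..., n_k) such that rho(n_1, ..., n_k, n) holds.

    With max_tuples=None this is a semi-decision procedure: it returns a
    witness if one exists and runs forever otherwise.
    """
    for i, t in enumerate(tuples(k)):
        if max_tuples is not None and i >= max_tuples:
            return None                      # gave up; says nothing about membership
        if rho(*t, n):
            return t
    return None
```

For instance, with `rho = lambda a, b, n: a * a + b * b == n`, the call `search(rho, 25, 2)` returns a pair of numbers whose squares sum to 25, whereas `search(rho, 3, 2)` without a cut-off would never return, which is exactly the behaviour of a machine that halts only on members of the set.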
Oracle machines
Similarly, the operation of an oracle machine [math]\displaystyle{ T }[/math] with an oracle [math]\displaystyle{ O }[/math] that halts after at most [math]\displaystyle{ n_1 }[/math] steps on input [math]\displaystyle{ n }[/math] can be described by a first-order formula [math]\displaystyle{ \varphi_O(n,n_1) }[/math], except that its bounded-quantifier part [math]\displaystyle{ {\varphi_O}_1(n,n_1) }[/math] now includes:
- A new predicate, [math]\displaystyle{ O_m }[/math], giving the oracle answer. This predicate must satisfy some formula to be discussed below.
- An additional tape, the oracle tape, on which [math]\displaystyle{ T }[/math] has to write the number m for every call O(m) to the oracle; writing on this tape can be logically formalized in a similar manner to writing on the machine's tape. Note that an oracle machine that halts after at most [math]\displaystyle{ n_1 }[/math] steps has time to write at most [math]\displaystyle{ n_1 }[/math] digits on the oracle tape. So the oracle can only be called with numbers m satisfying [math]\displaystyle{ m\lt 2^{n_1} }[/math].
If the oracle is for a decision problem, [math]\displaystyle{ O_m }[/math] is always "Yes" or "No", which we may formalize as 1 or 0, respectively. Suppose the decision problem itself can be formalized by a first-order arithmetic formula [math]\displaystyle{ \psi^O(m) }[/math]. Then [math]\displaystyle{ T }[/math] halts on [math]\displaystyle{ n }[/math] after at most [math]\displaystyle{ n_1 }[/math] steps if and only if the following formula is satisfied: [math]\displaystyle{ \varphi_O(n,n_1) =\forall m\lt 2^{n_1}:((\psi^O(m)\rightarrow (O_m=1)) \land(\lnot\psi^O(m)\rightarrow (O_m=0))) \land {\varphi_O}_1(n,n_1) }[/math]
where [math]\displaystyle{ {\varphi_O}_1(n,n_1) }[/math] is a first-order formula with no unbounded quantifiers.
Turing jump
If O is an oracle to the halting problem of a machine [math]\displaystyle{ T' }[/math], then [math]\displaystyle{ \psi^O(m) }[/math] is the same as "there exists [math]\displaystyle{ m_1 }[/math] such that [math]\displaystyle{ T' }[/math] starting with input m is at the halting state after [math]\displaystyle{ m_1 }[/math] steps". Thus: [math]\displaystyle{ \psi^O(m) = \exists m_1: \psi_H(m,m_1) }[/math] where [math]\displaystyle{ \psi_H(m,m_1) }[/math] is a first-order formula that formalizes [math]\displaystyle{ T' }[/math]. If [math]\displaystyle{ T' }[/math] is a Turing machine (with no oracle), [math]\displaystyle{ \psi_H(m,m_1) }[/math] is in [math]\displaystyle{ \Sigma^0_0 = \Pi^0_0 }[/math] (i.e. it has no unbounded quantifiers).
Since there is a finite number of numbers m satisfying [math]\displaystyle{ m\lt 2^{n_1} }[/math], we may choose the same number of steps for all of them: there is a number [math]\displaystyle{ m_1 }[/math], such that [math]\displaystyle{ T' }[/math] halts after [math]\displaystyle{ m_1 }[/math] steps precisely on those inputs [math]\displaystyle{ m\lt 2^{n_1} }[/math] for which it halts at all.
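The existence of such a common bound is just the fact that a finite set of numbers has a maximum. The following sketch illustrates this with a toy table; `halting_time` is a hypothetical function returning the number of steps after which the machine halts on m, or `None` if it never halts. No such function is computable for real machines, so the code illustrates only why the bound exists, not how to find it.

```python
from typing import Callable, Optional


def uniform_bound(halting_time: Callable[[int], Optional[int]], n1: int) -> int:
    """Return m_1 such that every m < 2**n1 that halts at all halts within m_1 steps."""
    times = [halting_time(m) for m in range(2 ** n1)]
    return max((t for t in times if t is not None), default=0)


def toy_halting_time(m: int) -> Optional[int]:
    """Toy stand-in (assumption): multiples of 3 never halt, others halt after m*m steps."""
    return None if m % 3 == 0 else m * m
```

With `n1 = 3` the queries are 0 through 7, the halting ones take 1, 4, 16, 25 and 49 steps, and `uniform_bound(toy_halting_time, 3)` is 49. Any query below 8 that has not halted after 49 steps never halts.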
Moving to prenex normal form, we get that the oracle machine halts on input [math]\displaystyle{ n }[/math] if and only if the following formula is satisfied: [math]\displaystyle{ \varphi(n) =\exists n_1\exists m_1 \forall m_2 : \Big(\forall m\lt 2^{n_1}: \big((\psi_H(m,m_2)\rightarrow (O_m=1)) \land(\lnot\psi_H(m,m_1)\rightarrow (O_m=0))\big)\Big) \land {\varphi_O}_1(n,n_1) }[/math]
(Informally, there is a "maximal number of steps" [math]\displaystyle{ m_1 }[/math] such that every queried input on which [math]\displaystyle{ T' }[/math] has not halted within the first [math]\displaystyle{ m_1 }[/math] steps never halts at all, so its answer is 0; and for every [math]\displaystyle{ m_2 }[/math], every queried input on which [math]\displaystyle{ T' }[/math] has halted after [math]\displaystyle{ m_2 }[/math] steps has answer 1.)
Note that we may replace both [math]\displaystyle{ n_1 }[/math] and [math]\displaystyle{ m_1 }[/math] by a single number (their maximum) without changing the truth value of [math]\displaystyle{ \varphi(n) }[/math]. Thus we may write: [math]\displaystyle{ \varphi(n) =\exists n_1 \forall m_2 : \Big(\forall m\lt 2^{n_1}: \big((\psi_H(m,m_2)\rightarrow (O_m=1)) \land(\lnot\psi_H(m,n_1)\rightarrow (O_m=0))\big)\Big) \land {\varphi_O}_1(n,n_1) }[/math]
For the oracle to the halting problem over Turing machines, [math]\displaystyle{ \psi_H(m,m_1) }[/math] is in [math]\displaystyle{ \Pi^0_0 }[/math] and [math]\displaystyle{ \varphi(n) }[/math] is in [math]\displaystyle{ \Sigma^0_2 }[/math]. Thus every set that is recursively enumerable by an oracle machine with an oracle for [math]\displaystyle{ \emptyset ^{(1)} }[/math] is in [math]\displaystyle{ \Sigma^0_2 }[/math].
The converse is true as well: suppose [math]\displaystyle{ \varphi(n) }[/math] is a formula in [math]\displaystyle{ \Sigma^0_2 }[/math] with [math]\displaystyle{ k_1 }[/math] existential quantifiers followed by [math]\displaystyle{ k_2 }[/math] universal quantifiers. Equivalently, [math]\displaystyle{ \varphi(n) }[/math] has [math]\displaystyle{ k_1 }[/math] existential quantifiers followed by a negation of a formula in [math]\displaystyle{ \Sigma^0_1 }[/math]; the set defined by the latter formula can be enumerated by a Turing machine, so membership in it can be decided by an oracle for [math]\displaystyle{ \emptyset ^{(1)} }[/math].
We may thus enumerate the [math]\displaystyle{ k_1 }[/math]–tuples of natural numbers and run an oracle machine with an oracle for [math]\displaystyle{ \emptyset ^{(1)} }[/math] that goes through all of them until it finds a tuple satisfying the formula. This oracle machine halts on precisely the set of natural numbers satisfying [math]\displaystyle{ \varphi(n) }[/math], and thus enumerates its corresponding set.
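The structure of this oracle search can be sketched as below, for a single existential variable (several variables can be handled by enumerating tuples instead). Everything named here is hypothetical: `pi1_oracle(a, n)` stands for one oracle query that decides the universally quantified part for the candidate witness `a`. A real oracle for the jump is not computable, so the demonstration uses a stand-in whose universal quantifier happens to be checkable on a finite range.

```python
from itertools import count
from typing import Callable, Optional


def sigma2_search(pi1_oracle: Callable[[int, int], bool], n: int,
                  max_witness: Optional[int] = None) -> Optional[int]:
    """Search for a with 'for all b: rho(a, b, n)', asking the oracle once per candidate."""
    for a in count():
        if max_witness is not None and a > max_witness:
            return None                  # gave up; says nothing about membership
        if pi1_oracle(a, n):             # one oracle call settles the universal part
            return a
    return None


def floor_sqrt_oracle(a: int, n: int) -> bool:
    """Stand-in oracle (assumption): decides 'a*a <= n and for all b: b*b <= n -> b <= a'.

    Values b > n + 1 satisfy the implication trivially, so a finite check suffices.
    """
    return a * a <= n and all(b * b > n or b <= a for b in range(n + 2))
```

Here `sigma2_search(floor_sqrt_oracle, 10)` returns 3. The point of the sketch is the control flow: the machine itself only performs the unbounded existential search, and each universal statement is delegated to the oracle.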
Higher Turing jumps
More generally, suppose every set that is recursively enumerable by an oracle machine with an oracle for [math]\displaystyle{ \emptyset ^{(p)} }[/math] is in [math]\displaystyle{ \Sigma^0_{p+1} }[/math]. Then for an oracle machine with an oracle for [math]\displaystyle{ \emptyset ^{(p+1)} }[/math], [math]\displaystyle{ \psi^O(m) = \exists m_1: \psi_H(m,m_1) }[/math] is in [math]\displaystyle{ \Sigma^0_{p+1} }[/math].
Since [math]\displaystyle{ \psi^O(m) }[/math] is the same as [math]\displaystyle{ \varphi(n) }[/math] for the previous Turing jump, it can be constructed (as we have just done with [math]\displaystyle{ \varphi(n) }[/math] above) so that [math]\displaystyle{ \psi_H(m,m_1) }[/math] is in [math]\displaystyle{ \Pi^0_p }[/math]. After moving to prenex normal form, the new [math]\displaystyle{ \varphi(n) }[/math] is in [math]\displaystyle{ \Sigma^0_{p+2} }[/math].
By induction, every set that is recursively enumerable by an oracle machine with an oracle for [math]\displaystyle{ \emptyset ^{(p)} }[/math] is in [math]\displaystyle{ \Sigma^0_{p+1} }[/math].
The other direction can be proven by induction as well: suppose every set defined by a formula in [math]\displaystyle{ \Sigma^0_{p+1} }[/math] can be enumerated by an oracle machine with an oracle for [math]\displaystyle{ \emptyset ^{(p)} }[/math].
Now suppose [math]\displaystyle{ \varphi(n) }[/math] is a formula in [math]\displaystyle{ \Sigma^0_{p+2} }[/math] with [math]\displaystyle{ k_1 }[/math] existential quantifiers followed by [math]\displaystyle{ k_2 }[/math] universal quantifiers, and so on. Equivalently, [math]\displaystyle{ \varphi(n) }[/math] has [math]\displaystyle{ k_1 }[/math] existential quantifiers followed by a negation of a formula in [math]\displaystyle{ \Sigma^0_{p+1} }[/math]; the set defined by the latter formula can be enumerated by an oracle machine with an oracle for [math]\displaystyle{ \emptyset ^{(p)} }[/math], so membership in it can be decided by an oracle for [math]\displaystyle{ \emptyset ^{(p+1)} }[/math].
We may thus enumerate the [math]\displaystyle{ k_1 }[/math]–tuples of natural numbers and run an oracle machine with an oracle for [math]\displaystyle{ \emptyset ^{(p+1)} }[/math] that goes through all of them until it finds a tuple satisfying the formula. This oracle machine halts on precisely the set of natural numbers satisfying [math]\displaystyle{ \varphi(n) }[/math], and thus enumerates its corresponding set.
Original source: https://en.wikipedia.org/wiki/Post's theorem.