Deterministic pushdown automaton

From HandWiki
Revision as of 15:05, 6 February 2024 by AstroAI (talk | contribs) (simplify)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

In automata theory, a deterministic pushdown automaton (DPDA or DPA) is a variation of the pushdown automaton. The class of deterministic pushdown automata accepts the deterministic context-free languages, a proper subset of context-free languages.[1] Machine transitions are based on the current state and input symbol, and also the current topmost symbol of the stack. Symbols lower in the stack are not visible and have no immediate effect. Machine actions include pushing, popping, or replacing the stack top. A deterministic pushdown automaton has at most one legal transition for the same combination of input symbol, state, and top stack symbol. This is where it differs from the nondeterministic pushdown automaton.

Formal definition

A (not necessarily deterministic) PDA [math]\displaystyle{ M }[/math] can be defined as a 7-tuple:

[math]\displaystyle{ M=(Q\,, \Sigma\,, \Gamma\,, q_0\,, Z_0\,, A\,, \delta\,) }[/math]

where

  • [math]\displaystyle{ Q\, }[/math] is a finite set of states
  • [math]\displaystyle{ \Sigma\, }[/math] is a finite set of input symbols
  • [math]\displaystyle{ \Gamma\, }[/math] is a finite set of stack symbols
  • [math]\displaystyle{ q_0\,\in Q\, }[/math] is the start state
  • [math]\displaystyle{ Z_0\,\in\Gamma\, }[/math] is the starting stack symbol
  • [math]\displaystyle{ A\,\subseteq Q\, }[/math], where [math]\displaystyle{ A }[/math] is the set of accepting, or final, states
  • [math]\displaystyle{ \delta\, }[/math] is a transition function, where
[math]\displaystyle{ \delta\colon(Q\, \times ( \Sigma\, \cup \left \{ \varepsilon\, \right \} ) \times \Gamma\,) \longrightarrow \mathcal{P}(Q \times \Gamma ^{*}) }[/math]
where [math]\displaystyle{ * }[/math] is the Kleene star, meaning that [math]\displaystyle{ \Gamma^{*} }[/math] is "the set of all finite strings (including the empty string [math]\displaystyle{ \varepsilon }[/math]) of elements of [math]\displaystyle{ \Gamma }[/math]", [math]\displaystyle{ \varepsilon }[/math] denotes the empty string, and [math]\displaystyle{ \mathcal{P}(X) }[/math] is the power set of a set [math]\displaystyle{ X }[/math].

M is deterministic if it satisfies both the following conditions:

  • For any [math]\displaystyle{ q \in Q, a \in \Sigma \cup \left \{ \varepsilon \right \}, x \in \Gamma }[/math], the set [math]\displaystyle{ \delta(q,a,x)\, }[/math] has at most one element.
  • For any [math]\displaystyle{ q \in Q, x \in \Gamma }[/math], if [math]\displaystyle{ \delta(q, \varepsilon, x) \not= \emptyset\, }[/math], then [math]\displaystyle{ \delta\left( q,a,x \right) = \emptyset }[/math] for every [math]\displaystyle{ a \in \Sigma. }[/math]

There are two possible acceptance criteria: acceptance by empty stack and acceptance by final state. The two are not equivalent for the deterministic pushdown automaton (although they are for the non-deterministic pushdown automaton). The languages accepted by empty stack are those languages that are accepted by final state and are prefix-free: no word in the language is the prefix of another word in the language.[citation needed]

The usual acceptance criterion is final state, and it is this acceptance criterion which is used to define the deterministic context-free languages.

Languages recognized

If [math]\displaystyle{ L(A) }[/math] is a language accepted by a PDA [math]\displaystyle{ A }[/math], it can also be accepted by a DPDA if and only if there is a single computation from the initial configuration until an accepting one for all strings belonging to [math]\displaystyle{ L(A) }[/math]. If [math]\displaystyle{ L(A) }[/math] can be accepted by a PDA it is a context free language and if it can be accepted by a DPDA it is a deterministic context-free language (DCFL).

Not all context-free languages are deterministic. This makes the DPDA a strictly weaker device than the PDA. For example, the language Lp of even-length palindromes on the alphabet of 0 and 1 has the context-free grammar S → 0S0 | 1S1 | ε. If a DPDA for this language exists, and it sees a string 0n, it must use its stack to memorize the length n, in order to be able to distinguish its possible continuations 0n 11 0nLp and 0n 11 0n+2Lp. Hence, after reading 0n 11 0n, comparing the post-"11" length to the pre-"11" length will make the stack empty again. For this reason, the strings 0n 11 0n 0n 11 0nLp and 0n 11 0n 0n+2 11 0n+2Lp cannot be distinguished.[2]

Restricting the DPDA to a single state reduces the class of languages accepted to the LL(1) languages,[3] which is a proper subclass of the DCFL.[4] In the case of a PDA, this restriction has no effect on the class of languages accepted.

Properties

Closure

Closure properties of deterministic context-free languages (accepted by deterministic PDA by final state) are drastically different from the context-free languages. As an example they are (effectively) closed under complementation, but not closed under union. To prove that the complement of a language accepted by a deterministic PDA is also accepted by a deterministic PDA is tricky.[citation needed] In principle one has to avoid infinite computations.

As a consequence of the complementation it is decidable whether a deterministic PDA accepts all words over its input alphabet, by testing its complement for emptiness. This is not possible for context-free grammars (hence not for general PDA).

Equivalence problem

Géraud Sénizergues (1997) proved that the equivalence problem for deterministic PDA (i.e. given two deterministic PDA A and B, is L(A)=L(B)?) is decidable,[5][6][7] a proof that earned him the 2002 Gödel Prize. For nondeterministic PDA, equivalence is undecidable.

Notes

  1. Michael Sipser (1997). Introduction to the Theory of Computation. PWS Publishing. p. 102. ISBN 0-534-94728-X. https://archive.org/details/introductiontoth00sips/page/102. 
  2. Hopcroft, John; Rajeev Motwani; Jeffrey Ullman (2001). Introduction to Automata Theory, Languages, and Computation (2 ed.). Addison-Wesley. pp. 249–253. 
  3. Kurki-Suonio, R. (1969). "Notes on top-down languages". BIT 9 (3): 225–238. doi:10.1007/BF01946814. 
  4. Rosenkrantz, D. J.; Stearns, R. E. (1970). "Properties of Deterministic Top Down Grammars". Information and Control 17 (3): 226–256. doi:10.1016/s0019-9958(70)90446-8.  Here: p.246–247
  5. Sénizergues, Géraud (1997). "The equivalence problem for deterministic pushdown automata is decidable". Proc. Int. Coll. on Automata, Languages, and Programming (ICALP). Lecture Notes in Computer Science. 1256. pp. 671–681. doi:10.1007/3-540-63165-8_221. ISBN 978-3-540-63165-1.  — Full version: Géraud Sénizergues (1997). L(A) = L(B)? (Technical Report 1161-97). http://dept-info.labri.u-bordeaux.fr/~ges/lalb_l.ps. 
  6. Géraud Sénizergues (2001). "Fundamental study: L(A) = L(B)? decidability results from complete formal systems". Theoretical Computer Science 251 (1–2): 1–166. doi:10.1016/S0304-3975(00)00285-1. 
  7. Géraud Sénizergues (2002). "L(A) = L(B)? A simplified decidability proof". Theoretical Computer Science 281 (1–2): 555–608. doi:10.1016/S0304-3975(02)00027-0. 

Further reading