Attribute grammar

From HandWiki

An attribute grammar is a formal way to supplement a formal grammar with semantic information processing. Semantic information is stored in attributes associated with terminal and nonterminal symbols of the grammar. The values of attributes are result of attribute evaluation rules associated with productions of the grammar. Attributes allow to transfer information from anywhere in the abstract syntax tree to anywhere else, in a controlled and formal way.[1] Each semantic function deals with attributes of symbols occurring only in one production rule: both semantic function parameters and its result are attributes of symbols from one particular rule. When a semantic function defines the value of an attribute of the symbol on the left hand side of the rule, the attribute is called synthesized; otherwise it is called inherited.[2] Thus, synthesized attributes serve to pass semantic information up the parse tree, while inherited attributes allow values to be passed from the parent nodes down and across the syntax tree.

In simple applications, such as evaluation of arithmetic expressions, attribute grammar may be used to describe the entire task to be performed besides parsing in straightforward way; in complicated systems, for instance, when constructing a language translation tool, such as a compiler, it may be used to validate semantic checks associated with a grammar, representing the rules of a language not explicitly imparted by the syntax definition. It may be also used by parsers or compilers to translate the syntax tree directly into code for some specific machine, or into some intermediate language.

History

Attribute grammars were invented by Donald Knuth and Peter Wegner.[3] While Donald Knuth is credited for the overall concept, Peter Wegner invented inherited attributes during a conversation with Knuth. Some embryonic ideas trace back[3] to the work of Edgar T. "Ned" Irons,[4] the author of IMP.

Example

The following is a simple context-free grammar which can describe a language made up of multiplication and addition of integers.

 ExprExpr + Term
 ExprTerm
 TermTerm * Factor
 TermFactor
 Factor → "(" Expr ")"
 Factorinteger

The following attribute grammar can be used to calculate the result of an expression written in the grammar. Note that this grammar only uses synthesized values, and is therefore an S-attributed grammar.

 Expr1Expr2 + Term [ Expr1.value = Expr2.value + Term.value ]
 ExprTerm [ Expr.value = Term.value ]
 Term1Term2 * Factor [ Term1.value = Term2.value * Factor.value ]
 TermFactor [ Term.value = Factor.value ]
 Factor → "(" Expr ")" [ Factor.value =  Expr.value ]
 Factorinteger [ Factor.value = strToInt(integer.str) ]

Synthesized attributes

A synthesized attribute is computed from the values of attributes of the children. Since the values of the children must be computed first, this is an example of bottom-up propagation.[5] To formally define a synthesized attribute, let [math]\displaystyle{ G= \langle V_n,V_t,P,S \rangle }[/math] be a formal grammar, where

  • [math]\displaystyle{ V_n }[/math] is the set of non terminal symbols
  • [math]\displaystyle{ V_t }[/math] is the set of terminal symbols
  • [math]\displaystyle{ P }[/math] is the set of productions
  • [math]\displaystyle{ S }[/math] is the distinguished, or start, symbol

Then, given a string of nonterminal symbols [math]\displaystyle{ A }[/math] and an attribute name [math]\displaystyle{ a }[/math], [math]\displaystyle{ A.a }[/math] is a synthesized attribute if all three of these conditions are met:

  • [math]\displaystyle{ A \rightarrow \alpha \in P }[/math] (i.e. [math]\displaystyle{ A \rightarrow \alpha }[/math] is one of the rules in the grammar)
  • [math]\displaystyle{ \alpha = \alpha_1 \ldots \alpha_n, \forall i, 1 \leq i \leq n: \alpha_i \in (V_n \cup V_t) }[/math] (i.e. every symbol in the body of the rule is either nonterminal or terminal)
  • [math]\displaystyle{ A.a = f(\alpha_{j_1}.a_1, \ldots ,\alpha_{j_m}.a_m) }[/math], where [math]\displaystyle{ \{\alpha_{j_1}, \ldots ,\alpha_{j_m}\} \subseteq \{\alpha_1, \ldots ,\alpha_n\} }[/math] (i.e. the value of the attribute is a function [math]\displaystyle{ f }[/math] applied to some values from the symbols in the body of the rule)

Inherited attributes

An inherited attribute at a node in parse tree is defined using the attribute values at the parent or siblings. Inherited attributes are convenient for expressing the dependence of a programming language construct on the context in which it appears. For example, we can use an inherited attribute to keep track of whether an identifier appears on the left or the right side of an assignment in order to decide whether the address or the value of the identifier is needed. In contrast to synthesized attributes, inherited attributes can take values from parent and/or siblings. As in the following production,

S → ABC

where A can get values from S, B, and C. B can take values from S, A, and C. Likewise, C can take values from S, A, and B.

Special types of attribute grammars

  • L-attributed grammar: inherited attributes can be evaluated in one left-to-right traversal of the abstract syntax tree
  • LR-attributed grammar: an L-attributed grammar whose inherited attributes can also be evaluated in bottom-up parsing.
  • ECLR-attributed grammar: a subset of LR-attributed grammars where equivalence classes can be used to optimize the evaluation of inherited attributes.
  • S-attributed grammar: a simple type of attribute grammar, using only synthesized attributes, but no inherited attributes

See also

References

  1. Knuth 1968, p. 134.
  2. Knuth 1968, p. 132.
  3. 3.0 3.1 D. E. Knuth: The genesis of attribute grammars. Proceedings of the international conference on Attribute grammars and their applications (1990), LNCS, vol. 461, 1–12.
  4. "Main". http://zzcad.com/ned.htm. 
  5. Knuth 1968, p. 130.

External links