Tuple-generating dependency

In relational database theory, a tuple-generating dependency (TGD) is a certain kind of constraint on a relational database. TGDs form a subclass of the embedded dependencies (EDs). An algorithm known as the chase takes as input an instance that may or may not satisfy a set of TGDs (or, more generally, EDs) and, if it terminates, outputs an instance that does satisfy them. Whether the chase terminates on a given input is undecidable in general.
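The following Python sketch illustrates one simple variant of the chase, often called the restricted chase. It is a minimal illustration rather than a reference implementation. The encoding is an assumption made for this example: a fact is a pair of a relation name and a tuple of values, variables are strings beginning with "?", and fresh values (labelled nulls) are named "_n0", "_n1", and so on. Because termination cannot be decided in advance, the hypothetical parameter max_rounds bounds the number of passes.

```python
from itertools import count

def is_var(term):
    # Assumed convention: variables are strings starting with "?".
    return isinstance(term, str) and term.startswith("?")

def matches(atoms, instance, subst=None):
    """Yield each extension of `subst` that maps every atom into `instance`."""
    subst = dict(subst or {})
    if not atoms:
        yield subst
        return
    (rel, terms), rest = atoms[0], atoms[1:]
    for fact_rel, fact_args in instance:
        if fact_rel != rel or len(fact_args) != len(terms):
            continue
        candidate = dict(subst)
        if all(
            candidate.setdefault(t, v) == v if is_var(t) else t == v
            for t, v in zip(terms, fact_args)
        ):
            yield from matches(rest, instance, candidate)

def chase(instance, tgds, max_rounds=1000):
    """Apply TGDs given as (body, head) pairs until none is violated."""
    instance = set(instance)
    fresh = count()
    for _ in range(max_rounds):
        fired = False
        for body, head in tgds:
            # Materialise the body matches before the instance is modified.
            for subst in list(matches(body, instance)):
                if next(matches(head, instance, subst), None) is not None:
                    continue  # the head already holds for this match
                subst = dict(subst)
                for _, terms in head:
                    for t in terms:
                        if is_var(t) and t not in subst:
                            # Existential variable: invent a labelled null.
                            subst[t] = f"_n{next(fresh)}"
                for rel, terms in head:
                    instance.add(
                        (rel, tuple(subst[t] if is_var(t) else t for t in terms))
                    )
                fired = True
        if not fired:
            return instance
    raise RuntimeError("round bound reached; the chase may not terminate")

# Hypothetical example: every employee works in some department,
# and everything that is worked in is a department.
facts = {("Emp", ("alice",)), ("Emp", ("bob",)), ("WorksIn", ("bob", "sales"))}
tgds = [
    ([("Emp", ("?e",))], [("WorksIn", ("?e", "?d"))]),
    ([("WorksIn", ("?e", "?d"))], [("Dept", ("?d",))]),
]
result = chase(facts, tgds)
```

In this example the chase adds a fact placing alice in an unknown department, represented by a labelled null, and then records that null and sales as departments. A rule such as R(x, y) → ∃z. R(y, z) applied to a single R fact keeps generating new nulls, so the sketch stops with an error once the round bound is reached.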

Definition

A tuple-generating dependency is a sentence in first-order logic of the form:[1]

[math]\displaystyle{ \forall x_1, \ldots, x_n . \phi(x_1, \ldots, x_n) \rightarrow \exists y_1, \ldots, y_m . \psi(x_1, \ldots, x_n, y_1, \ldots, y_m) }[/math]

where [math]\displaystyle{ \phi }[/math] is a possibly empty conjunction of relational atoms and [math]\displaystyle{ \psi }[/math] is a non-empty conjunction of relational atoms. A relational atom has the form [math]\displaystyle{ R(w_1, \ldots, w_h) }[/math], where each of the terms [math]\displaystyle{ w_1, \ldots, w_h }[/math] is a variable or a constant.
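
As an illustration, using the hypothetical relation names Emp, WorksIn and Dept (chosen for this example and not taken from the cited sources), the constraint "every employee works in some department" can be written as the TGD:

[math]\displaystyle{ \forall e . \mathrm{Emp}(e) \rightarrow \exists d . \mathrm{WorksIn}(e, d) \wedge \mathrm{Dept}(d) }[/math]

Here [math]\displaystyle{ \phi }[/math] is the single atom [math]\displaystyle{ \mathrm{Emp}(e) }[/math] and [math]\displaystyle{ \psi }[/math] is a conjunction of two atoms. An instance containing only the fact Emp(alice) violates this dependency, whereas an instance that also contains WorksIn(alice, sales) and Dept(sales) satisfies it.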

Fragments

Several fragments of TGDs have been defined. For instance, full TGDs are TGDs that do not use the existential quantifier, so every variable of the head also occurs in the body. Full TGDs can equivalently be seen as programs in the Datalog query language.
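
The correspondence with Datalog can be sketched with a small Python example. It hard-codes two hypothetical full TGDs over assumed relations Edge and Path, namely Edge(x, y) → Path(x, y) and Path(x, y) ∧ Edge(y, z) → Path(x, z), and applies them repeatedly until no new fact is derived. This is a naive bottom-up evaluation written for illustration only.

```python
def transitive_closure(edges):
    """Apply the two full TGDs until a fixpoint is reached."""
    edges = set(edges)
    path = set()
    while True:
        derived = set(edges)  # Edge(x, y) -> Path(x, y)
        derived |= {
            (x, z)            # Path(x, y) and Edge(y, z) -> Path(x, z)
            for (x, y) in path
            for (y2, z) in edges
            if y == y2
        }
        if derived <= path:
            return path       # nothing new: every rule is satisfied
        path |= derived

paths = transitive_closure({("a", "b"), ("b", "c"), ("c", "d")})
```

Since full TGDs never introduce new values, only finitely many facts can be derived from a finite instance, so this loop always terminates.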

There are also some fragments of TGDs that can be expressed in guarded logic, in particular:[2][3]

  • in frontier-guarded TGDs (FGTGD), all the variables shared by the body and the head of a rule (called frontier variables) must occur together in some atom of the body;
  • guarded TGDs (GTGD) are FGTGDs in which all variables used in the body of a rule occur together in some atom of the body;
  • linear TGDs (LTGD) are GTGDs whose body consists of a single atom;
  • inclusion dependencies (IND) are LTGDs in which each side of the rule consists of a single relational atom.[4]
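
These syntactic conditions can be checked mechanically. The Python sketch below is an illustration under stated assumptions: a rule is given as a pair of atom lists (body, head), an atom is a pair of a relation name and a tuple of terms, variables are strings beginning with "?", and the guarding atom is looked for in the body. The function name classify and the label strings are hypothetical. The inclusion-dependency test follows the simplified description in the list above.

```python
def is_var(term):
    # Assumed convention: variables are strings starting with "?".
    return isinstance(term, str) and term.startswith("?")

def variables(atoms):
    """Return the set of variables occurring in a list of atoms."""
    return {t for _, terms in atoms for t in terms if is_var(t)}

def classify(body, head):
    """Return the names of the fragments that the rule body -> head falls into."""
    body_vars = variables(body)
    frontier = body_vars & variables(head)
    labels = set()
    if variables(head) <= body_vars:
        labels.add("full")  # no existentially quantified variable
    if any(frontier <= variables([atom]) for atom in body):
        labels.add("frontier-guarded")
    if any(body_vars <= variables([atom]) for atom in body):
        labels.add("guarded")
    if len(body) == 1:
        labels.add("linear")
        if len(head) == 1:
            labels.add("inclusion dependency")
    return labels

# R(x, y) and S(y, z) -> exists w. T(y, w): only the frontier {y} is guarded.
example = classify(
    [("R", ("?x", "?y")), ("S", ("?y", "?z"))],
    [("T", ("?y", "?w"))],
)
```

For the rule in the example the result contains only the frontier-guarded label, because no single body atom contains all three body variables.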

In SQL, inclusion dependencies are typically expressed by means of a stronger constraint called a foreign key, which additionally requires the frontier variables to form a candidate key of the table corresponding to the relational atom of [math]\displaystyle{ \psi }[/math].
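
A small sketch using Python's built-in sqlite3 module shows how a foreign key enforces an inclusion dependency. The tables dept and emp and their columns are hypothetical. The constraint corresponds to the dependency stating that every dept_id occurring in emp also occurs in dept, where dept_id is additionally the primary key of dept. SQLite checks foreign keys only when the foreign_keys pragma is enabled on the connection.

```python
import sqlite3

def insert_violates_inclusion():
    """Return True if inserting a dangling reference is rejected."""
    conn = sqlite3.connect(":memory:")
    conn.execute("PRAGMA foreign_keys = ON")  # off by default in SQLite
    conn.execute("CREATE TABLE dept (dept_id INTEGER PRIMARY KEY, name TEXT)")
    conn.execute(
        "CREATE TABLE emp ("
        " emp_id INTEGER PRIMARY KEY,"
        " dept_id INTEGER NOT NULL REFERENCES dept(dept_id))"
    )
    conn.execute("INSERT INTO dept VALUES (1, 'sales')")
    conn.execute("INSERT INTO emp VALUES (10, 1)")  # satisfies the dependency
    try:
        # Department 2 does not exist, so this row would violate it.
        conn.execute("INSERT INTO emp VALUES (11, 2)")
        return False
    except sqlite3.IntegrityError:
        return True
    finally:
        conn.close()

rejected = insert_violates_inclusion()
```

Unlike the chase, which repairs a violation by adding facts, the database system here rejects the offending row.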

References

  1. Fagin, Ronald (2009). "Tuple-Generating Dependencies". In Liu, Ling. Encyclopedia of Database Systems. Springer US. pp. 3201–3202. doi:10.1007/978-0-387-39940-9_1274. ISBN 9780387355443. https://archive.org/details/encyclopediadata00liul_180.
  2. Benedikt, Michael; Bourhis, Pierre; Jachiet, Louis; Thomazo, Michaël (Aug 2019). "Reasoning about Disclosure in Data Integration in the Presence of Source Constraints". IJCAI 2019 - 28th International Joint Conference on Artificial Intelligence. Macao, China. pp. 1551–1557. doi:10.24963/ijcai.2019/215.
  3. Console, Marco; Kolaitis, Phokion G.; Pieris, Andreas (June 2021). "Model-theoretic Characterizations of Rule-based Ontologies". Symposium on Principles of Database Systems. Virtual Event, China. pp. 416–428. doi:10.1145/3452021.3458310.
  4. Kolaitis, Phokion G. "A Tutorial on Database Dependencies". University of California Santa Cruz & IBM Research - Almaden. https://www.knaw.nl/shared/resources/actueel/bestanden/kolaitis.pdf#page=5.

Further reading