Dependent type

From HandWiki
Short description: Type whose definition depends on a value

In computer science and logic, a dependent type is a type whose definition depends on a value. It is an overlapping feature of type theory and type systems. In intuitionistic type theory, dependent types are used to encode logic's quantifiers like "for all" and "there exists". In functional programming languages like Agda, ATS, Coq, F*, Epigram, and Idris, dependent types help reduce bugs by enabling the programmer to assign types that further restrain the set of possible implementations.

Two common examples of dependent types are dependent functions and dependent pairs. The return type of a dependent function may depend on the value (not just type) of one of its arguments. For instance, a function that takes a positive integer [math]\displaystyle{ n }[/math] may return an array of length [math]\displaystyle{ n }[/math], where the array length is part of the type of the array. (Note that this is different from polymorphism and generic programming, both of which include the type as an argument.) A dependent pair may have a second value the type of which depends on the first value. Sticking with the array example, a dependent pair may be used to pair an array with its length in a type-safe way.

Dependent types add complexity to a type system. Deciding the equality of dependent types in a program may require computations. If arbitrary values are allowed in dependent types, then deciding type equality may involve deciding whether two arbitrary programs produce the same result; hence the decidability of type checking may depend on the given type theory's semantics of equality, that is, whether the type theory is intensional or extensional.[1]

History

In 1934, Haskell Curry noticed that the types used in typed lambda calculus, and in its combinatory logic counterpart, followed the same pattern as axioms in propositional logic. Going further, for every proof in the logic, there was a matching function (term) in the programming language. One of Curry's examples was the correspondence between simply typed lambda calculus and intuitionistic logic.[2]

Predicate logic is an extension of propositional logic, adding quantifiers. Howard and de Bruijn extended lambda calculus to match this more powerful logic by creating types for dependent functions, which correspond to "for all", and dependent pairs, which correspond to "there exists".[3]

(Because of this and other work by Howard, propositions-as-types is known as the Curry–Howard correspondence.)

Formal definition

Loosely speaking, dependent types are similar to the type of an indexed family of sets. More formally, given a type [math]\displaystyle{ A:\mathcal{U} }[/math] in a universe of types [math]\displaystyle{ \mathcal{U} }[/math], one may have a family of types [math]\displaystyle{ B:A\to\mathcal{U} }[/math], which assigns to each term [math]\displaystyle{ a:A }[/math] a type [math]\displaystyle{ B(a):\mathcal{U} }[/math]. We say that the type B(a) varies with a.

Π type

A function whose type of return value varies with its argument (i.e. there is no fixed codomain) is a dependent function and the type of this function is called dependent product type, pi-type (Π type) or dependent function type.[4] From a family of types [math]\displaystyle{ B: A \to \mathcal{U} }[/math] we may construct the type of dependent functions [math]\displaystyle{ \prod_{x : A} B(x) }[/math], whose terms are functions that take a term [math]\displaystyle{ a : A }[/math] and return a term in [math]\displaystyle{ B(a) }[/math]. For this example, the dependent function type is typically written as [math]\displaystyle{ \prod_{x:A} B(x), }[/math] [math]\displaystyle{ \prod_{x:A} B(x), }[/math] or [math]\displaystyle{ \prod {(x:A)} B(x) }[/math].

If [math]\displaystyle{ B:A\to\mathcal{U} }[/math] is a constant function, the corresponding dependent product type is equivalent to an ordinary function type. That is, [math]\displaystyle{ \prod_{x:A}B }[/math] is judgmentally equal to [math]\displaystyle{ A\to B }[/math] when B does not depend on x.

The name 'Π-type' comes from the idea that these may be viewed as a Cartesian product of types. Π-types can also be understood as models of universal quantifiers.

For example, if we write [math]\displaystyle{ \operatorname{Vec}(\mathbb{R},n) }[/math] for n-tuples of real numbers, then [math]\displaystyle{ \prod_{n:\mathbb{N}} \operatorname{Vec}(\mathbb{R},n) }[/math] would be the type of a function which, given a natural number n, returns a tuple of real numbers of size n. The usual function space arises as a special case when the range type does not actually depend on the input. E.g. [math]\displaystyle{ \prod_{n:\mathbb{N}} {\mathbb{R}} }[/math] is the type of functions from natural numbers to the real numbers, which is written as [math]\displaystyle{ \mathbb{N}\to\mathbb{R} }[/math] in typed lambda calculus.

For a more concrete example, taking A to be the type of unsigned integers from 0 to 255 (the ones that fit into 8 bits or 1 byte) and B(a) = Xa for a : A, then [math]\displaystyle{ \prod_{x:A} B(x) }[/math] devolves into the product of X0 × X1 × X2 × ... × X253 × X254 × X255.

Σ type

The dual of the dependent product type is the dependent pair type, dependent sum type, sigma-type, or (confusingly) dependent product type.[4] Sigma-types can also be understood as existential quantifiers. Continuing the above example, if, in the universe of types [math]\displaystyle{ \mathcal{U} }[/math], there is a type [math]\displaystyle{ A:\mathcal{U} }[/math] and a family of types [math]\displaystyle{ B:A\to\mathcal{U} }[/math], then there is a dependent pair type [math]\displaystyle{ \sum_{x:A} B(x) }[/math]. (The alternative notations are similar to that of Π types.)

The dependent pair type captures the idea of an ordered pair where the type of the second term is dependent on the value of the first. If [math]\displaystyle{ (a,b):\sum_{x:A} B(x), }[/math] then [math]\displaystyle{ a:A }[/math] and [math]\displaystyle{ b:B(a) }[/math]. If B is a constant function, then the dependent pair type becomes (is judgementally equal to) the product type, that is, an ordinary Cartesian product [math]\displaystyle{ A\times B }[/math].[full citation needed]

For a more concrete example, taking A to again be type of unsigned integers from 0 to 255, and B(a) to again be equal to Xa for 256 more arbitrary Xa, then [math]\displaystyle{ \sum_{x:A} B(x) }[/math] devolves into the sum X0 + X1 + X2 + ... + X253 + X254 + X255.

Example as existential quantification

Let [math]\displaystyle{ A:\mathcal{U} }[/math] be some type, and let [math]\displaystyle{ B:A\to\mathcal{U} }[/math]. By the Curry–Howard correspondence, B can be interpreted as a logical predicate on terms of A. For a given [math]\displaystyle{ a:A }[/math], whether the type B(a) is inhabited indicates whether a satisfies this predicate. The correspondence can be extended to existential quantification and dependent pairs: the proposition [math]\displaystyle{ \exists{a}{\in}A\,B(a) }[/math] is true if and only if the type [math]\displaystyle{ \sum_{a:A}B(a) }[/math] is inhabited.

For example, [math]\displaystyle{ m:\mathbb{N} }[/math] is less than or equal to [math]\displaystyle{ n:\mathbb{N} }[/math] if and only if there exists another natural number [math]\displaystyle{ k:\mathbb{N} }[/math] such that m + k = n. In logic, this statement is codified by existential quantification:

[math]\displaystyle{ m\le n \iff \exists{k}{\in}\mathbb{N}\,m+k=n. }[/math]

This proposition corresponds to the dependent pair type:

[math]\displaystyle{ \sum_{k:\mathbb{N}} m+k=n. }[/math]

That is, a proof of the statement that m is less than or equal to n is a pair that contains both a non-negative number k, which is the difference between m and n, and a proof of the equality m + k = n.

Systems of the lambda cube

Henk Barendregt developed the lambda cube as a means of classifying type systems along three axes. The eight corners of the resulting cube-shaped diagram each correspond to a type system, with simply typed lambda calculus in the least expressive corner, and calculus of constructions in the most expressive. The three axes of the cube correspond to three different augmentations of the simply typed lambda calculus: the addition of dependent types, the addition of polymorphism, and the addition of higher kinded type constructors (functions from types to types, for example). The lambda cube is generalized further by pure type systems.

First order dependent type theory

The system [math]\displaystyle{ \lambda \Pi }[/math] of pure first order dependent types, corresponding to the logical framework LF, is obtained by generalising the function space type of the simply typed lambda calculus to the dependent product type.

Second order dependent type theory

The system [math]\displaystyle{ \lambda \Pi 2 }[/math] of second order dependent types is obtained from [math]\displaystyle{ \lambda \Pi }[/math] by allowing quantification over type constructors. In this theory the dependent product operator subsumes both the [math]\displaystyle{ \to }[/math] operator of simply typed lambda calculus and the [math]\displaystyle{ \forall }[/math] binder of System F.

Higher order dependently typed polymorphic lambda calculus

The higher order system [math]\displaystyle{ \lambda \Pi \omega }[/math] extends [math]\displaystyle{ \lambda \Pi 2 }[/math] to all four forms of abstraction from the lambda cube: functions from terms to terms, types to types, terms to types and types to terms. The system corresponds to the calculus of constructions whose derivative, the calculus of inductive constructions is the underlying system of the Coq proof assistant.

Simultaneous programming language and logic

The Curry–Howard correspondence implies that types can be constructed that express arbitrarily complex mathematical properties. If the user can supply a constructive proof that a type is inhabited (i.e., that a value of that type exists) then a compiler can check the proof and convert it into executable computer code that computes the value by carrying out the construction. The proof checking feature makes dependently typed languages closely related to proof assistants. The code-generation aspect provides a powerful approach to formal program verification and proof-carrying code, since the code is derived directly from a mechanically verified mathematical proof.

Comparison of languages with dependent types

Language Actively developed Paradigm[lower-alpha 1] Tactics Proof terms Termination checking Types can depend on[lower-alpha 2] Universes Proof irrelevance Program extraction Extraction erases irrelevant terms
Ada 2012 Yes[5] Imperative Yes[6] No ? Any term[lower-alpha 3] ? ? Ada ?
Agda Yes[7] Purely functional Few/limited[lower-alpha 4] Yes Yes (optional) Any term Yes (optional)[lower-alpha 5] Proof-irrelevant arguments[9] Proof-irrelevant propositions[10] Haskell, JavaScript Yes[9]
ATS Yes[11] Functional / imperative No[12] Yes Yes Static terms[13] ? Yes Yes Yes
Cayenne No Purely functional No Yes No Any term No No ? ?
Gallina
(Coq)
Yes[14] Purely functional Yes Yes Yes Any term Yes[lower-alpha 6] Yes[15] Haskell, Scheme and OCaml Yes
Dependent ML No[lower-alpha 7] ? ? Yes ? Natural numbers ? ? ? ?
F* Yes[16] Functional and imperative Yes[17] Yes Yes (optional) Any pure term Yes Yes OCaml, F#, and C Yes
Guru No[18] Purely functional[19] hypjoin[20] Yes[19] Yes Any term No Yes Carraway Yes
Idris Yes[21] Purely functional[22] Yes[23] Yes Yes (optional) Any term Yes No Yes Yes[23]
Lean Yes Purely functional Yes Yes Yes Any term Yes Yes Yes Yes
Matita Yes[24] Purely functional Yes Yes Yes Any term Yes Yes OCaml Yes
NuPRL Yes Purely functional Yes Yes Yes Any term Yes ? Yes ?
PVS Yes ? Yes ? ? ? ? ? ? ?
Sage No[lower-alpha 8] Purely functional No No No ? No ? ? ?
Twelf Yes Logic programming ? Yes Yes (optional) Any (LF) term No No ? ?
  1. This refers to the core language, not to any tactic (theorem proving procedure) or code generation sublanguage.
  2. Subject to semantic constraints, such as universe constraints
  3. Static_Predicate for restricted terms, Dynamic_Predicate for Assert-like checking of any term in type cast
  4. Ring solver[8]
  5. Optional universes, optional universe polymorphism, and optional explicitly specified universes
  6. Universes, automatically inferred universe constraints (not the same as Agda's universe polymorphism) and optional explicit printing of universe constraints
  7. Has been superseded by ATS
  8. Last Sage paper and last code snapshot are both dated 2006

See also

References

  1. Hofmann, Martin (1995), Extensional concepts in intensional type theory, https://ncatlab.org/nlab/files/HofmannExtensionalIntensionalTypeTheory.pdf 
  2. Sørensen, Morten Heine B.; Urzyczyn, Pawel (1998), Lectures on the Curry-Howard Isomorphism 
  3. Bove, Ana; Dybjer, Peter (2008), Dependent Types at Work, Chalmers University of Technology, http://www.cse.chalmers.se/~peterd/papers/DependentTypesAtWork.pdf 
  4. 4.0 4.1 "ΠΣ: Dependent Types without the Sugar". http://www.cs.nott.ac.uk/~psztxa/publ/pisigma-new.pdf. 
  5. "GNAT Community download page". https://www.adacore.com/download/. 
  6. "§3.2.4 Subtype Predicates". Ada Reference Manual. http://www.ada-auth.org/standards/12rm/html/RM-3-2-4.html. 
  7. "Agda download page". http://wiki.portal.chalmers.se/agda/pmwiki.php?n=Main.Download. 
  8. "Agda Ring Solver". http://www.cs.nott.ac.uk/~nad/listings/lib/Algebra.RingSolver.html. 
  9. 9.0 9.1 "Announce: Agda 2.2.8". http://permalink.gmane.org/gmane.comp.lang.agda/2051. 
  10. "Agda 2.6.0 changelog". http://hackage.haskell.org/package/Agda-2.6.0/changelog. 
  11. "ATS2 downloads". http://sourceforge.net/projects/ats2-lang/files/. 
  12. "email from ATS inventor Hongwei Xi". http://sourceforge.net/mailarchive/message.php?msg_id=27050673. 
  13. Xi, Hongwei (March 2017). "Applied Type System: An Approach to Practical Programming with Theorem-Proving". http://www.ats-lang.org/MYDATA/ATSfoundation.pdf. 
  14. "Coq CHANGES in Subversion repository". https://gforge.inria.fr/scm/viewvc.php/trunk/CHANGES?root=coq&view=log. 
  15. "Introduction of SProp in Coq 8.10". https://coq.github.io/doc/master/refman/changes.html#version-8-10. 
  16. "F* changes on GitHub". https://github.com/FStarLang/FStar/commits/master. 
  17. "F* v0.9.5.0 release notes on GitHub". https://github.com/FStarLang/FStar/releases/tag/v0.9.5.0. 
  18. "Guru SVN". https://code.google.com/p/guru-lang/source/list. 
  19. 19.0 19.1 Aaron Stump (6 April 2009). "Verified Programming in Guru". http://guru-lang.googlecode.com/svn/branches/1.0/doc/book.pdf. 
  20. Petcher, Adam (May 2008). Deciding Joinability Modulo Ground Equations in Operational Type Theory (PDF) (MSc). Washington University. Retrieved 14 October 2010.
  21. "Idris git repository". 17 May 2022. https://github.com/idris-lang/Idris-dev/. 
  22. Brady, Edwin. "Idris, a language with dependent types — extended abstract". https://eb.host.cs.st-andrews.ac.uk/drafts/ifl08.pdf. 
  23. 23.0 23.1 Brady, Edwin. "How does Idris compare to other dependently-typed programming languages?". http://www.quora.com/How-does-Idris-compare-to-other-dependently-typed-programming-languages. 
  24. "Matita SVN". http://helm.cs.unibo.it/websvn/listing.php?repname=helm&path=%2F&sc=0. 

Further reading

External links