Least fixed point

The function f(x) = x² − 4 has two fixed points, shown as the intersection with the blue line; its least one is at 1/2 − √17/2.

In order theory, a branch of mathematics, the least fixed point (lfp or LFP, sometimes also smallest fixed point) of a function from a partially ordered set to itself is the fixed point which is less than each other fixed point, according to the order of the poset. A function need not have a least fixed point, but if it does then the least fixed point is unique.

Examples

With the usual order on the real numbers, the least fixed point of the real function f(x) = x² is x = 0 (since the only other fixed point is 1 and 0 < 1). In contrast, f(x) = x + 1 has no fixed points at all, so it has no least one, and f(x) = x has infinitely many fixed points (every real number is one), but no least one, since the real numbers have no least element.

Let [math]\displaystyle{ G = (V, A) }[/math] be a directed graph and [math]\displaystyle{ v }[/math] be a vertex. The set of vertices accessible from [math]\displaystyle{ v }[/math] can be defined as the least fixed point of the function [math]\displaystyle{ f: \wp(V) \to \wp(V) }[/math], defined as [math]\displaystyle{ f(X) = \{ v \} \cup \{ x \in V: \text{ for some } w \in X \text{ there is an arc from } w \text{ to } x \} . }[/math] The set of vertices that are co-accessible from [math]\displaystyle{ v }[/math] is defined by a similar least fixed point. The strongly connected component of [math]\displaystyle{ v }[/math] is the intersection of those two least fixed points.
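The definition above can be sketched in Python by iterating f from the empty set until the result stops changing. This is a minimal illustration rather than a reference implementation: it assumes a finite graph given as a set of arcs (pairs), and the names `least_fixed_point`, `accessible_from` and the sample graph are chosen here for the example. Because f is monotone and the power set of V is finite, the iteration reaches the least fixed point after finitely many steps.

```python
def least_fixed_point(f, bottom=frozenset()):
    """Iterate f starting from `bottom` until the value no longer changes."""
    current = bottom
    while True:
        nxt = f(current)
        if nxt == current:
            return current
        current = nxt


def accessible_from(arcs, v):
    """Vertices accessible from v, computed as the least fixed point of f."""
    def f(X):
        # f(X) = {v} ∪ {x : for some w in X there is an arc from w to x}
        return frozenset({v}) | frozenset(x for (w, x) in arcs if w in X)
    return least_fixed_point(f)


# Hypothetical sample graph, used only for illustration.
arcs = {("a", "b"), ("b", "c"), ("c", "a"), ("c", "d"), ("e", "a")}

reach = accessible_from(arcs, "a")
# Co-accessible vertices: the same fixed point on the graph with reversed arcs.
coreach = accessible_from({(x, w) for (w, x) in arcs}, "a")
# Strongly connected component of "a": intersection of the two fixed points.
scc = reach & coreach
```

On this sample graph, `reach` contains a, b, c and d, `coreach` contains a, b, c and e, and their intersection `scc` contains a, b and c.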

Let [math]\displaystyle{ G = (V, \Sigma, R, S_0) }[/math] be a context-free grammar. The set [math]\displaystyle{ E }[/math] of symbols that produce the empty string [math]\displaystyle{ \varepsilon }[/math] can be obtained as the least fixed point of the function [math]\displaystyle{ f: \wp(V) \to \wp(V) }[/math], defined as [math]\displaystyle{ f ( X ) = \{ S \in V: \; S \in X \text{ or } (S \to \varepsilon) \in R \text{ or } (S \to S^1 \dots S^n) \in R \text{ and } S^i \in X \text{, for all } i \} }[/math], where [math]\displaystyle{ \wp(V) }[/math] denotes the power set of [math]\displaystyle{ V }[/math].
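As a rough sketch of how this fixed point might be computed, the following Python code iterates f from the empty set. The rule encoding (a list of pairs of a left-hand side and a tuple of right-hand-side symbols, with the empty tuple standing for ε), the function name `nullable_symbols` and the sample grammar are assumptions made for this example.

```python
def nullable_symbols(rules):
    """Symbols that can produce the empty string.

    `rules` is a list of (lhs, rhs) pairs, where rhs is a tuple of symbols
    and the empty tuple () encodes a rule lhs -> ε.
    """
    E = frozenset()
    while True:
        # f(E): keep E and add every lhs whose whole right-hand side lies in E.
        # all(...) over an empty tuple is True, which covers the rules lhs -> ε.
        nxt = E | frozenset(
            lhs for (lhs, rhs) in rules if all(s in E for s in rhs)
        )
        if nxt == E:
            return E
        E = nxt


# Hypothetical grammar: S -> A B, A -> ε | a, B -> A A | b, C -> c
rules = [
    ("S", ("A", "B")),
    ("A", ()),
    ("A", ("a",)),
    ("B", ("A", "A")),
    ("B", ("b",)),
    ("C", ("c",)),
]
nullable = nullable_symbols(rules)
```

Here the iteration adds A first, then B, then S, and stops; C is never added because its only rule contains a terminal, which can never belong to E.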

Applications

Many fixed-point theorems yield algorithms for locating the least fixed point. Least fixed points often have desirable properties that arbitrary fixed points do not.

Denotational semantics

Partial order on [math]\displaystyle{ \mathbb{Z}_\bot }[/math]

In computer science, the denotational semantics approach uses least fixed points to obtain from a given program text a corresponding mathematical function, called its semantics. To this end, an artificial mathematical object, [math]\displaystyle{ \bot }[/math], is introduced, denoting the exceptional value "undefined". For example, given the program datatype int, its mathematical counterpart is defined as [math]\displaystyle{ \mathbb{Z}_\bot = \mathbb{Z} \cup \{ \bot \} ; }[/math] it is made a partially ordered set by defining [math]\displaystyle{ \bot \sqsubset n }[/math] for each [math]\displaystyle{ n \in \mathbb{Z} }[/math] and letting any two different members [math]\displaystyle{ n,m \in \mathbb{Z} }[/math] be incomparable with respect to [math]\displaystyle{ \sqsubset }[/math] (see picture).

The semantics of a program definition int f(int n){...} is some mathematical function [math]\displaystyle{ f: \mathbb{Z}_\bot \to \mathbb{Z}_\bot . }[/math] If the program definition f does not terminate for some input n, this can be expressed mathematically as [math]\displaystyle{ f(n) = \bot . }[/math] The set of all such mathematical functions is made partially ordered by defining [math]\displaystyle{ f \sqsubseteq g }[/math] if, for each [math]\displaystyle{ n , }[/math] the relation [math]\displaystyle{ f(n) \sqsubseteq g(n) }[/math] holds, that is, if [math]\displaystyle{ f(n) }[/math] is less defined than or equal to [math]\displaystyle{ g(n) . }[/math] For example, the semantics of the expression x+x/x is less defined than that of x+1, since the former, but not the latter, maps [math]\displaystyle{ 0 }[/math] to [math]\displaystyle{ \bot , }[/math] and they agree otherwise.
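The ordering can be illustrated with a small Python sketch. It rests on several assumptions that are not part of the text above: `None` stands in for ⊥, integer division `//` plays the role of `/`, a division by zero is treated as the undefined result, and the pointwise comparison is checked only on a finite sample of inputs, so it illustrates the relation rather than proving it.

```python
BOTTOM = None  # stand-in for ⊥ (an assumption of this sketch)


def leq(a, b):
    """a ⊑ b on the flat order: ⊥ is below everything, integers only below themselves."""
    return a is BOTTOM or a == b


def lift(op):
    """Turn an integer operation into a function on Z_⊥ that maps ⊥ to ⊥
    and treats a division by zero as the undefined value."""
    def lifted(n):
        if n is BOTTOM:
            return BOTTOM
        try:
            return op(n)
        except ZeroDivisionError:
            return BOTTOM
    return lifted


f = lift(lambda x: x + x // x)  # x + x/x, undefined at 0
g = lift(lambda x: x + 1)       # x + 1, defined on every integer

sample = [BOTTOM] + list(range(-5, 6))
f_below_g = all(leq(f(n), g(n)) for n in sample)  # True on this sample
g_below_f = all(leq(g(n), f(n)) for n in sample)  # False: g(0) = 1 but f(0) = ⊥
```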

Given some program text f, its mathematical counterpart is obtained as the least fixed point of some mapping from functions to functions that can be obtained by "translating" f. For example, the C definition

int fact(int n) { if (n == 0) return 1; else return n * fact(n-1); }

is translated to a mapping

[math]\displaystyle{ F: (\mathbb{Z}_\bot \to \mathbb{Z}_\bot) \to (\mathbb{Z}_\bot \to \mathbb{Z}_\bot) , }[/math] defined as [math]\displaystyle{ (F(f))(n) = \begin{cases} 1 & \text{if } n = 0, \\ n \cdot f(n-1) & \text{if } n \neq \bot \text{ and } n \neq 0, \\ \bot & \text{if } n = \bot. \\ \end{cases} }[/math]

The mapping [math]\displaystyle{ F }[/math] is defined in a non-recursive way, although fact was defined recursively. Under certain restrictions (see Kleene fixed-point theorem), which are met in the example, [math]\displaystyle{ F }[/math] necessarily has a least fixed point, [math]\displaystyle{ \operatorname{fact} }[/math], that is [math]\displaystyle{ (F(\operatorname{fact}))(n) = \operatorname{fact}(n) }[/math] for all [math]\displaystyle{ n \in \mathbb{Z}_\bot }[/math].[1] It is possible to show that

[math]\displaystyle{ \operatorname{fact}(n) = \begin{cases} n! & \text{if } n \geq 0, \\ \bot & \text{if } n \lt 0 \text{ or } n = \bot. \end{cases} }[/math]

A larger fixed point of [math]\displaystyle{ F }[/math] is e.g. the function [math]\displaystyle{ \operatorname{fact}_0 , }[/math] defined by

[math]\displaystyle{ \operatorname{fact}_0(n) = \begin{cases} n! & \text{if } n \geq 0, \\ 0 & \text{if } n \lt 0, \\ \bot & \text{if } n = \bot, \end{cases} }[/math]

however, this function does not correctly reflect the behavior of the above program text for negative [math]\displaystyle{ n ; }[/math] e.g. the call fact(-1) will not terminate at all, let alone return 0. Only the least fixed point, [math]\displaystyle{ \operatorname{fact} , }[/math] can reasonably be used as a mathematical program semantics.
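The construction of the least fixed point by repeated application of F to the everywhere-undefined function can be sketched in Python as follows. The sketch makes some assumptions of its own: `None` stands in for ⊥, multiplication is taken to be strict (a product with an undefined factor is undefined), and the helper names `approximation` and `fact_0` are introduced only for this example. The fixed-point check for fact_0 is carried out on a finite sample of inputs only.

```python
import math

BOTTOM = None  # stand-in for ⊥ (an assumption of this sketch)


def F(f):
    """The non-recursive mapping F from the definition above."""
    def Ff(n):
        if n is BOTTOM:
            return BOTTOM
        if n == 0:
            return 1
        prev = f(n - 1)
        # Strict multiplication: an undefined factor gives an undefined product.
        return BOTTOM if prev is BOTTOM else n * prev
    return Ff


def approximation(k):
    """F applied k times to the everywhere-undefined function."""
    f = lambda n: BOTTOM
    for _ in range(k):
        f = F(f)
    return f


def fact_0(n):
    """The larger fixed point: n! for n >= 0, and 0 for negative n."""
    if n is BOTTOM:
        return BOTTOM
    return math.factorial(n) if n >= 0 else 0


f4 = approximation(4)  # defined exactly on 0, 1, 2, 3

sample = [BOTTOM] + list(range(-5, 6))
fact_0_is_fixed = all(F(fact_0)(n) == fact_0(n) for n in sample)
```

In this sketch `f4(3)` evaluates to 6, while `f4(4)` and `f4(-1)` remain undefined. Each further application of F defines one more non-negative argument, and negative arguments stay undefined at every stage, which matches the description of fact above. The value of `fact_0_is_fixed` is `True`, so fact_0 also satisfies the fixed-point equation on the sample even though it does not describe the program's behavior.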

Descriptive complexity

Immerman[2][3] and Vardi[4] independently showed the descriptive complexity result that the polynomial-time computable properties of linearly ordered structures are definable in FO(LFP), i.e. in first-order logic with a least fixed point operator. However, FO(LFP) is too weak to express all polynomial-time properties of unordered structures (for instance that a structure has even size).

Greatest fixed points

The greatest fixed point of a function can be defined analogously to the least fixed point, as the fixed point which is greater than any other fixed point, according to the order of the poset. In computer science, greatest fixed points are much less commonly used than least fixed points. Specifically, the posets found in domain theory usually do not have a greatest element, hence for a given function, there may be multiple, mutually incomparable maximal fixed points, and the greatest fixed point of that function may not exist. To address this issue, the optimal fixed point has been defined as the most-defined fixed point compatible with all other fixed points. The optimal fixed point always exists, and is the greatest fixed point if the greatest fixed point exists. The optimal fixed point allows formal study of recursive and corecursive functions that do not converge with the least fixed point.[5] Unfortunately, whereas Kleene's recursion theorem shows that the least fixed point is effectively computable, the optimal fixed point of a computable function may be a non-computable function.[6]

Notes

  1. C.A. Gunter; D.S. Scott (1990). "Semantic Domains". In Jan van Leeuwen (ed.). Formal Models and Semantics. Handbook of Theoretical Computer Science. Vol. B. Elsevier. pp. 633–674. ISBN 0-444-88074-7. Here: pp. 636–638.
  2. N. Immerman, Relational queries computable in polynomial time, Information and Control 68 (1–3) (1986) 86–104.
  3. Immerman, Neil (1982). "Relational Queries Computable in Polynomial Time". pp. 147–152. doi:10.1145/800070.802187.  Revised version in Information and Control, 68 (1986), 86–104.
  4. Vardi, Moshe Y. (1982). "The Complexity of Relational Query Languages". pp. 137–146. doi:10.1145/800070.802186. 
  5. Charguéraud, Arthur (2010). "The Optimal Fixed Point Combinator". Interactive Theorem Proving 6172: 195–210. doi:10.1007/978-3-642-14052-5_15. https://www.chargueraud.org/research/2010/fix/fix.pdf. Retrieved 30 October 2021. 
  6. Shamir, Adi (October 1976). The fixedpoints of recursive definitions (Ph.D. thesis). Weizmann Institute of Science. OCLC 884951223. Here: Example 12.1, pp. 12.2–3
