Nested intervals

From HandWiki
4 members of a sequence of nested intervals

In mathematics, a sequence of nested intervals can be intuitively understood as an ordered collection of intervals [math]\displaystyle{ I_n }[/math] on the real number line with natural numbers [math]\displaystyle{ n=1,2,3,\dots }[/math] as an index. In order for a sequence of intervals to be considered nested intervals, two conditions have to be met:

  1. Every interval in the sequence is contained in the previous one ([math]\displaystyle{ I_{n+1} }[/math] is always a subset of [math]\displaystyle{ I_n }[/math]).
  2. The length of the intervals get arbitrarily small (meaning the length falls below every possible threshold [math]\displaystyle{ \varepsilon }[/math] after a certain index [math]\displaystyle{ N }[/math]).

In other words, the left bound of the interval [math]\displaystyle{ I_n }[/math] can only increase ([math]\displaystyle{ a_{n+1}\geq a_n }[/math]), and the right bound can only decrease ([math]\displaystyle{ b_{n+1}\leq b_n }[/math]).

Historically - long before anyone defined nested intervals in a textbook - people implicitly constructed such nestings for concrete calculation purposes. For example, the ancient Babylonians discovered a method for computing square roots of numbers. In contrast, the famed Archimedes constructed sequences of polygons, that inscribed and surcumscribed a unit circle, in order to get a lower and upper bound for the circles circumference - which is the circle number Pi ([math]\displaystyle{ \pi }[/math]).

The central question to be posed is the nature of the intersection over all the natural numbers, or, put differently, the set of numbers, that are found in every Interval [math]\displaystyle{ I_n }[/math] (thus, for all [math]\displaystyle{ n\in\mathbb{N} }[/math]). In modern mathematics, nested intervals are used as a construction method for the real numbers (in order to complete the field of rational numbers).

Historic motivation

As stated in the introduction, historic users of mathematics discovered the nesting of intervals and closely related algorithms as methods for specific calculations. Some variations and modern interpretations of these ancient techniques will be introduced here:

Computation of square roots

When trying to find the square root of a number [math]\displaystyle{ x\gt 1 }[/math], one can be certain that [math]\displaystyle{ 1\leq \sqrt{x} \leq x }[/math], which gives the first interval [math]\displaystyle{ I_1=[1, x] }[/math], in which [math]\displaystyle{ x }[/math] has to be found. If one knows the next higher perfect square [math]\displaystyle{ k^2 \gt x }[/math], one can get an even better candidate for the first interval: [math]\displaystyle{ I_1=[1, k] }[/math].

The other intervals [math]\displaystyle{ I_n=[a_n, b_n], n\in\mathbb{N} }[/math] can now be defined recursively by looking at the sequence of midpoints [math]\displaystyle{ m_n=\frac{a_n + b_n}{2} }[/math]. Given the interval [math]\displaystyle{ I_n }[/math] is already known (starting at [math]\displaystyle{ I_1 }[/math]), one can define

[math]\displaystyle{ I_{n+1} := \left\{\begin{matrix} \left[m_n, b_n\right] && \text{if}\;\; m_n^2 \leq x \\ \left[a_n, m_n\right] && \text{if}\;\; m_n^2 \gt x \end{matrix}\right. }[/math]

To put this into words, one can compare the midpoint of [math]\displaystyle{ I_{n} }[/math] to [math]\displaystyle{ \sqrt{x} }[/math] in order to determine whether the midpoint is smaller or larger than [math]\displaystyle{ \sqrt{x} }[/math]. If the midpoint is smaller, one can set it as the lower bound of the next interval [math]\displaystyle{ I_{n+1} }[/math], and if the midpoint is larger, one can set it as the upper bound of the next interval. This guarantees that [math]\displaystyle{ \sqrt{x}\in I_{n+1} }[/math]. With this construction the intervals are nested and their length [math]\displaystyle{ |I_n| }[/math] get halved in every step of the recursion. Therefore, it is possible to get lower and upper bounds for [math]\displaystyle{ \sqrt{x} }[/math] with arbitrarily good precision (given enough computational time).

One can also compute [math]\displaystyle{ \sqrt{y} }[/math], when [math]\displaystyle{ 0\lt y\lt 1 }[/math]. In this case [math]\displaystyle{ 1/y\gt 1 }[/math], and the algorithm can be used by setting [math]\displaystyle{ x:=1/y }[/math] and calculating the reciprocal after the desired level of precision has been acquired.

Example

To demonstrate this algorithm, here is an example of how it can be used to find the value of [math]\displaystyle{ \sqrt{19} }[/math]. Note that since[math]\displaystyle{ 1^2\lt 19\lt 5^2 }[/math], the first interval for the algorithm can be defined as[math]\displaystyle{ I_1:=[1,5] }[/math], since [math]\displaystyle{ \sqrt{19} }[/math] must certainly found within this interval. Thus, using this interval, one can continue to the next step of the algorithm by calculating the midpoint of the interval, determining whether the square of the midpoint is greater than or less than 19, and setting the boundaries of the next interval accordingly before repeating the process:

[math]\displaystyle{ \begin{aligned} m_1&=\dfrac{1+5}{2}=3 &&\Rightarrow\; m_1^2=9 \leq 19 &&\Rightarrow\; I_2=[3, 5]\\ m_2&=\dfrac{3+5}{2}=4 &&\Rightarrow\; m_2^2=16 \leq 19 &&\Rightarrow\; I_3=[4, 5]\\ m_3&=\dfrac{4+5}{2}=4.5 &&\Rightarrow\; m_3^2=20.25 \gt 19 &&\Rightarrow\; I_4=[4, 4.5]\\ m_4&=\dfrac{4+4.5}{2}=4.25 &&\Rightarrow\; m_4^2=18.0625 \leq 19 &&\Rightarrow\; I_5=[4.25, 4.5]\\ m_5&=\dfrac{4.25+4.5}{2}=4.375 &&\Rightarrow\; m_5^2=19.140625 \gt 19 &&\Rightarrow\; I_5=[4.25, 4.375]\\ &\vdots & & \end{aligned} }[/math]
Each time a new midpoint is calculated, the range of possible values for [math]\displaystyle{ \sqrt{19} }[/math] is able to be constricted so that the values that remain within the interval are closer and closer to the actual value of [math]\displaystyle{ \sqrt{19}=4.35889894\dots }[/math]. That is to say, each successive change in the bounds of the interval within which [math]\displaystyle{ \sqrt{19} }[/math] must lie allows the value of [math]\displaystyle{ \sqrt{19} }[/math] to be estimated with a greater precision, either by increasing the lower bounds of the interval or decreasing the upper bounds of the interval.
This procedure can be repeated as many times as needed to attain the desired level of precision. Theoretically, by repeating the steps indefinitely, one can arrive at the true value of this square root.

Herons method

The Babylonian method uses an even more efficient algorithm that yields accurate approximations of [math]\displaystyle{ \sqrt{x} }[/math] for an [math]\displaystyle{ x\gt 0 }[/math] even faster. The modern description using nested intervals is similar to the algorithm above, but instead of using a sequence of midpoints, one uses a sequence [math]\displaystyle{ (c_n)_{n\in\mathbb{N}} }[/math] given by

[math]\displaystyle{ c_{n+1}:=\frac{1}{2}\cdot\left(c_n + \frac{x}{c_n}\right) }[/math].

This results in a sequence of intervals given by [math]\displaystyle{ I_{n+1}:=\left[\frac{x}{c_n}, c_n\right] }[/math] and [math]\displaystyle{ I_1=[0, k] }[/math], where [math]\displaystyle{ k^2\gt x }[/math], will provide accurate upper and lower bounds for [math]\displaystyle{ \sqrt{x} }[/math] very fast. In practice, only [math]\displaystyle{ c_n }[/math] has to be considered, which converges to [math]\displaystyle{ \sqrt{x} }[/math] (as does of course the lower interval bound). This algorithm is a special case of Newton's method.

Archimedes' circle measurement

Diagram of a hexagon and pentagon circumscribed outside a circle
π can be estimated by computing the perimeters of circumscribed and inscribed polygons.

As shown in the image, lower and upper bounds for the circumference of a circle can be obtained with inscribed and circumscribed regular polygons. When examining a circle with diameter [math]\displaystyle{ 1 }[/math], the circumference is (by definition of Pi) the circle number [math]\displaystyle{ \pi }[/math].

Around 250 BCE Archimedes of Syracuse started with regular hexagons, whose side lengths (and therefore circumference) can be directly calculated from the circle diameter. Furthermore, a way to compute the side length of a regular [math]\displaystyle{ 2n }[/math]-gon from the previous [math]\displaystyle{ n }[/math]-gon can be found, starting at the regular hexagon ([math]\displaystyle{ 6 }[/math]-gon). By successively doubling the number of edges until reaching 96-sided polygons, Archimedes reached an interval with [math]\displaystyle{ \tfrac{223}{71}\lt \pi \lt \tfrac{22}{7} }[/math]. The upper bound [math]\displaystyle{ 22/7 \approx 3.143 }[/math] is still often used as a rough, but pragmatic approximation of [math]\displaystyle{ \pi }[/math].

Around the year 1600 CE, Archimedes' method was still the gold standard for calculating Pi and was used by Dutch mathematician Ludolph van Ceulen, to compute more than thirty digits of [math]\displaystyle{ \pi }[/math], which took him decades. Soon after, more powerful methods for the computation were found.

Other implementations

Early uses of sequences of nested intervals (or can be described as such with modern mathematics), can be found in the predecessors of calculus (differentiation and integration). In computer science, sequences of nested intervals is used in algorithms for numerical computation. I.e. the Bisection method can be used for calculating the roots of continuous functions. In contrast to mathematically infinite sequences, an applied computational algorithm terminates at some point, when the desired zero has been found or sufficiently well approximated.

The construction of the real numbers

In mathematical analysis, nested intervals provide one method of axiomatically introducing the real numbers as the completion of the rational numbers, being a necessity for discussing the concepts of continuity and differentiability. Historically, Isaac Newton's and Gottfried Wilhelm Leibniz's discovery of differential and integral calculus from the late 1600s has posed a huge challenge for mathematicians trying to prove their methods rigorously; despite their success in physics, engineering and other sciences. The axiomatic description of nested intervals (or an equivalent axiom) has become an important foundation for the modern understanding of calculus.

In the context of this article, [math]\displaystyle{ \mathbb{R} }[/math] in conjunction with [math]\displaystyle{ + }[/math] and [math]\displaystyle{ \cdot }[/math] is an Archimedean ordered field, meaning the axioms of order and the Archimedean property hold.

Definition[1]

Let [math]\displaystyle{ (I_n)_{n\in\mathbb{N}} }[/math] be a sequence of closed intervals of the type [math]\displaystyle{ I_n=[a_n, b_n] }[/math], where [math]\displaystyle{ |I_n|:=b_n - a_n }[/math] denotes the length of such an interval. One can call [math]\displaystyle{ (I_n)_{n\in\mathbb{N}} }[/math] a sequence of nested intervals, if

  1. [math]\displaystyle{ \quad \forall n \in \mathbb{N}: \;\; I_{n+1} \subseteq I_n }[/math]
  2. [math]\displaystyle{ \quad \forall \varepsilon \gt 0 \; \exists N\in\mathbb{N}: \;\; |I_N| \lt \varepsilon }[/math].

Put into words, property 1 means, that the intervals are nested according to their index. The second property formalizes the notion, that interval sizes get arbitrarily small; meaning, that for an arbitrary constant [math]\displaystyle{ \varepsilon \gt 0 }[/math] one can always find an interval (with index [math]\displaystyle{ N }[/math]) with a length strictly smaller than that number [math]\displaystyle{ \varepsilon }[/math]. It is also worth noting that property 1 immediately implies that every interval with an index [math]\displaystyle{ n \geq N }[/math] must also have a length [math]\displaystyle{ |I_n| \lt \varepsilon }[/math].

Remark

Note that some authors refer to such interval-sequences, satisfying both properties above, as shrinking nested intervals. In this case a sequence of nested intervals refers to a sequence that only satisfies property 1.

Axiom of completeness

If [math]\displaystyle{ (I_n)_{n\in\mathbb{N}} }[/math] is a sequence of nested intervals, there always exists a real number, that is contained in every interval [math]\displaystyle{ I_n }[/math]. In formal notation this axiom guarantees, that

[math]\displaystyle{ \exists x\in\mathbb{R}: \;x\in\bigcap_{n\in\mathbb{N}} I_n }[/math].

Theorem

The intersection of each sequence [math]\displaystyle{ (I_n)_{n\in\mathbb{N}} }[/math] of nested intervals contains exactly one real number [math]\displaystyle{ x }[/math].

Proof: This statement can easily be verified by contradiction. Assume that there exist two different numbers [math]\displaystyle{ x,y\in\cap_{n\in\mathbb{N}} I_n }[/math]. From [math]\displaystyle{ x\neq y }[/math] it follows, that they differ by [math]\displaystyle{ |x-y|\gt 0. }[/math] Since both numbers have to be contained in every interval, it follows that [math]\displaystyle{ |I_n|\geq |x-y| }[/math] for all [math]\displaystyle{ n\in\mathbb{N} }[/math]. This contradicts property 2 from the definition of nested intervals; therefore, the intersection can contain at most one number [math]\displaystyle{ x }[/math]. The completeness axiom guarantees, that such a real number [math]\displaystyle{ x }[/math] exists. [math]\displaystyle{ \; \square }[/math]

Notes

  • This axiom is fundamental in the sense that a sequence of nested intervals does not necessarily contain a rational number - meaning that [math]\displaystyle{ \cap_{n\in\mathbb{N}}I_n }[/math] could yield [math]\displaystyle{ \emptyset }[/math], if only considering the rationals.
  • The axiom is equivalent to the existence of the infimum and supremum (proof below), the convergence of Cauchy sequences and the Bolzano–Weierstrass theorem. This means that one of the four has to be introduced axiomatically, while the other three can be successively proven.

Direct consequences of the axiom

Existence of roots

By generalizing the algorithm shown above for square roots, one can prove that in the real numbers, the equation [math]\displaystyle{ x=y^j,\; j\in\mathbb{N}, x\gt 0 }[/math] can always be solved for [math]\displaystyle{ y=\sqrt[j]{x}=x^{1/j} }[/math]. This means there exists a unique real number [math]\displaystyle{ y\gt 0 }[/math], such that [math]\displaystyle{ x=y^k }[/math]. Comparing to the section above, one achieves a sequence of nested intervals for the [math]\displaystyle{ k }[/math]-th root of [math]\displaystyle{ x }[/math], namely [math]\displaystyle{ y }[/math], by looking at whether the midpoint [math]\displaystyle{ m_n }[/math] of the [math]\displaystyle{ n }[/math]-th interval is lower or equal or greater than [math]\displaystyle{ m_n^k }[/math].

Existence of infimum and supremum in bounded Sets

Definition

If [math]\displaystyle{ A\subset \mathbb{R} }[/math] has an upper bound, i.e. there exists a number [math]\displaystyle{ b }[/math], such that [math]\displaystyle{ x\leq b }[/math] for all [math]\displaystyle{ x\in A }[/math], one can call the number [math]\displaystyle{ s=\sup(A) }[/math] the supremum of [math]\displaystyle{ A }[/math], if

  1. the number [math]\displaystyle{ s }[/math] is an upper bound of [math]\displaystyle{ A }[/math], meaning [math]\displaystyle{ \forall x \in A: \; x\leq s }[/math]
  2. [math]\displaystyle{ s }[/math] is the least upper bound of [math]\displaystyle{ A }[/math], meaning [math]\displaystyle{ \forall \sigma \lt s : \; \exists x\in A: \; x \gt \sigma }[/math]

Only one such number [math]\displaystyle{ s }[/math] can exist. Analogously one can define the infimum ([math]\displaystyle{ \inf(B) }[/math]) of a set [math]\displaystyle{ B\subset \mathbb{R} }[/math], that is bounded from below, as the greatest lower bound of that set.

Theorem

Each set [math]\displaystyle{ A\subset \mathbb{R} }[/math] has a supremum (infimum), if it is bounded from above (below).

Proof: Without loss of generality one can look at a set [math]\displaystyle{ A\subset \mathbb{R} }[/math] that has an upper bound. One can now construct a sequence [math]\displaystyle{ (I_n)_{n\in\mathbb{N}} }[/math] of nested intervals [math]\displaystyle{ I_n=[a_n, b_n] }[/math], that has the following two properties:

  1. [math]\displaystyle{ b_n }[/math] is an upper bound of [math]\displaystyle{ A }[/math] for all [math]\displaystyle{ n\in\mathbb{N} }[/math]
  2. [math]\displaystyle{ a_n }[/math] is never an upper bound of [math]\displaystyle{ A }[/math] for any [math]\displaystyle{ n\in\mathbb{N} }[/math].

The construction follows a recursion by starting with any number [math]\displaystyle{ a_1 }[/math], that is not an upper bound (e.g. [math]\displaystyle{ a_1=c - 1 }[/math], where [math]\displaystyle{ c\in A }[/math] and an arbitrary upper bound [math]\displaystyle{ b_1 }[/math] of [math]\displaystyle{ A }[/math]). Given [math]\displaystyle{ I_n=[a_n, b_n] }[/math] for some [math]\displaystyle{ n\in\mathbb{N} }[/math] one can compute the midpoint [math]\displaystyle{ m_n:= \frac{a_n+b_n}{2} }[/math] and define

[math]\displaystyle{ I_{n+1} := \left\{\begin{matrix} \left[a_n, m_n\right] && \text{if}\; m_n \;\text{is an upper bound of}\; A \\ \left[m_n, b_n\right] && \text{if}\; m_n \;\text{is not an upper bound} \end{matrix}\right. }[/math]

Note that this interval sequence is well defined and obviously a sequence of nested intervals by construction.

Now let [math]\displaystyle{ s }[/math] be the number in every interval (whose existence is guaranteed by the axiom). [math]\displaystyle{ s }[/math] is an upper bound of [math]\displaystyle{ A }[/math], otherwise there exists a number [math]\displaystyle{ x\in A }[/math], such that [math]\displaystyle{ x\gt s }[/math]. Furthermore, this would imply the existence of an interval [math]\displaystyle{ I_m=[a_m, b_m] }[/math] with [math]\displaystyle{ b_m - a_m \lt x-s }[/math], from which [math]\displaystyle{ b_m - s \lt x-s }[/math] follows, due to [math]\displaystyle{ s }[/math] also being an element of [math]\displaystyle{ I_m }[/math]. But this is a contradiction to property 1 of the supremum (meaning [math]\displaystyle{ b_m\lt s }[/math] for all [math]\displaystyle{ m\in\mathbb{N} }[/math]). Therefore [math]\displaystyle{ s }[/math] is in fact an upper bound of [math]\displaystyle{ A }[/math].

Assume that there exists a lower upper bound [math]\displaystyle{ \sigma \lt s }[/math] of [math]\displaystyle{ A }[/math]. Since [math]\displaystyle{ (I_n)_{n\in\mathbb{N}} }[/math] is a sequence of nested intervals, the interval lengths get arbitrarily small; in particular, there exists an interval with a length smaller than [math]\displaystyle{ s-\sigma }[/math]. But from [math]\displaystyle{ s\in I_n }[/math] one gets [math]\displaystyle{ s-a_n\lt s-\sigma }[/math] and therefore [math]\displaystyle{ a_n\gt \sigma }[/math]. Following the rules of this construction, [math]\displaystyle{ a_n }[/math] would have to be an upper bound of [math]\displaystyle{ A }[/math], contradicting property 2 of all sequences of nested intervals.

In two steps, it has been shown that [math]\displaystyle{ s }[/math] is an upper bound of [math]\displaystyle{ A }[/math] and that a lower upper bound cannot exist. Therefore [math]\displaystyle{ s }[/math] is the supremum of [math]\displaystyle{ A }[/math] by definition.

Remark

As was seen, the existence of suprema and infima of bounded sets is a consequence of the completeness of [math]\displaystyle{ \mathbb{R} }[/math]. In effect the two are actually equivalent, meaning that either of the two can be introduced axiomatically.

Proof: Let [math]\displaystyle{ (I_n)_{n\in\mathbb{N}} }[/math] with [math]\displaystyle{ I_n=[a_n, b_n] }[/math] be a sequence of nested intervals. Then the set [math]\displaystyle{ A:=\{a_1, a_2,\dots\} }[/math] is bounded from above, where every [math]\displaystyle{ b_n }[/math] is an upper bound. This implies, that the least upper bound [math]\displaystyle{ s=\sup(A) }[/math] fulfills [math]\displaystyle{ a_n\leq s\leq b_n }[/math] for all [math]\displaystyle{ n\in\mathbb{N} }[/math]. Therefore [math]\displaystyle{ s\in I_n }[/math] for all [math]\displaystyle{ n\in\mathbb{N} }[/math], respectively [math]\displaystyle{ s\in\cap_{n\in\mathbb{N}} I_n }[/math].

Further consequences

After formally defining the convergence of sequences and accumulation points of sequences, one can also prove the Bolzano–Weierstrass theorem using nested intervals. In a follow-up, the fact, that Cauchy sequences are convergent (and that all convergent sequences are Cauchy sequences) can be proven. This in turn allows for a proof of the completeness property above, showing their equivalence.

Further discussion of related aspects

Without any specifying what is meant by interval, all that can be said about the intersection [math]\displaystyle{ \cap_{n\in\mathbb{N}} I_n }[/math] over all the naturals (i.e. the set of all points common to each interval) is that it is either the empty set [math]\displaystyle{ \emptyset }[/math], a point on the number line (called a singleton [math]\displaystyle{ \{x\} }[/math]), or some interval.

The possibility of an empty intersection can be illustrated by looking at a sequence of open intervals [math]\displaystyle{ I_n=\left(0, \frac{1}{n}\right) = \left\{x\in\mathbb{R}:0\lt x\lt \frac{1}{n}\right\} }[/math].

In this case, the empty set [math]\displaystyle{ \emptyset }[/math] results from the intersection [math]\displaystyle{ \cap_{n\in\mathbb{N}} I_n }[/math]. This result comes from the fact that, for any number [math]\displaystyle{ x\gt 0 }[/math] there exists some value of [math]\displaystyle{ n\in\mathbb{N} }[/math] (namely any [math]\displaystyle{ n\gt 1/x }[/math]), such that [math]\displaystyle{ 1/n\lt x }[/math]. This is given by the Archimedean property of the real numbers. Therefore, no matter how small [math]\displaystyle{ x \gt 0 }[/math], one can always find intervals [math]\displaystyle{ I_n }[/math] in the sequence, such that [math]\displaystyle{ x\notin I_n, }[/math] implying that the intersection has to be empty.

The situation is different for closed intervals. If one changes the situation above by looking at closed intervals of the type [math]\displaystyle{ I_n=\left[0, \frac{1}{n}\right] = \left\{x\in\mathbb{R}:0 \leq x \leq \frac{1}{n}\right\} }[/math], one can see this very clearly. Now for each [math]\displaystyle{ x\gt 0 }[/math] one still can always find intervals not containing said [math]\displaystyle{ x }[/math], but for [math]\displaystyle{ x=0 }[/math], the property [math]\displaystyle{ 0\leq x \leq 1/n }[/math] holds true for any [math]\displaystyle{ n\in\mathbb{N} }[/math]. One can conclude that, in this case, [math]\displaystyle{ \cap_{n\in\mathbb{N}} I_n = \{0\} }[/math].

One can also consider the complement of each interval, written as [math]\displaystyle{ (-\infty,a_n) \cup (b_n, \infty) }[/math] - which, in our last example, is [math]\displaystyle{ (-\infty,0) \cup (1/n, \infty) }[/math]. By De Morgan's laws, the complement of the intersection is a union of two disjoint open sets. By the connectedness of the real line there must be something between them. This shows that the intersection of (even an uncountable number of) nested, closed, and bounded intervals is nonempty.

Higher dimensions

In two dimensions there is a similar result: nested closed disks in the plane must have a common intersection. This result was shown by Hermann Weyl to classify the singular behaviour of certain differential equations.

See also


References

  1. Königsberger, Konrad (2004). Analysis 1. Springer. p. 11. ISBN 354040371X.