Rokhlin lemma

In mathematics, the Rokhlin lemma, or Kakutani–Rokhlin lemma, is an important result in ergodic theory. It states that an aperiodic measure-preserving dynamical system can be decomposed into an arbitrarily high tower of measurable sets and a remainder of arbitrarily small measure. It was proven by Vladimir Abramovich Rokhlin and independently by Shizuo Kakutani. The lemma is used extensively in ergodic theory, for example in Ornstein theory, and has many generalizations.

The Rokhlin lemma belongs to the group of mathematical statements, such as Zorn's lemma in set theory and the Schwarz lemma in complex analysis, which are traditionally called lemmas despite the fact that their roles in their respective fields are fundamental.

Terminology

A Lebesgue space is a measure space [math]\displaystyle{ (X, \mathcal B, \mu) }[/math] composed of two parts: an atomic part with finitely or countably many atoms, and a continuum part isomorphic to an interval on [math]\displaystyle{ \R }[/math].

We consider only measure-preserving maps. As is typical in measure theory, we can freely discard countably many sets of measure zero.

An ergodic map is a map [math]\displaystyle{ T }[/math] such that if [math]\displaystyle{ T^{-1}(A) = A }[/math] (except on a measure-zero set) then [math]\displaystyle{ A }[/math] or [math]\displaystyle{ X-A }[/math] has measure zero.

An aperiodic map is a map such that the set of periodic points is measure zero:

[math]\displaystyle{ \mu( \cup_{n \geq 1}\{x = T^n x\}) = 0 }[/math]

A Rokhlin tower is a family of sets [math]\displaystyle{ S, TS, \dots, T^{N-1}S }[/math] that are disjoint. [math]\displaystyle{ S }[/math] is called the base of the tower, and each [math]\displaystyle{ T^nS }[/math] is a rung or level of the tower. [math]\displaystyle{ N }[/math] is the height of the tower. The tower itself is [math]\displaystyle{ R := (S \cup TS \cup \dots \cup T^{N-1}S) }[/math]. The set outside the tower [math]\displaystyle{ X - R }[/math] is the error set.

There are several Rokhlin lemmas. Each states that, under some assumptions, we can construct Rokhlin towers that are arbitrarily high with arbitrarily small error sets.

Theorems

[1][2]

(ergodic) — If [math]\displaystyle{ T }[/math] is ergodic, and the space contains sets of arbitrarily small sizes, then we can construct Rokhlin towers.

(aperiodic) — If [math]\displaystyle{ T }[/math] is aperiodic, and the space is Lebesgue, and has measure 1, then we can construct Rokhlin towers.

(aperiodic, invertible, independent base) — Assume that [math]\displaystyle{ T }[/math] is aperiodic and invertible, and the space is Lebesgue, and has measure 1.

Given any partition of [math]\displaystyle{ X }[/math] into finitely many events [math]\displaystyle{ P := \{P_0, P_1, \dots, P_{K-1}\} }[/math], we can construct Rokhlin towers where each level is probabilistically independent of the partition.

Applications

The Rokhlin lemma is used in the proofs of several theorems. For example (Section 2.5 [2]):

Countable generator theorem (Rokhlin 1965) — Given a dynamical system on a Lebesgue space of measure 1, where [math]\displaystyle{ T }[/math] is invertible and measure preserving, it is isomorphic to a stationary process on a countable alphabet.

(Section 4.6 [2])

Krieger finite generator theorem (Krieger 1970) — Given a dynamical system on a Lebesgue space of measure 1, where [math]\displaystyle{ T }[/math] is invertible, measure preserving, and ergodic.

If its entropy is less than [math]\displaystyle{ \ln k }[/math], then the system is generated by a partition into [math]\displaystyle{ k }[/math] subsets.

Ornstein isomorphism theorem (Chapter 6 [2]).

Topological Rokhlin lemmas

Let [math]\displaystyle{ \textstyle (X,T) }[/math] be a topological dynamical system consisting of a compact metric space [math]\displaystyle{ \textstyle X }[/math] and a homeomorphism [math]\displaystyle{ \textstyle T:X\rightarrow X }[/math]. The topological dynamical system [math]\displaystyle{ \textstyle (X,T) }[/math] is called minimal if it has no proper non-empty closed [math]\displaystyle{ \textstyle T }[/math]-invariant subsets. It is called (topologically) aperiodic if it has no periodic points ([math]\displaystyle{ T^{k}x=x }[/math] for some [math]\displaystyle{ x\in X }[/math] and [math]\displaystyle{ k\in\mathbb{{Z}} }[/math] implies [math]\displaystyle{ k=0 }[/math]). A topological dynamical system [math]\displaystyle{ \textstyle (Y,S) }[/math] is called a factor of [math]\displaystyle{ \textstyle (X,T) }[/math] if there exists a continuous surjective mapping [math]\displaystyle{ \textstyle \varphi:X\rightarrow Y }[/math] which is equivariant, i.e., [math]\displaystyle{ \textstyle \varphi(Tx)=S\varphi(x) }[/math] for all [math]\displaystyle{ \textstyle x\in X }[/math].

Elon Lindenstrauss proved the following theorem:[3]

Theorem: Let [math]\displaystyle{ \textstyle (X,T) }[/math] be a topological dynamical system which has an aperiodic minimal factor. Then for every integer [math]\displaystyle{ \textstyle n\in\N }[/math] there is a continuous function [math]\displaystyle{ \textstyle f\colon X\rightarrow\R }[/math] such that the set [math]\displaystyle{ \textstyle E=\{x\in X\mid f(Tx)\neq f(x)+1\} }[/math] satisfies [math]\displaystyle{ \textstyle E,TE,\ldots,T^{n-1}E }[/math] are pairwise disjoint.

Gutman proved the following theorem:[4]

Theorem: Let [math]\displaystyle{ (X,T) }[/math] be a topological dynamical system which has an aperiodic factor with the small boundary property. Then for every [math]\displaystyle{ \varepsilon\gt 0 }[/math], there exists a continuous function [math]\displaystyle{ f\colon X\rightarrow\R }[/math] such that the set [math]\displaystyle{ \textstyle E=\{x\in X \mid f(Tx)\neq f(x)+1\} }[/math] satisfies [math]\displaystyle{ \operatorname{ocap}(\textstyle E)\lt \varepsilon }[/math], where [math]\displaystyle{ \operatorname{ocap} }[/math] denotes orbit capacity.

Other generalizations

  • There are versions for non-invertible measure-preserving transformations.[5][6]
  • Donald Ornstein and Benjamin Weiss proved a version for free actions by countable discrete amenable groups.[7]
  • Carl Linderholm proved a version for periodic non-singular transformations.[8]

Proofs

The proofs are taken from.[2]

Useful results

Proposition. An ergodic map on an atomless Lebesgue space is aperiodic.

Proof. If the map is not aperiodic, then there exists a number [math]\displaystyle{ n }[/math], such that the set of periodic points of period [math]\displaystyle{ n }[/math] has positive measure. Call the set [math]\displaystyle{ S }[/math]. Since measure is preserved, points outside of [math]\displaystyle{ S }[/math] do not map into it, nor the other way. Since the space is atomless, we can divide [math]\displaystyle{ S }[/math] into two halves, and [math]\displaystyle{ T }[/math] maps each into itself, so [math]\displaystyle{ T }[/math] is not ergodic.

Proposition. If there is an aperiodic map on a Lebesgue space of measure 1, then the space is atomless.

Proof. If there are atoms, then by measure-preservation, each atom can only map into another atom of greater or equal measure. If it maps into an atom of greater measure, it would drain out measure from the lighter atoms, so each atom maps to another atom of equal measure. Since the space has finite total measure, there are only finitely many atoms of a certain measure, and they must cycle back to the start eventually. Every atom is therefore a periodic point, so the set of periodic points has positive measure, contradicting aperiodicity.

Proposition. If [math]\displaystyle{ T }[/math] is ergodic, then any set [math]\displaystyle{ A }[/math] of positive measure satisfies (up to a null set)

[math]\displaystyle{ X = \cup_{k \geq 0} T^k A = \cup_{k \leq 0} T^k A }[/math]

Proof. [math]\displaystyle{ T^{-1}(\cup_{k \leq 0} T^k A) }[/math] is a subset of [math]\displaystyle{ \cup_{k \leq 0} T^k A }[/math], so by measure-preservation they are equal. Thus [math]\displaystyle{ \cup_{k \leq 0} T^k A }[/math] is an invariant set of [math]\displaystyle{ T }[/math], and since it contains [math]\displaystyle{ A\gt 0 }[/math], it is all of [math]\displaystyle{ X }[/math].

Similarly, [math]\displaystyle{ T(\cup_{k \geq 0} T^k A) }[/math] is a subset of [math]\displaystyle{ \cup_{k \geq 0} T^k A }[/math], so by measure-preservation they are equal, etc.

Ergodic case

Let [math]\displaystyle{ A }[/math] be a set of positive measure [math]\displaystyle{ \lt \epsilon }[/math]. Since [math]\displaystyle{ T }[/math] is ergodic, [math]\displaystyle{ X = \cup_{k \leq 0} T^k A }[/math], so almost every point sooner or later falls into [math]\displaystyle{ A }[/math]. So we define a “time till arrival” function: [math]\displaystyle{ f(x) := \min\{n \geq 0: T^n x \in A\} }[/math], with [math]\displaystyle{ f(x) := +\infty }[/math] if [math]\displaystyle{ x }[/math] never falls into [math]\displaystyle{ A }[/math]. The set [math]\displaystyle{ \{f(x) = +\infty\} }[/math] is null.

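Now let [math]\displaystyle{ S = \{x: f(x) \in \{N, 2N, 3N, \dots\}\} }[/math]. If [math]\displaystyle{ x \in S }[/math] with [math]\displaystyle{ f(x) = kN }[/math], then [math]\displaystyle{ f(T^j x) = kN - j }[/math] for [math]\displaystyle{ 0 \leq j \lt N }[/math], so the sets [math]\displaystyle{ S, TS, \dots, T^{N-1}S }[/math] are pairwise disjoint and form a Rokhlin tower of height [math]\displaystyle{ N }[/math] with base [math]\displaystyle{ S }[/math].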
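
The construction of the base from the "time till arrival" function can be tried out on a finite toy model. The sketch below is only an illustration under assumptions that are not in the source: the space is {0, ..., q-1} with uniform counting measure, T is the cyclic shift x -> (x + p) mod q with gcd(p, q) = 1 (a single cycle, so every orbit reaches A), and the names ergodic_tower, q, p are hypothetical. A finite space has no sets of arbitrarily small measure, so this only mimics the ergodic case for one fixed small set A.

```python
from math import gcd


def ergodic_tower(q, p, A, N):
    """Toy model: T(x) = (x + p) mod q on {0, ..., q-1} (an assumption, not from the source).

    Returns the base S = {x : f(x) in {N, 2N, ...}} and the levels S, TS, ..., T^{N-1}S,
    where f is the time-till-arrival function for the set A.
    """
    assert gcd(p, q) == 1  # T is then one cycle, so f is finite everywhere
    A = set(A)
    assert A, "A must be nonempty"

    def T(x, n=1):
        # n-fold application of the shift
        return (x + n * p) % q

    def arrival_time(x):
        # f(x) = min{n >= 0 : T^n x in A}
        n = 0
        while T(x, n) not in A:
            n += 1
        return n

    f = {x: arrival_time(x) for x in range(q)}
    S = {x for x in range(q) if f[x] > 0 and f[x] % N == 0}
    levels = [{T(x, j) for x in S} for j in range(N)]
    return S, levels


q, p, N = 1009, 382, 5
A = range(10)  # a "small" set: 10 of 1009 points
S, levels = ergodic_tower(q, p, A, N)

tower = set().union(*levels)
error_set = set(range(q)) - tower
# Levels are pairwise disjoint exactly when their sizes add up to the size of the union.
pairwise_disjoint = sum(len(level) for level in levels) == len(tower)
print(pairwise_disjoint, len(error_set), N * len(A))
```

In this invertible toy model every uncovered point lies in one of A, TA, ..., T^{N-1}A, so the error set has at most N times as many points as A; the printed numbers let one compare the two.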

Aperiodic case

Simplify

By a previous proposition, [math]\displaystyle{ X }[/math] is atomless, so we can map it to the unit interval [math]\displaystyle{ (0, 1) }[/math].

If we can pick a near-zero set with near-full coverage, namely some [math]\displaystyle{ A = O(\epsilon) }[/math] such that [math]\displaystyle{ X - \cup_{k \in \Z} T^k A = O(\epsilon) }[/math], then there exists some [math]\displaystyle{ n }[/math], such that [math]\displaystyle{ X - \cup_{k \leq n} T^k A = O(\epsilon) }[/math], and since [math]\displaystyle{ T^{-i}(T^n A) \supset T^{n-i}A }[/math] for each [math]\displaystyle{ i = 0, 1, 2, \dots }[/math], we have

[math]\displaystyle{ X - \cup_{k \leq 0} T^k (T^nA) = O(\epsilon) }[/math]

Now, repeating the previous construction with [math]\displaystyle{ T^n A }[/math], we obtain a Rokhlin tower of height [math]\displaystyle{ N }[/math] and coverage [math]\displaystyle{ 1-O(\epsilon) }[/math].


Thus, our task reduces to picking a near-zero set with near-full coverage.

Constructing A

Pick [math]\displaystyle{ M \gt 1/\epsilon }[/math]. Let [math]\displaystyle{ S }[/math] be the family of sets [math]\displaystyle{ A }[/math] such that [math]\displaystyle{ A, T^{-1}A, \dots, T^{-M}A }[/math] are disjoint. Since [math]\displaystyle{ T }[/math] preserves measure, any [math]\displaystyle{ A \in S }[/math] has size [math]\displaystyle{ \lt \epsilon }[/math].
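
The size bound can be spelled out in one line. The following is a sketch of the estimate, assuming (as in the aperiodic case) that the space has total measure 1 and using only that the preimages are disjoint and that the map preserves measure:

```latex
(M+1)\,\mu(A)
  \;=\; \sum_{k=0}^{M} \mu\bigl(T^{-k}A\bigr)
  \;=\; \mu\Bigl(\bigcup_{k=0}^{M} T^{-k}A\Bigr)
  \;\le\; \mu(X) \;=\; 1,
\qquad\text{so}\qquad
\mu(A) \;\le\; \frac{1}{M+1} \;<\; \frac{1}{M} \;<\; \epsilon .
```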

The family [math]\displaystyle{ S }[/math] is nonempty, because [math]\displaystyle{ \emptyset \in S }[/math]. It is preordered by [math]\displaystyle{ A \lt B }[/math] iff [math]\displaystyle{ \mu(A-B) = 0 }[/math], that is, iff [math]\displaystyle{ A }[/math] is contained in [math]\displaystyle{ B }[/math] up to a null set. Any totally ordered chain has an upper bound. So by a simple Zorn-lemma–like argument, there exists a maximal element [math]\displaystyle{ A }[/math] in it. This is the desired set.

We prove by contradiction that [math]\displaystyle{ X = \cup_{k\in \Z}T^k A }[/math]. Assume not, then we will construct a set [math]\displaystyle{ I\cap E \gt 0 }[/math], disjoint from [math]\displaystyle{ A }[/math], such that [math]\displaystyle{ A \cup (I \cap E) \in S }[/math], which makes [math]\displaystyle{ A }[/math] no longer a maximal element, a contradiction.

Constructing E

Since we assumed [math]\displaystyle{ X - \cup_{k\in \Z}T^k A= \epsilon' \gt 0 }[/math], with positive probability, [math]\displaystyle{ x \not\in \cup_{k\in \Z}T^k A }[/math].

Since [math]\displaystyle{ T }[/math] is aperiodic, with probability 1,

[math]\displaystyle{ (x \neq Tx) \wedge(x \neq T^2x) \wedge \dots \wedge (x \neq T^Mx) }[/math]

And so, for a small enough [math]\displaystyle{ \delta }[/math], with probability [math]\displaystyle{ \gt 1- \epsilon'/2 }[/math],

[math]\displaystyle{ (|x - Tx| \gt \delta) \wedge(|x - T^2 x| \gt \delta) \wedge \dots \wedge (|x - T^M x| \gt \delta) }[/math]

And so, with probability [math]\displaystyle{ \gt \epsilon'/2 }[/math], this event and the event [math]\displaystyle{ x \not\in \cup_{k\in \Z}T^k A }[/math] occur simultaneously. Let the joint event be [math]\displaystyle{ E }[/math].

Proof that E works


Invertible case

Simplify

It suffices to prove the case where only the base of the tower is probabilistically independent of the partition. Once that case is proved, we can apply the base case to the partition [math]\displaystyle{ P \vee T^{-1} P \vee \dots \vee T^{-N+1}P }[/math].

Since events with zero probability can be ignored, we only consider partitions where each event [math]\displaystyle{ P_k }[/math] has positive probability.

The goal is to construct a Rokhlin tower [math]\displaystyle{ R' }[/math] with base [math]\displaystyle{ S' }[/math], such that [math]\displaystyle{ \mu(S' \cap P_i ) =\frac{1-\epsilon}{N} \mu(P_i) }[/math] for each [math]\displaystyle{ i \in 0:K-1 }[/math].

Symbolic dynamics

Given a partition [math]\displaystyle{ P }[/math] and a map [math]\displaystyle{ T }[/math], we can trace out the orbit of every point [math]\displaystyle{ x }[/math] as a string of symbols [math]\displaystyle{ a_0(x), a_1(x), a_2(x), \dots }[/math], such that each [math]\displaystyle{ T^i x \in P_{a_i(x)} }[/math]. That is, we follow [math]\displaystyle{ x }[/math] to [math]\displaystyle{ T^ix }[/math], then check which part of the partition it has ended up in, and write that part’s name as [math]\displaystyle{ a_i(x) }[/math].

Given any Rokhlin tower of height [math]\displaystyle{ N }[/math], we can take its base [math]\displaystyle{ S }[/math], and divide it into [math]\displaystyle{ K^N }[/math] equivalence classes. The equivalence is defined thus: two elements are equivalent iff their names have the same first-[math]\displaystyle{ N }[/math] symbols.

Let [math]\displaystyle{ E \subset S }[/math] be one such equivalence class, then we call [math]\displaystyle{ E, TE, \dots, T^{N-1}E }[/math] a column of the Rokhlin tower.

For each word [math]\displaystyle{ a_{0:N-1}\in (0:K-1)^N }[/math], let the corresponding equivalence class be [math]\displaystyle{ E_{a} }[/math].

Since [math]\displaystyle{ T }[/math] is invertible, the columns partition the tower. One can imagine the tower made of string cheese, cut up the base of the tower into the [math]\displaystyle{ K^N }[/math] equivalence classes, then pull it apart into [math]\displaystyle{ K^N }[/math] columns.
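
The splitting of a tower into columns can be illustrated on a small finite example. This is a sketch under assumptions that are not in the source: the space is {0, ..., 59}, T is the shift x -> x + 1 mod 60, the labelling part_of is an arbitrary choice of a partition into K = 3 parts, and the names columns_of_tower and part_of are hypothetical.

```python
from collections import defaultdict


def columns_of_tower(T, part_of, S, N):
    """Split the base S into classes by the first N symbols of each point's name,
    and return, for each word, the column E, TE, ..., T^{N-1}E as a list of levels."""
    classes = defaultdict(set)
    for x in S:
        word, y = [], x
        for _ in range(N):
            word.append(part_of(y))  # symbol a_i(x): index of the part containing T^i x
            y = T(y)
        classes[tuple(word)].add(x)

    columns = {}
    for word, E in classes.items():
        levels, level = [], E
        for _ in range(N):
            levels.append(level)
            level = {T(y) for y in level}  # move the whole level one rung up
        columns[word] = levels
    return columns


q, N, K = 60, 4, 3


def T(x):
    return (x + 1) % q  # invertible toy map (assumption)


def part_of(x):
    return (x * x // 7) % K  # arbitrary labelling into K parts (assumption)


S = set(range(0, q, 5))  # S, TS, T^2 S, T^3 S are disjoint because 4 < 5
columns = columns_of_tower(T, part_of, S, N)

tower = {(x + j) % q for x in S for j in range(N)}
all_levels = [level for levels in columns.values() for level in levels]
print(len(columns), "columns out of at most", K ** N)
```

Because T is invertible here, the column levels are pairwise disjoint and together make up the whole tower, and the n-th level of the column for a word a lies inside the part with index a_n.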

First Rokhlin tower R

Let [math]\displaystyle{ \delta \ll \epsilon }[/math] be very small, and let [math]\displaystyle{ M \gg N }[/math] be very large. Construct a Rokhlin tower with [math]\displaystyle{ M }[/math] levels and error set of size [math]\displaystyle{ \delta }[/math]. Let its base be [math]\displaystyle{ S }[/math]. The tower [math]\displaystyle{ R = S\cup TS \cup\dots\cup T^{M-1}S }[/math] has mass [math]\displaystyle{ 1-\delta }[/math].

Divide its base into equivalence classes, as previously described, but using the first [math]\displaystyle{ M }[/math] symbols of each name, since this tower has height [math]\displaystyle{ M }[/math]. This gives [math]\displaystyle{ K^M }[/math] classes and divides the tower into [math]\displaystyle{ K^M }[/math] columns [math]\displaystyle{ \{E_a\}_{a} }[/math] where [math]\displaystyle{ a }[/math] ranges over the possible words [math]\displaystyle{ (0:K-1)^M }[/math].

Because of how we defined the equivalence classes, each level in each column [math]\displaystyle{ T^nE_a }[/math] falls entirely within one of the parts [math]\displaystyle{ P_0, \dots, P_{K-1} }[/math], namely [math]\displaystyle{ P_{a_n} }[/math]. Therefore, the column levels [math]\displaystyle{ \{T^nE_a\}_{a, n} }[/math] almost make up a refinement of the partition [math]\displaystyle{ P }[/math], except for an error set of size [math]\displaystyle{ \delta }[/math].

That is,

[math]\displaystyle{ \mu(R\cap P_i) = \sum_{a \in (0:K-1)^M,\; n \in 0:M-1,\; a_n = i}\mu(T^nE_a) = \mu(P_i) + O(\delta) }[/math]

The critical idea: If we partition each [math]\displaystyle{ T^nE_a }[/math] equally into [math]\displaystyle{ N }[/math] parts, and put one into a new Rokhlin tower base [math]\displaystyle{ S' }[/math], we will have

[math]\displaystyle{ \mu(S'\cap P_i) = \frac{1}{N}\mu(P_i) + O(\delta) }[/math]

Second Rokhlin tower R'

Now we construct a new base [math]\displaystyle{ S' }[/math] as follows: Split each column base [math]\displaystyle{ E_a }[/math] into [math]\displaystyle{ N }[/math] parts [math]\displaystyle{ E_{a, 0}, \dots, E_{a, N-1} }[/math] of equal measure. For each column based on [math]\displaystyle{ E_a }[/math], add to [math]\displaystyle{ S' }[/math], in a staircase pattern, the sets

[math]\displaystyle{ E_{a, 0}, TE_{a, 1}, \dots, T^{N-1}E_{a, N-1} }[/math]

then wrap back to the start:

[math]\displaystyle{ T^NE_{a, 0}, T^{N+1}E_{a, 1}, \dots, T^{2N-1}E_{a, N-1} }[/math]

and so on, until the column is exhausted. The new Rokhlin tower base [math]\displaystyle{ S' }[/math] is almost correct, but needs to be trimmed slightly into another set [math]\displaystyle{ S'' }[/math], which would satisfy [math]\displaystyle{ \mu(S''\cap P_i) = \frac{1-\epsilon}{N}\mu(P_i) }[/math] for each [math]\displaystyle{ i \in 0:K-1 }[/math], finishing the construction. (Only now do we use the assumption that the partition has only finitely many parts. If it has countably many parts, then the trimming cannot be done.)

Trimming the new Rokhlin tower base

The new Rokhlin tower [math]\displaystyle{ S', TS', \dots, T^{N-1}S' }[/math], contains almost as much mass as the original Rokhlin tower. The only lost mass is due to a small corner on the top right and bottom left of each column, which takes up [math]\displaystyle{ \leq \frac{2N^2}{MN} }[/math] proportion of the whole column’s mass. If we set [math]\displaystyle{ M \gg N/\delta }[/math], this lost mass is still [math]\displaystyle{ O(\delta) }[/math]. Thus, the new Rokhlin tower still has a very small error set.
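
The staircase pattern and the lost corners can be checked on a combinatorial model of a single column. The sketch below is an illustration under assumptions not stated in the source: the column is modelled as an M-by-N grid of cells (level n, strip r), where strip r stands for the images of one of the N equal pieces of the column base, T moves a cell one level up within its strip, and base cells whose images would leave the column are dropped. The names staircase_base and new_tower_levels are hypothetical.

```python
def staircase_base(M, N):
    """Cells (level n, strip n mod N) of the new base inside one column of height M.

    Only levels n <= M - N are kept, so that the next N - 1 images stay in the column.
    """
    return {(n, n % N) for n in range(M - N + 1)}


def new_tower_levels(M, N):
    """Levels S', TS', ..., T^{N-1}S' of the new tower, restricted to one column."""
    base = staircase_base(M, N)
    # T raises a cell by one level and keeps it in the same vertical strip.
    return [{(n + j, r) for (n, r) in base} for j in range(N)]


M, N = 200, 5
levels = new_tower_levels(M, N)

covered = set().union(*levels)
all_cells = {(n, r) for n in range(M) for r in range(N)}
lost = all_cells - covered  # the two corners of the column

disjoint = sum(len(level) for level in levels) == len(covered)
lost_fraction = len(lost) / (M * N)
bound = 2 * N**2 / (M * N)
print(disjoint, len(lost), lost_fraction, bound)
```

In this model the new levels are pairwise disjoint, the new base meets every retained level of the column in exactly one of its N cells, and the uncovered cells form two triangular corners whose share of the column stays below the bound quoted above.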

Even after accounting for the mass lost from cutting off the column corners, we still have[math]\displaystyle{ \begin{aligned} \mu(S'\cap P_i) &= \frac{1}{N}\mu(P_i) + O(\delta) + O(\delta) \\ &= \frac{1}{N}\mu(P_i) + O(\delta) \\ &= \frac{1}{N}\mu(P_i)\times (1 + O(N\delta/\mu(P_i)))\quad\forall i = 0, 1, \dots, K-1 \end{aligned} }[/math]

Since the partition has only finitely many parts, we can set [math]\displaystyle{ \delta = o\left(\frac{\epsilon \min_i \mu(P_i)}{N}\right) }[/math], so that [math]\displaystyle{ N\delta/\mu(P_i) = o(1)\epsilon }[/math] for every [math]\displaystyle{ i }[/math]. We then have

[math]\displaystyle{ \mu(S'\cap P_i) = \frac{1}{N}\mu(P_i)\times (1 + o(1) \epsilon) }[/math]

In other words, we have real numbers [math]\displaystyle{ c_0, c_1, \dots, c_{K-1} = o(1) }[/math] such that [math]\displaystyle{ \mu(S'\cap P_i) = \frac{1-c_i \epsilon}{N}\mu(P_i) }[/math].

Now for each part index [math]\displaystyle{ i = 0, 1, \dots, K-1 }[/math], trim away a part of [math]\displaystyle{ S'\cap P_i }[/math] into [math]\displaystyle{ S''\cap P_i }[/math], so that [math]\displaystyle{ \mu(S''\cap P_i) = \frac{1-\epsilon}{N}\mu(P_i) }[/math]. This finishes the construction.

References

  1. Shields, Paul (1973). The theory of Bernoulli shifts. Chicago Lectures in Mathematics. Chicago, Illinois and London: The University of Chicago Press. pp. Chapter 3. http://www.impan.pl/~gutman/The%20Theory%20of%20Bernoulli%20Shifts.pdf. 
  2. Kalikow, Steven; McCutcheon, Randall (2010). "2.4. Rohlin tower theorem". An outline of ergodic theory. Cambridge studies in advanced mathematics (1. publ ed.). Cambridge: Cambridge Univ. Press. ISBN 978-0-521-19440-2. 
  3. Lindenstrauss, Elon (1999-12-01). "Mean dimension, small entropy factors and an embedding theorem". Publications Mathématiques de l'IHÉS 89 (1): 227–262. doi:10.1007/BF02698858. ISSN 0073-8301. http://www.numdam.org/item/PMIHES_1999__89__227_0/. 
  4. Gutman, Yonatan. "Embedding ℤk-actions in cubical shifts and ℤk-symbolic extensions." Ergodic Theory and Dynamical Systems 31.2 (2011): 383-403.
  5. Kornfeld, Isaac (2004). "Some old and new Rokhlin towers". Contemporary Mathematics 356: 145–169. doi:10.1090/conm/356/06502. ISBN 9780821833131. 
  6. Avila, Artur; Candela, Pablo (2016). "Towers for commuting endomorphisms, and combinatorial applications". Annales de l'Institut Fourier 66 (4): 1529–1544. doi:10.5802/aif.3042. 
  7. Ornstein, Donald S.; Weiss, Benjamin (1987-12-01). "Entropy and isomorphism theorems for actions of amenable groups". Journal d'Analyse Mathématique 48 (1): 1–141. doi:10.1007/BF02790325. ISSN 0021-7670. 
  8. Ionescu Tulcea, Alexandra (1965-01-01). "On the Category of Certain Classes of Transformations in Ergodic Theory". Transactions of the American Mathematical Society 114 (1): 261–279. doi:10.2307/1994001. 

Notes

  • Vladimir Rokhlin. A "general" measure-preserving transformation is not mixing. Doklady Akademii Nauk SSSR (N.S.), 60:349–351, 1948.
  • Shizuo Kakutani. Induced measure preserving transformations. Proc. Imp. Acad. Tokyo, 19:635–641, 1943.
  • Benjamin Weiss. On the work of V. A. Rokhlin in ergodic theory. Ergodic Theory and Dynamical Systems, 9(4):619–627, 1989.
  • Isaac Kornfeld. Some old and new Rokhlin towers. Contemporary Mathematics, 356:145, 2004.

See also