Bernoulli scheme

From HandWiki
Short description: Generalization of the Bernoulli process to more than two possible outcomes

In mathematics, the Bernoulli scheme or Bernoulli shift is a generalization of the Bernoulli process to more than two possible outcomes.[1][2] Bernoulli schemes appear naturally in symbolic dynamics, and are thus important in the study of dynamical systems. Many important dynamical systems (such as Axiom A systems) exhibit a repellor that is the product of the Cantor set and a smooth manifold, and the dynamics on the Cantor set are isomorphic to that of the Bernoulli shift.[3] This is essentially the Markov partition. The term shift is in reference to the shift operator, which may be used to study Bernoulli schemes. The Ornstein isomorphism theorem[4][5] shows that Bernoulli shifts are isomorphic when their entropy is equal.

Definition

A Bernoulli scheme is a discrete-time stochastic process where each independent random variable may take on one of N distinct possible values, with the outcome i occurring with probability [math]\displaystyle{ p_i }[/math], with i = 1, ..., N, and

[math]\displaystyle{ \sum_{i=1}^N p_i = 1. }[/math]

The sample space is usually denoted as

[math]\displaystyle{ X=\{1,\ldots,N \}^\mathbb{Z} }[/math]

as a shorthand for

[math]\displaystyle{ X=\{ x=(\ldots,x_{-1},x_0,x_1,\ldots) : x_k \in \{1,\ldots,N\} \; \forall k \in \mathbb{Z} \}. }[/math]

The associated measure is called the Bernoulli measure[6]

[math]\displaystyle{ \mu = \{p_1,\ldots,p_N\}^\mathbb{Z} }[/math]

The σ-algebra [math]\displaystyle{ \mathcal{A} }[/math] on X is the product sigma algebra; that is, it is the (countable) direct product of the σ-algebras of the finite set {1, ..., N}. Thus, the triplet

[math]\displaystyle{ (X,\mathcal{A},\mu) }[/math]

is a measure space. A basis of [math]\displaystyle{ \mathcal{A} }[/math] is the cylinder sets. Given a cylinder set [math]\displaystyle{ [x_0, x_1,\ldots,x_n] }[/math], its measure is

[math]\displaystyle{ \mu\left([x_0, x_1,\ldots,x_n]\right)= \prod_{i=0}^n p_{x_i} }[/math]

The equivalent expression, using the notation of probability theory, is

[math]\displaystyle{ \mu\left([x_0, x_1,\ldots,x_n]\right)= \mathrm{Pr}(X_0=x_0, X_1=x_1, \ldots, X_n=x_n) }[/math]

for the random variables [math]\displaystyle{ \{X_k\} }[/math]

The Bernoulli scheme, as any stochastic process, may be viewed as a dynamical system by endowing it with the shift operator T where

[math]\displaystyle{ T(x_k) = x_{k+1}. }[/math]

Since the outcomes are independent, the shift preserves the measure, and thus T is a measure-preserving transformation. The quadruplet

[math]\displaystyle{ (X,\mathcal{A},\mu, T) }[/math]

is a measure-preserving dynamical system, and is called a Bernoulli scheme or a Bernoulli shift. It is often denoted by

[math]\displaystyle{ BS(p)=BS(p_1,\ldots,p_N). }[/math]

The N = 2 Bernoulli scheme is called a Bernoulli process. The Bernoulli shift can be understood as a special case of the Markov shift, where all entries in the adjacency matrix are one, the corresponding graph thus being a clique.

Matches and metrics

The Hamming distance provides a natural metric on a Bernoulli scheme. Another important metric is the so-called [math]\displaystyle{ \overline f }[/math] metric, defined via a supremum over string matches.[7]

Let [math]\displaystyle{ A = a_1a_2\cdots a_m }[/math] and [math]\displaystyle{ B = b_1b_2\cdots b_n }[/math] be two strings of symbols. A match is a sequence M of pairs [math]\displaystyle{ (i_k, j_k) }[/math] of indexes into the string, i.e. pairs such that [math]\displaystyle{ a_{i_k}=b_{j_k}, }[/math] understood to be totally ordered. That is, each individual subsequence [math]\displaystyle{ (i_k) }[/math] and [math]\displaystyle{ (j_k) }[/math] are ordered: [math]\displaystyle{ 1\le i_1 \lt i_2\lt \cdots \lt i_r\le m }[/math] and likewise [math]\displaystyle{ 1\le j_1 \lt j_2\lt \cdots \lt j_r\le n. }[/math]

The [math]\displaystyle{ \overline f }[/math]-distance between [math]\displaystyle{ A }[/math] and [math]\displaystyle{ B }[/math] is

[math]\displaystyle{ \overline f(A,B) = 1-\frac{2\sup |M|}{m+n} }[/math]

where the supremum is being taken over all matches [math]\displaystyle{ M }[/math] between [math]\displaystyle{ A }[/math] and [math]\displaystyle{ B }[/math]. This satisfies the triangle inequality only when [math]\displaystyle{ m=n, }[/math] and so is not quite a true metric; despite this, it is commonly called a "distance" in the literature.

Generalizations

Most of the properties of the Bernoulli scheme follow from the countable direct product, rather than from the finite base space. Thus, one may take the base space to be any standard probability space [math]\displaystyle{ (Y,\mathcal{B},\nu) }[/math], and define the Bernoulli scheme as

[math]\displaystyle{ (X, \mathcal{A}, \mu)=(Y,\mathcal{B},\nu)^\mathbb{Z} }[/math]

This works because the countable direct product of a standard probability space is again a standard probability space.

As a further generalization, one may replace the integers [math]\displaystyle{ \mathbb{Z} }[/math] by a countable discrete group [math]\displaystyle{ G }[/math], so that

[math]\displaystyle{ (X, \mathcal{A}, \mu)=(Y,\mathcal{B},\nu)^G }[/math]

For this last case, the shift operator is replaced by the group action

[math]\displaystyle{ gx(f)=x(g^{-1}f) }[/math]

for group elements [math]\displaystyle{ f,g\in G }[/math] and [math]\displaystyle{ x\in Y^G }[/math] understood as a function [math]\displaystyle{ x:G\to Y }[/math] (any direct product [math]\displaystyle{ Y^G }[/math] can be understood to be the set of functions [math]\displaystyle{ [G\to Y] }[/math], as this is the exponential object). The measure [math]\displaystyle{ \mu }[/math] is taken as the Haar measure, which is invariant under the group action:

[math]\displaystyle{ \mu(gx)=\mu(x). \, }[/math]

These generalizations are also commonly called Bernoulli schemes, as they still share most properties with the finite case.

Properties

Ya. Sinai demonstrated that the Kolmogorov entropy of a Bernoulli scheme is given by[8][9]

[math]\displaystyle{ H = -\sum_{i=1}^N p_i \log p_i . }[/math]

This may be seen as resulting from the general definition of the entropy of a Cartesian product of probability spaces, which follows from the asymptotic equipartition property. For the case of a general base space [math]\displaystyle{ (Y, \mathcal{B}, \nu) }[/math] (i.e. a base space which is not countable), one typically considers the relative entropy. So, for example, if one has a countable partition [math]\displaystyle{ Y'\subset Y }[/math] of the base Y, such that [math]\displaystyle{ \nu(Y')=1 }[/math], one may define the entropy as

[math]\displaystyle{ H_{Y'} = -\sum_{y'\in Y'} \nu(y') \log \nu(y') . }[/math]

In general, this entropy will depend on the partition; however, for many dynamical systems, it is the case that the symbolic dynamics is independent of the partition (or rather, there are isomorphisms connecting the symbolic dynamics of different partitions, leaving the measure invariant), and so such systems can have a well-defined entropy independent of the partition.

Ornstein isomorphism theorem

The Ornstein isomorphism theorem states that two Bernoulli schemes with the same entropy are isomorphic.[4] The result is sharp,[10] in that very similar, non-scheme systems, such as Kolmogorov automorphisms, do not have this property.

The Ornstein isomorphism theorem is in fact considerably deeper: it provides a simple criterion by which many different measure-preserving dynamical systems can be judged to be isomorphic to Bernoulli schemes. The result was surprising, as many systems previously believed to be unrelated proved to be isomorphic. These include all finite[clarification needed] stationary stochastic processes, subshifts of finite type, finite Markov chains, Anosov flows, and Sinai's billiards: these are all isomorphic to Bernoulli schemes.

For the generalized case, the Ornstein isomorphism theorem still holds if the group G is a countably infinite amenable group. [11][12]

Bernoulli automorphism

An invertible, measure-preserving transformation of a standard probability space (Lebesgue space) is called a Bernoulli automorphism if it is isomorphic to a Bernoulli shift.[13]

Loosely Bernoulli

A system is termed "loosely Bernoulli" if it is Kakutani-equivalent to a Bernoulli shift; in the case of zero entropy, if it is Kakutani-equivalent to an irrational rotation of a circle.

See also

References

  1. P. Shields, The theory of Bernoulli shifts, Univ. Chicago Press (1973)
  2. Michael S. Keane, "Ergodic theory and subshifts of finite type", (1991), appearing as Chapter 2 in Ergodic Theory, Symbolic Dynamics and Hyperbolic Spaces, Tim Bedford, Michael Keane and Caroline Series, Eds. Oxford University Press, Oxford (1991). ISBN:0-19-853390-X
  3. Pierre Gaspard, Chaos, scattering and statistical mechanics (1998), Cambridge University press
  4. 4.0 4.1 "Bernoulli shifts with the same entropy are isomorphic". Advances in Mathematics 4: 337–352. 1970. doi:10.1016/0001-8708(70)90029-0. 
  5. Hazewinkel, Michiel, ed. (2001), "Ornstein isomorphism theorem", Encyclopedia of Mathematics, Springer Science+Business Media B.V. / Kluwer Academic Publishers, ISBN 978-1-55608-010-4, https://www.encyclopediaofmath.org/index.php?title=Ornstein_isomorphism_theorem&oldid=17997 
  6. Klenke, Achim (2006). Probability Theory. Springer-Verlag. ISBN 978-1-84800-047-6. 
  7. Feldman, Jacob (1976). "New [math]\displaystyle{ K }[/math]-automorphisms and a problem of Kakutani". Israel Journal of Mathematics 24 (1): 16–38. doi:10.1007/BF02761426. 
  8. Ya.G. Sinai, (1959) "On the Notion of Entropy of a Dynamical System", Doklady of Russian Academy of Sciences 124, pp. 768–771.
  9. Ya. G. Sinai, (2007) "Metric Entropy of Dynamical System"
  10. Hoffman, Christopher (1999). "A [math]\displaystyle{ K }[/math] Counterexample Machine". Transactions of the American Mathematical Society 351: 4263–4280. https://www.ams.org/journals/tran/1999-351-10/S0002-9947-99-02446-0/. 
  11. "Entropy and isomorphism theorems for actions of amenable groups". Journal d'Analyse Mathématique 48: 1–141. 1987. doi:10.1007/BF02790325. 
  12. Bowen, Lewis (2012). "Every countably infinite group is almost Ornstein". Contemporary Mathematics 567: 67–78. 
  13. Peter Walters (1982) An Introduction to Ergodic Theory, Springer-Verlag, ISBN:0-387-90599-5