Doob's martingale convergence theorems

From HandWiki
Revision as of 22:42, 6 February 2024 by Gametune (talk | contribs) (link)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

In mathematics – specifically, in the theory of stochastic processes – Doob's martingale convergence theorems are a collection of results on the limits of supermartingales, named after the American mathematician Joseph L. Doob.[1] Informally, the martingale convergence theorem typically refers to the result that any supermartingale satisfying a certain boundedness condition must converge. One may think of supermartingales as the random variable analogues of non-increasing sequences; from this perspective, the martingale convergence theorem is a random variable analogue of the monotone convergence theorem, which states that any bounded monotone sequence converges. There are symmetric results for submartingales, which are analogous to non-decreasing sequences.

Statement for discrete-time martingales

A common formulation of the martingale convergence theorem for discrete-time martingales is the following. Let [math]\displaystyle{ X_1, X_2, X_3, \dots }[/math] be a supermartingale. Suppose that the supermartingale is bounded in the sense that

[math]\displaystyle{ \sup_{t \in \mathbf{N}} \operatorname{E}[X_t^-] \lt \infty }[/math]

where [math]\displaystyle{ X_t^- }[/math] is the negative part of [math]\displaystyle{ X_t }[/math], defined by [math]\displaystyle{ X_t^- = -\min(X_t, 0) }[/math]. Then the sequence converges almost surely to a random variable [math]\displaystyle{ X }[/math] with finite expectation.

There is a symmetric statement for submartingales with bounded expectation of the positive part. A supermartingale is a stochastic analogue of a non-increasing sequence, and the condition of the theorem is analogous to the condition in the monotone convergence theorem that the sequence be bounded from below. The condition that the martingale is bounded is essential; for example, an unbiased [math]\displaystyle{ \pm 1 }[/math] random walk is a martingale but does not converge.

As intuition, there are two reasons why a sequence may fail to converge. It may go off to infinity, or it may oscillate. The boundedness condition prevents the former from happening. The latter is impossible by a "gambling" argument. Specifically, consider a stock market game in which at time [math]\displaystyle{ t }[/math], the stock has price [math]\displaystyle{ X_t }[/math]. There is no strategy for buying and selling the stock over time, always holding a non-negative amount of stock, which has positive expected profit in this game. The reason is that at each time the expected change in stock price, given all past information, is at most zero (by definition of a supermartingale). But if the prices were to oscillate without converging, then there would be a strategy with positive expected profit: loosely, buy low and sell high. This argument can be made rigorous to prove the result.

Proof sketch

The proof is simplified by making the (stronger) assumption that the supermartingale is uniformly bounded; that is, there is a constant [math]\displaystyle{ M }[/math] such that [math]\displaystyle{ |X_n| \leq M }[/math] always holds. In the event that the sequence [math]\displaystyle{ X_1,X_2,\dots }[/math] does not converge, then [math]\displaystyle{ \liminf X_n }[/math] and [math]\displaystyle{ \limsup X_n }[/math] differ. If also the sequence is bounded, then there are some real numbers [math]\displaystyle{ a }[/math] and [math]\displaystyle{ b }[/math] such that [math]\displaystyle{ a \lt b }[/math] and the sequence crosses the interval [math]\displaystyle{ [a,b] }[/math] infinitely often. That is, the sequence is eventually less than [math]\displaystyle{ a }[/math], and at a later time exceeds [math]\displaystyle{ b }[/math], and at an even later time is less than [math]\displaystyle{ a }[/math], and so forth ad infinitum. These periods where the sequence starts below [math]\displaystyle{ a }[/math] and later exceeds [math]\displaystyle{ b }[/math] are called "upcrossings".

Consider a stock market game in which at time [math]\displaystyle{ t }[/math], one may buy or sell shares of the stock at price [math]\displaystyle{ X_t }[/math]. On the one hand, it can be shown from the definition of a supermartingale that for any [math]\displaystyle{ N \in \mathbf{N} }[/math] there is no strategy which maintains a non-negative amount of stock and has positive expected profit after playing this game for [math]\displaystyle{ N }[/math] steps. On the other hand, if the prices cross a fixed interval [math]\displaystyle{ [a,b] }[/math] very often, then the following strategy seems to do well: buy the stock when the price drops below [math]\displaystyle{ a }[/math], and sell it when the price exceeds [math]\displaystyle{ b }[/math]. Indeed, if [math]\displaystyle{ u_N }[/math] is the number of upcrossings in the sequence by time [math]\displaystyle{ N }[/math], then the profit at time [math]\displaystyle{ N }[/math] is at least [math]\displaystyle{ (b-a)u_N - 2M }[/math]: each upcrossing provides at least [math]\displaystyle{ b-a }[/math] profit, and if the last action was a "buy", then in the worst case the buying price was [math]\displaystyle{ a \leq M }[/math] and the current price is [math]\displaystyle{ -M }[/math]. But any strategy has expected profit at most [math]\displaystyle{ 0 }[/math], so necessarily

[math]\displaystyle{ \operatorname{E} \big[u_N\big] \leq \frac{2M}{b-a}. }[/math]

By the monotone convergence theorem for expectations, this means that

[math]\displaystyle{ \operatorname{E} \big[\lim_{N \to \infty} u_N \big]\leq \frac{2M}{b-a} , }[/math]

so the expected number of upcrossings in the whole sequence is finite. It follows that the infinite-crossing event for interval [math]\displaystyle{ [a,b] }[/math] occurs with probability [math]\displaystyle{ 0 }[/math]. By a union bound over all rational [math]\displaystyle{ a }[/math] and [math]\displaystyle{ b }[/math], with probability [math]\displaystyle{ 1 }[/math], no interval exists which is crossed infinitely often. If for all [math]\displaystyle{ a, b \in \mathbf{Q} }[/math] there are finitely many upcrossings of interval [math]\displaystyle{ [a,b] }[/math], then the limit inferior and limit superior of the sequence must agree, so the sequence must converge. This shows that the martingale converges with probability [math]\displaystyle{ 1 }[/math].

Failure of convergence in mean

Under the conditions of the martingale convergence theorem given above, it is not necessarily true that the supermartingale [math]\displaystyle{ (X_n)_{n \in \mathbf{N}} }[/math] converges in mean (i.e. that [math]\displaystyle{ \lim_{n \to \infty} \operatorname{E}[|X_n - X|] = 0 }[/math]).

As an example,[2] let [math]\displaystyle{ (X_n)_{n \in \mathbf{N}} }[/math] be a [math]\displaystyle{ \pm 1 }[/math] random walk with [math]\displaystyle{ X_0 = 1 }[/math]. Let [math]\displaystyle{ N }[/math] be the first time when [math]\displaystyle{ X_n = 0 }[/math], and let [math]\displaystyle{ (Y_n)_{n \in \mathbf{N}} }[/math] be the stochastic process defined by [math]\displaystyle{ Y_n := X_{\min(N, n)} }[/math]. Then [math]\displaystyle{ N }[/math] is a stopping time with respect to the martingale [math]\displaystyle{ (X_n)_{n \in \mathbf{N}} }[/math], so [math]\displaystyle{ (Y_n)_{n \in \mathbf{N}} }[/math] is also a martingale, referred to as a stopped martingale. In particular, [math]\displaystyle{ (Y_n)_{n \in \mathbf{N}} }[/math] is a supermartingale which is bounded below, so by the martingale convergence theorem it converges pointwise almost surely to a random variable [math]\displaystyle{ Y }[/math]. But if [math]\displaystyle{ Y_n \gt 0 }[/math] then [math]\displaystyle{ Y_{n+1} = Y_n \pm 1 }[/math], so [math]\displaystyle{ Y }[/math] is almost surely zero.

This means that [math]\displaystyle{ \operatorname{E}[Y] = 0 }[/math]. However, [math]\displaystyle{ \operatorname{E}[Y_n] = 1 }[/math] for every [math]\displaystyle{ n \geq 1 }[/math], since [math]\displaystyle{ (Y_n)_{n \in \mathbf{N}} }[/math] is a random walk which starts at [math]\displaystyle{ 1 }[/math] and subsequently makes mean-zero moves (alternately, note that [math]\displaystyle{ \operatorname{E}[Y_n] = \operatorname{E}[Y_0] = 1 }[/math] since [math]\displaystyle{ (Y_n)_{n \in \mathbf{N}} }[/math] is a martingale). Therefore [math]\displaystyle{ (Y_n)_{n \in \mathbf{N}} }[/math] cannot converge to [math]\displaystyle{ Y }[/math] in mean. Moreover, if [math]\displaystyle{ (Y_n)_{n \in \mathbb{N}} }[/math] were to converge in mean to any random variable [math]\displaystyle{ R }[/math], then some subsequence converges to [math]\displaystyle{ R }[/math] almost surely. So by the above argument [math]\displaystyle{ R = 0 }[/math] almost surely, which contradicts convergence in mean.

Statements for the general case

In the following, [math]\displaystyle{ (\Omega, F, F_*, \mathbf{P}) }[/math] will be a filtered probability space where [math]\displaystyle{ F_* = (F_t)_{t \geq 0} }[/math], and [math]\displaystyle{ N: [0,\infty) \times \Omega \to \mathbf{R} }[/math] will be a right-continuous supermartingale with respect to the filtration [math]\displaystyle{ F_* }[/math]; in other words, for all [math]\displaystyle{ 0 \leq s \leq t \lt +\infty }[/math],

[math]\displaystyle{ N_s \geq \operatorname{E} \big[ N_t \mid F_s \big]. }[/math]

Doob's first martingale convergence theorem

Doob's first martingale convergence theorem provides a sufficient condition for the random variables [math]\displaystyle{ N_t }[/math] to have a limit as [math]\displaystyle{ t\to+\infty }[/math] in a pointwise sense, i.e. for each [math]\displaystyle{ \omega }[/math] in the sample space [math]\displaystyle{ \Omega }[/math] individually.

For [math]\displaystyle{ t\geq 0 }[/math], let [math]\displaystyle{ N_t^- = \max(-N_t,0) }[/math] and suppose that

[math]\displaystyle{ \sup_{t \gt 0} \operatorname{E} \big[ N_t^{-} \big] \lt + \infty. }[/math]

Then the pointwise limit

[math]\displaystyle{ N(\omega) = \lim_{t \to + \infty} N_t (\omega) }[/math]

exists and is finite for [math]\displaystyle{ \mathbf{P} }[/math]-almost all [math]\displaystyle{ \omega \in \Omega }[/math].[3]

Doob's second martingale convergence theorem

It is important to note that the convergence in Doob's first martingale convergence theorem is pointwise, not uniform, and is unrelated to convergence in mean square, or indeed in any Lp space. In order to obtain convergence in L1 (i.e., convergence in mean), one requires uniform integrability of the random variables [math]\displaystyle{ N_t }[/math]. By Chebyshev's inequality, convergence in L1 implies convergence in probability and convergence in distribution.

The following are equivalent:

  • [math]\displaystyle{ (N_t)_{t\gt 0} }[/math] is uniformly integrable, i.e.
[math]\displaystyle{ \lim_{C \to \infty} \sup_{t \gt 0} \int_{\{ \omega \in \Omega \, \mid \, | N_t (\omega) | \gt C \}} \left| N_t (\omega) \right| \, \mathrm{d} \mathbf{P} (\omega) = 0; }[/math]
  • there exists an integrable random variable [math]\displaystyle{ N \in L^1(\Omega,\mathbf{P};\mathbf{R}) }[/math] such that [math]\displaystyle{ N_t \to N }[/math] as [math]\displaystyle{ t\to\infty }[/math] both [math]\displaystyle{ \mathbf{P} }[/math]-almost surely and in [math]\displaystyle{ L^1(\Omega,\mathbf{P};\mathbf{R}) }[/math], i.e.
[math]\displaystyle{ \operatorname{E} \left[ \left| N_t - N \right| \right] = \int_\Omega \left| N_t (\omega) - N (\omega) \right| \, \mathrm{d} \mathbf{P} (\omega) \to 0 \text{ as } t \to + \infty. }[/math]

Doob's upcrossing inequality

The following result, called Doob's upcrossing inequality or, sometimes, Doob's upcrossing lemma, is used in proving Doob's martingale convergence theorems.[3] A "gambling" argument shows that for uniformly bounded supermartingales, the number of upcrossings is bounded; the upcrossing lemma generalizes this argument to supermartingales with bounded expectation of their negative parts.

Let [math]\displaystyle{ N }[/math] be a natural number. Let [math]\displaystyle{ (X_n)_{n \in \mathbf{N}} }[/math] be a supermartingale with respect to a filtration [math]\displaystyle{ (\mathcal{F}_n)_{n \in \mathbf{N}} }[/math]. Let [math]\displaystyle{ a }[/math], [math]\displaystyle{ b }[/math] be two real numbers with [math]\displaystyle{ a \lt b }[/math]. Define the random variables [math]\displaystyle{ (U_n)_{n \in \mathbf{N}} }[/math] so that [math]\displaystyle{ U_n }[/math] is the maximum number of disjoint intervals [math]\displaystyle{ [n_{i_1}, n_{i_2}] }[/math] with [math]\displaystyle{ n_{i_2} \leq n }[/math], such that [math]\displaystyle{ X_{n_{i_1}} \lt a \lt b \lt X_{n_{i_2}} }[/math]. These are called upcrossings with respect to interval [math]\displaystyle{ [a,b] }[/math]. Then

[math]\displaystyle{ (b - a) \operatorname{E}[U_n] \le \operatorname{E}[(X_n - a)^-].\quad }[/math]

where [math]\displaystyle{ X^- }[/math] is the negative part of [math]\displaystyle{ X }[/math], defined by [math]\displaystyle{ X^- = -\min(X, 0) }[/math].[4][5]

Applications

Convergence in Lp

Let [math]\displaystyle{ M:[0,\infty) \times \Omega \to \mathbf{R} }[/math] be a continuous martingale such that

[math]\displaystyle{ \sup_{t \gt 0} \operatorname{E} \big[ \big| M_t \big|^p \big] \lt + \infty }[/math]

for some [math]\displaystyle{ p\gt 1 }[/math]. Then there exists a random variable [math]\displaystyle{ M \in L^p(\Omega,\mathbf{P};\mathbf{R}) }[/math] such that [math]\displaystyle{ M_t \to M }[/math] as [math]\displaystyle{ t\to +\infty }[/math] both [math]\displaystyle{ \mathbf{P} }[/math]-almost surely and in [math]\displaystyle{ L^p(\Omega,\mathbf{P};\mathbf{R}) }[/math].

The statement for discrete-time martingales is essentially identical, with the obvious difference that the continuity assumption is no longer necessary.

Lévy's zero–one law

Doob's martingale convergence theorems imply that conditional expectations also have a convergence property.

Let [math]\displaystyle{ (\Omega,F,\mathbf{P}) }[/math] be a probability space and let [math]\displaystyle{ X }[/math] be a random variable in [math]\displaystyle{ L^1 }[/math]. Let [math]\displaystyle{ F_* = (F_k)_{k \in \mathbf{N}} }[/math] be any filtration of [math]\displaystyle{ F }[/math], and define [math]\displaystyle{ F_\infty }[/math] to be the minimal σ-algebra generated by [math]\displaystyle{ (F_k)_{k \in \mathbf{N}} }[/math]. Then

[math]\displaystyle{ \operatorname{E} \big[ X \mid F_k \big] \to \operatorname{E} \big[ X \mid F_\infty \big] \text{ as } k \to \infty }[/math]

both [math]\displaystyle{ \mathbf{P} }[/math]-almost surely and in [math]\displaystyle{ L^1 }[/math].

This result is usually called Lévy's zero–one law or Levy's upwards theorem. The reason for the name is that if [math]\displaystyle{ A }[/math] is an event in [math]\displaystyle{ F_\infty }[/math], then the theorem says that [math]\displaystyle{ \mathbf{P}[ A \mid F_k ] \to \mathbf{1}_A }[/math] almost surely, i.e., the limit of the probabilities is 0 or 1. In plain language, if we are learning gradually all the information that determines the outcome of an event, then we will become gradually certain what the outcome will be. This sounds almost like a tautology, but the result is still non-trivial. For instance, it easily implies Kolmogorov's zero–one law, since it says that for any tail event A, we must have [math]\displaystyle{ \mathbf{P}[ A ] = \mathbf{1}_A }[/math] almost surely, hence [math]\displaystyle{ \mathbf{P}[ A ] \in \{0,1\} }[/math].

Similarly we have the Levy's downwards theorem :

Let [math]\displaystyle{ (\Omega,F,\mathbf{P}) }[/math] be a probability space and let [math]\displaystyle{ X }[/math] be a random variable in [math]\displaystyle{ L^1 }[/math]. Let [math]\displaystyle{ (F_k)_{k \in \mathbf{N}} }[/math] be any decreasing sequence of sub-sigma algebras of [math]\displaystyle{ F }[/math], and define [math]\displaystyle{ F_\infty }[/math] to be the intersection. Then

[math]\displaystyle{ \operatorname{E} \big[ X \mid F_k \big] \to \operatorname{E} \big[ X \mid F_\infty \big] \text{ as } k \to \infty }[/math]

both [math]\displaystyle{ \mathbf{P} }[/math]-almost surely and in [math]\displaystyle{ L^1 }[/math].

See also

  • Backwards martingale convergence theorem[6]


References

  1. Doob, J. L. (1953). Stochastic Processes. New York: Wiley. 
  2. Durrett (1996). Probability: theory and examples (Second ed.). Duxbury Press. ISBN 978-0-534-24318-0. ; Durrett, Rick (2010). 4th edition. Cambridge University Press. ISBN 9781139491136. https://books.google.com/books?id=evbGTPhuvSoC. 
  3. 3.0 3.1 "Martingale Convergence Theorem". https://ocw.mit.edu/courses/sloan-school-of-management/15-070j-advanced-stochastic-processes-fall-2013/lecture-notes/MIT15_070JF13_Lec11Add.pdf. 
  4. Bobrowski, Adam (2005). Functional Analysis for Probability and Stochastic Processes: An Introduction. Cambridge University Press. pp. 113–114. ISBN 9781139443883. https://books.google.com/books?id=4XDpBwXEVVkC&pg=PA113. 
  5. Gushchin, A. A. (2014). "On pathwise counterparts of Doob's maximal inequalities". Proceedings of the Steklov Institute of Mathematics 287 (287): 118–121. doi:10.1134/S0081543814080070. 
  6. Doob, Joseph L. (1994). Measure theory. Graduate Texts in Mathematics, Vol. 143. Springer. p. 197. ISBN 9781461208778. https://books.google.com/books?id=H0PhBwAAQBAJ&pg=PA197.