Empirical process

In probability theory, an empirical process is a stochastic process that characterizes the deviation of the empirical distribution function its expectation. In mean field theory, limit theorems (as the number of objects becomes large) are considered and generalise the central limit theorem for empirical measures. Applications of the theory of empirical processes arise in non-parametric statistics.^[1]

Definition

For X₁, X₂, ... X_n independent and identically-distributed random variables in R with common cumulative distribution function F(x), the empirical distribution function is defined by

[math]\displaystyle{ F_n(x)=\frac{1}{n}\sum_{i=1}^n I_{(-\infty,x]}(X_i), }[/math]

where I_C is the indicator function of the set C.

For every (fixed) x, F_n(x) is a sequence of random variables which converge to F(x) almost surely by the strong law of large numbers. That is, F_n converges to F pointwise. Glivenko and Cantelli strengthened this result by proving uniform convergence of F_n to F by the Glivenko–Cantelli theorem.^[2]

A centered and scaled version of the empirical measure is the signed measure

[math]\displaystyle{ G_n(A)=\sqrt{n}(P_n(A)-P(A)) }[/math]

It induces a map on measurable functions f given by

[math]\displaystyle{ f\mapsto G_n f=\sqrt{n}(P_n-P)f=\sqrt{n}\left(\frac{1}{n}\sum_{i=1}^n f(X_i)-\mathbb{E}f\right) }[/math]

By the central limit theorem, [math]\displaystyle{ G_n(A) }[/math] converges in distribution to a normal random variable N(0, P(A)(1 − P(A))) for fixed measurable set A. Similarly, for a fixed function f, [math]\displaystyle{ G_nf }[/math] converges in distribution to a normal random variable [math]\displaystyle{ N(0,\mathbb{E}(f-\mathbb{E}f)^2) }[/math], provided that [math]\displaystyle{ \mathbb{E}f }[/math] and [math]\displaystyle{ \mathbb{E}f^2 }[/math] exist.

Definition

[math]\displaystyle{ \bigl(G_n(c)\bigr)_{c\in\mathcal{C}} }[/math] is called an empirical process indexed by [math]\displaystyle{ \mathcal{C} }[/math], a collection of measurable subsets of S.

[math]\displaystyle{ \bigl(G_nf\bigr)_{f\in\mathcal{F}} }[/math] is called an empirical process indexed by [math]\displaystyle{ \mathcal{F} }[/math], a collection of measurable functions from S to [math]\displaystyle{ \mathbb{R} }[/math].

A significant result in the area of empirical processes is Donsker's theorem. It has led to a study of Donsker classes: sets of functions with the useful property that empirical processes indexed by these classes converge weakly to a certain Gaussian process. While it can be shown that Donsker classes are Glivenko–Cantelli classes, the converse is not true in general.

Example

As an example, consider empirical distribution functions. For real-valued iid random variables X₁, X₂, ..., X_n they are given by

[math]\displaystyle{ F_n(x)=P_n((-\infty,x])=P_nI_{(-\infty,x]}. }[/math]

In this case, empirical processes are indexed by a class [math]\displaystyle{ \mathcal{C}=\{(-\infty,x]:x\in\mathbb{R}\}. }[/math] It has been shown that [math]\displaystyle{ \mathcal{C} }[/math] is a Donsker class, in particular,

[math]\displaystyle{ \sqrt{n}(F_n(x)-F(x)) }[/math] converges weakly in [math]\displaystyle{ \ell^\infty(\mathbb{R}) }[/math] to a Brownian bridge B(F(x)) .

References

↑ Mojirsheibani, M. (2007). "Nonparametric curve estimation with missing data: A general empirical process approach". Journal of Statistical Planning and Inference 137 (9): 2733–2758. doi:10.1016/j.jspi.2006.02.016.
↑ Wolfowitz, J. (1954). "Generalization of the Theorem of Glivenko-Cantelli". The Annals of Mathematical Statistics 25: 131–138. doi:10.1214/aoms/1177728852.

External links

Empirical Processes: Theory and Applications, by David Pollard, a textbook available online.
Introduction to Empirical Processes and Semiparametric Inference, by Michael Kosorok, another textbook available online.

0.00

(0 votes)

Original source: https://en.wikipedia.org/wiki/Empirical process. Read more

[1] Mojirsheibani, M. (2007). "Nonparametric curve estimation with missing data: A general empirical process approach". Journal of Statistical Planning and Inference 137 (9): 2733–2758. doi:10.1016/j.jspi.2006.02.016.

[2] Wolfowitz, J. (1954). "Generalization of the Theorem of Glivenko-Cantelli". The Annals of Mathematical Statistics 25: 131–138. doi:10.1214/aoms/1177728852.

[1]

[2]

Collapse v t e Stochastic processes
Discrete time	Bernoulli process Branching process Chinese restaurant process Galton–Watson process Independent and identically distributed random variables Markov chain Moran process Random walk Loop-erased Self-avoiding Biased Maximal entropy
Continuous time	Bessel process Birth–death process Brownian motion Bridge Excursion Fractional Geometric Meander Cauchy process Contact process Continuous-time random walk Cox process Diffusion process Empirical process Feller process Fleming–Viot process Gamma process Geometric process Hunt process Interacting particle systems Itô diffusion Itô process Jump diffusion Jump process Lévy process Local time Markov additive process McKean–Vlasov process Ornstein–Uhlenbeck process Poisson process Compound Non-homogeneous Point process Schramm–Loewner evolution Semimartingale Sigma-martingale Stable process Superprocess Telegraph process Variance gamma process Wiener process Wiener sausage
Both	Branching process Galves–Löcherbach model Gaussian process Hidden Markov model (HMM) Markov process Martingale Differences Local Sub- Super- Random dynamical system Regenerative process Renewal process Stochastic chains with memory of variable length White noise
Fields and other	Dirichlet process Gaussian random field Gibbs measure Hopfield model Ising model Potts model Boolean network Markov random field Percolation Pitman–Yor process Point process Cox Poisson Random field Random graph
Time series models	Autoregressive conditional heteroskedasticity (ARCH) model Autoregressive integrated moving average (ARIMA) model Autoregressive (AR) model Autoregressive–moving-average (ARMA) model Generalized autoregressive conditional heteroskedasticity (GARCH) model Moving-average (MA) model
Financial models	Black–Derman–Toy Black–Karasinski Black–Scholes Chen Constant elasticity of variance (CEV) Cox–Ingersoll–Ross (CIR) Garman–Kohlhagen Heath–Jarrow–Morton (HJM) Heston Ho–Lee Hull–White LIBOR market Rendleman–Bartter SABR volatility Vašíček Wilkie
Actuarial models	Bühlmann Cramér–Lundberg Risk process Sparre–Anderson
Queueing models	Bulk Fluid Generalized queueing network M/G/1 M/M/1 M/M/c
Properties	Càdlàg paths Continuous Continuous paths Ergodic Exchangeable Feller-continuous Gauss–Markov Markov Mixing Piecewise deterministic Predictable Progressively measurable Self-similar Stationary Time-reversible
Limit theorems	Central limit theorem Donsker's theorem Doob's martingale convergence theorems Ergodic theorem Fisher–Tippett–Gnedenko theorem Large deviation principle Law of large numbers (weak/strong) Law of the iterated logarithm Maximal ergodic theorem Sanov's theorem
Inequalities	Burkholder–Davis–Gundy Doob's martingale Kunita–Watanabe
Tools	Cameron–Martin formula Convergence of random variables Doléans-Dade exponential Doob decomposition theorem Doob–Meyer decomposition theorem Doob's optional stopping theorem Dynkin's formula Feynman–Kac formula Filtration Girsanov theorem Infinitesimal generator Itô integral Itô's lemma Karhunen–Loève_theorem Kolmogorov continuity theorem Kolmogorov extension theorem Lévy–Prokhorov metric Malliavin calculus Martingale representation theorem Optional stopping theorem Prokhorov's theorem Quadratic variation Reflection principle Skorokhod integral Skorokhod's representation theorem Skorokhod space Snell envelope Stochastic differential equation Tanaka Stopping time Stratonovich integral Uniform integrability Usual hypotheses Wiener space Classical Abstract
Disciplines	Actuarial mathematics Econometrics Ergodic theory Extreme value theory (EVT) Large deviations theory Mathematical finance Mathematical statistics Probability theory Queueing theory Renewal theory Ruin theory Statistics System on Chip design Stochastic analysis Time series analysis Machine learning
List of topics Category

Anonymous

Search

Empirical process

Namespaces

More

Page actions

Contents

Definition

Example

See also

References

Further reading

External links

Navigation

Navigation

Help

Translate

Wiki tools

Wiki tools

Anonymous

Search

Empirical process

Definition

Example

See also

References

Further reading

External links

Navigation

Wiki tools

Page tools

Other projects

Categories