Hajek projection

In statistics, Hájek projection of a random variable [math]\displaystyle{ T }[/math] on a set of independent random vectors [math]\displaystyle{ X_1,\dots,X_n }[/math] is a particular measurable function of [math]\displaystyle{ X_1,\dots,X_n }[/math] that, loosely speaking, captures the variation of [math]\displaystyle{ T }[/math] in an optimal way. It is named after the Czech statistician Jaroslav Hájek .

Definition

Given a random variable [math]\displaystyle{ T }[/math] and a set of independent random vectors [math]\displaystyle{ X_1,\dots,X_n }[/math], the Hájek projection [math]\displaystyle{ \hat{T} }[/math] of [math]\displaystyle{ T }[/math] onto [math]\displaystyle{ \{X_1,\dots,X_n\} }[/math] is given by^[1]

[math]\displaystyle{ \hat{T} = \operatorname{E}(T) + \sum_{i=1}^n \left[ \operatorname{E}(T\mid X_i) - \operatorname{E}(T)\right] = \sum_{i=1}^n \operatorname{E}(T\mid X_i) - (n-1)\operatorname{E}(T) }[/math]

Properties

Hájek projection [math]\displaystyle{ \hat{T} }[/math] is an [math]\displaystyle{ L^2 }[/math]projection of [math]\displaystyle{ T }[/math] onto a linear subspace of all random variables of the form [math]\displaystyle{ \sum_{i=1}^n g_i(X_i) }[/math], where [math]\displaystyle{ g_i:\mathbb{R}^d \to \mathbb{R} }[/math] are arbitrary measurable functions such that [math]\displaystyle{ \operatorname{E}(g_i^2(X_i))\lt \infty }[/math] for all [math]\displaystyle{ i=1,\dots,n }[/math]
[math]\displaystyle{ \operatorname{E} (\hat{T}\mid X_i)=\operatorname{E}(T\mid X_i) }[/math] and hence [math]\displaystyle{ \operatorname{E}(\hat{T})=\operatorname{E}(T) }[/math]
Under some conditions, asymptotic distributions of the sequence of statistics [math]\displaystyle{ T_n=T_n(X_1,\dots,X_n) }[/math] and the sequence of its Hájek projections [math]\displaystyle{ \hat{T}_n = \hat{T}_n(X_1,\dots,X_n) }[/math] coincide, namely, if [math]\displaystyle{ \operatorname{Var}(T_n)/\operatorname{Var}(\hat{T}_n) \to 1 }[/math], then [math]\displaystyle{ \frac{T_n-\operatorname{E}(T_n)}{\sqrt{\operatorname{Var}(T_n)}} - \frac{\hat{T}_n-\operatorname{E}(\hat{T}_n)}{\sqrt{\operatorname{Var}(\hat{T}_n)}} }[/math] converges to zero in probability.

References

↑ Vaart, Aad W. van der (1959-....). (2012). Asymptotic statistics. Cambridge University Press. ISBN 9780511802256. OCLC 928629884. http://worldcat.org/oclc/928629884.

0.00

(0 votes)

Original source: https://en.wikipedia.org/wiki/Hajek projection. Read more

[1] Vaart, Aad W. van der (1959-....). (2012). Asymptotic statistics. Cambridge University Press. ISBN 9780511802256. OCLC 928629884. http://worldcat.org/oclc/928629884.

[1]

Anonymous

Search

Hajek projection

Namespaces

More

Page actions

Definition

Properties

References

Navigation

Navigation

Help

Translate

Wiki tools

Wiki tools

Anonymous

Search

Hajek projection

Definition

Properties

References

Navigation

Wiki tools

Page tools

Other projects

Categories