Uncorrelatedness (probability theory)

From HandWiki
Revision as of 19:15, 6 February 2024 by Len Stevenson (talk | contribs) (add)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

In probability theory and statistics, two real-valued random variables, [math]\displaystyle{ X }[/math], [math]\displaystyle{ Y }[/math], are said to be uncorrelated if their covariance, [math]\displaystyle{ \operatorname{cov}[X,Y] = \operatorname{E}[XY] - \operatorname{E}[X] \operatorname{E}[Y] }[/math], is zero. If two variables are uncorrelated, there is no linear relationship between them.

Uncorrelated random variables have a Pearson correlation coefficient, when it exists, of zero, except in the trivial case when either variable has zero variance (is a constant). In this case the correlation is undefined.

In general, uncorrelatedness is not the same as orthogonality, except in the special case where at least one of the two random variables has an expected value of 0. In this case, the covariance is the expectation of the product, and [math]\displaystyle{ X }[/math] and [math]\displaystyle{ Y }[/math] are uncorrelated if and only if [math]\displaystyle{ \operatorname{E}[XY] = 0 }[/math].

If [math]\displaystyle{ X }[/math] and [math]\displaystyle{ Y }[/math] are independent, with finite second moments, then they are uncorrelated. However, not all uncorrelated variables are independent.[1]:p. 155

Definition

Definition for two real random variables

Two random variables [math]\displaystyle{ X,Y }[/math] are called uncorrelated if their covariance [math]\displaystyle{ \operatorname{Cov}[X,Y]=\operatorname{E}[(X-\operatorname{E}[X]) (Y-\operatorname{E}[Y])] }[/math] is zero.[1]:p. 153[2]:p. 121 Formally:

[math]\displaystyle{ X,Y \text{ uncorrelated} \quad \iff \quad \operatorname{E}[XY] = \operatorname{E}[X] \cdot \operatorname{E}[Y] }[/math]

Definition for two complex random variables

Two complex random variables [math]\displaystyle{ Z,W }[/math] are called uncorrelated if their covariance [math]\displaystyle{ \operatorname{K}_{ZW}=\operatorname{E}[(Z-\operatorname{E}[Z])\overline{(W-\operatorname{E}[W])}] }[/math] and their pseudo-covariance [math]\displaystyle{ \operatorname{J}_{ZW}=\operatorname{E}[(Z-\operatorname{E}[Z]) (W-\operatorname{E}[W])] }[/math] is zero, i.e.

[math]\displaystyle{ Z,W \text{ uncorrelated} \quad \iff \quad \operatorname{E}[Z\overline{W}] = \operatorname{E}[Z] \cdot \operatorname{E}[\overline{W}] \text{ and } \operatorname{E}[ZW] = \operatorname{E}[Z] \cdot \operatorname{E}[W] }[/math]

Definition for more than two random variables

A set of two or more random variables [math]\displaystyle{ X_1,\ldots,X_n }[/math] is called uncorrelated if each pair of them is uncorrelated. This is equivalent to the requirement that the non-diagonal elements of the autocovariance matrix [math]\displaystyle{ \operatorname{K}_{\mathbf{X}\mathbf{X}} }[/math] of the random vector [math]\displaystyle{ \mathbf{X} = (X_1,\ldots,X_n)^\mathrm{T} }[/math] are all zero. The autocovariance matrix is defined as:

[math]\displaystyle{ \operatorname{K}_{\mathbf{X}\mathbf{X}} = \operatorname{cov}[\mathbf{X},\mathbf{X}] = \operatorname{E}[(\mathbf{X}-\operatorname{E}[\mathbf{X}])(\mathbf{X}-\operatorname{E}[\mathbf{X}])^{\rm T}]= \operatorname{E}[\mathbf{X} \mathbf{X}^T] - \operatorname{E}[\mathbf{X}]\operatorname{E}[\mathbf{X}]^T }[/math]

Examples of dependence without correlation

Main page: Correlation and dependence

Example 1

  • Let [math]\displaystyle{ X }[/math] be a random variable that takes the value 0 with probability 1/2, and takes the value 1 with probability 1/2.
  • Let [math]\displaystyle{ Y }[/math] be a random variable, independent of [math]\displaystyle{ X }[/math], that takes the value −1 with probability 1/2, and takes the value 1 with probability 1/2.
  • Let [math]\displaystyle{ U }[/math] be a random variable constructed as [math]\displaystyle{ U=XY }[/math].

The claim is that [math]\displaystyle{ U }[/math] and [math]\displaystyle{ X }[/math] have zero covariance (and thus are uncorrelated), but are not independent.

Proof:

Taking into account that

[math]\displaystyle{ \operatorname{E}[U] = \operatorname{E}[XY] = \operatorname{E}[X] \operatorname{E}[Y] = \operatorname{E}[X] \cdot 0 = 0, }[/math]

where the second equality holds because [math]\displaystyle{ X }[/math] and [math]\displaystyle{ Y }[/math] are independent, one gets

[math]\displaystyle{ \begin{align} \operatorname{cov}[U,X] & = \operatorname{E}[(U-\operatorname E[U])(X-\operatorname E[X])] = \operatorname{E}[ U (X-\tfrac12)] \\ & = \operatorname{E}[X^2 Y - \tfrac12 XY] = \operatorname{E}[(X^2-\tfrac12 X)Y] = \operatorname{E}[(X^2-\tfrac12 X)] \operatorname E[Y] = 0 \end{align} }[/math]

Therefore, [math]\displaystyle{ U }[/math] and [math]\displaystyle{ X }[/math] are uncorrelated.

Independence of [math]\displaystyle{ U }[/math] and [math]\displaystyle{ X }[/math] means that for all [math]\displaystyle{ a }[/math] and [math]\displaystyle{ b }[/math], [math]\displaystyle{ \Pr(U=a\mid X=b) = \Pr(U=a) }[/math]. This is not true, in particular, for [math]\displaystyle{ a=1 }[/math] and [math]\displaystyle{ b=0 }[/math].

  • [math]\displaystyle{ \Pr(U=1\mid X=0) = \Pr(XY=1\mid X=0) = 0 }[/math]
  • [math]\displaystyle{ \Pr(U=1) = \Pr(XY=1) = 1/4 }[/math]

Thus [math]\displaystyle{ \Pr(U=1\mid X=0)\ne \Pr(U=1) }[/math] so [math]\displaystyle{ U }[/math] and [math]\displaystyle{ X }[/math] are not independent.

Q.E.D.

Example 2

If [math]\displaystyle{ X }[/math] is a continuous random variable uniformly distributed on [math]\displaystyle{ [-1,1] }[/math] and [math]\displaystyle{ Y = X^2 }[/math], then [math]\displaystyle{ X }[/math] and [math]\displaystyle{ Y }[/math] are uncorrelated even though [math]\displaystyle{ X }[/math] determines [math]\displaystyle{ Y }[/math] and a particular value of [math]\displaystyle{ Y }[/math] can be produced by only one or two values of [math]\displaystyle{ X }[/math] :

[math]\displaystyle{ f_X(t)= {1 \over 2} I_{[-1,1]} ; f_Y(t)= {1 \over {2 \sqrt{t}}} I_{]0,1]} }[/math]

on the other hand, [math]\displaystyle{ f_{X,Y} }[/math] is 0 on the triangle defined by [math]\displaystyle{ 0\lt X\lt Y\lt 1 }[/math] although [math]\displaystyle{ f_X \times f_Y }[/math] is not null on this domain. Therefore [math]\displaystyle{ f_{X,Y} (X,Y) \neq f_X (X) \times f_Y (Y) }[/math] and the variables are not independent.

[math]\displaystyle{ E[X] = {{1-1} \over 4} = 0 ; E[Y]= {{1^3 - (-1)^3}\over {3 \times 2} } = {1 \over 3} }[/math]

[math]\displaystyle{ Cov[X,Y]=E \left [(X-E[X])(Y-E[Y]) \right ] = E \left [X^3- {X \over 3} \right ] = {{1^4-(-1)^4}\over{4 \times 2}}=0 }[/math]

Therefore the variables are uncorrelated.

When uncorrelatedness implies independence

There are cases in which uncorrelatedness does imply independence. One of these cases is the one in which both random variables are two-valued (so each can be linearly transformed to have a Bernoulli distribution).[3] Further, two jointly normally distributed random variables are independent if they are uncorrelated,[4] although this does not hold for variables whose marginal distributions are normal and uncorrelated but whose joint distribution is not joint normal (see Normally distributed and uncorrelated does not imply independent).

Generalizations

Uncorrelated random vectors

Two random vectors [math]\displaystyle{ \mathbf{X}=(X_1,\ldots,X_m)^T }[/math] and [math]\displaystyle{ \mathbf{Y}=(Y_1,\ldots,Y_n)^T }[/math] are called uncorrelated if

[math]\displaystyle{ \operatorname{E}[\mathbf{X} \mathbf{Y}^T] = \operatorname{E}[\mathbf{X}]\operatorname{E}[\mathbf{Y}]^T }[/math].

They are uncorrelated if and only if their cross-covariance matrix [math]\displaystyle{ \operatorname{K}_{\mathbf{X}\mathbf{Y}} }[/math] is zero.[5]:p.337

Two complex random vectors [math]\displaystyle{ \mathbf{Z} }[/math] and [math]\displaystyle{ \mathbf{W} }[/math] are called uncorrelated if their cross-covariance matrix and their pseudo-cross-covariance matrix is zero, i.e. if

[math]\displaystyle{ \operatorname{K}_{\mathbf{Z}\mathbf{W}}=\operatorname{J}_{\mathbf{Z}\mathbf{W}}=0 }[/math]

where

[math]\displaystyle{ \operatorname{K}_{\mathbf{Z}\mathbf{W}} =\operatorname{E}[(\mathbf{Z}-\operatorname{E}[\mathbf{Z}]){(\mathbf{W}-\operatorname{E}[\mathbf{W}])}^{\mathrm H}] }[/math]

and

[math]\displaystyle{ \operatorname{J}_{\mathbf{Z}\mathbf{W}} =\operatorname{E}[(\mathbf{Z}-\operatorname{E}[\mathbf{Z}]){(\mathbf{W}-\operatorname{E}[\mathbf{W}])}^{\mathrm T}] }[/math].

Uncorrelated stochastic processes

Two stochastic processes [math]\displaystyle{ \left\{X_t\right\} }[/math] and [math]\displaystyle{ \left\{Y_t\right\} }[/math] are called uncorrelated if their cross-covariance [math]\displaystyle{ \operatorname{K}_{\mathbf{X}\mathbf{Y}}(t_1,t_2) = \operatorname{E} \left[ \left( X(t_1)- \mu_X(t_1) \right) \left( Y(t_2)- \mu_Y(t_2) \right) \right] }[/math] is zero for all times.[2]:p. 142 Formally:

[math]\displaystyle{ \left\{X_t\right\},\left\{Y_t\right\} \text{ uncorrelated} \quad :\iff \quad \forall t_1,t_2 \colon \operatorname{K}_{\mathbf{X}\mathbf{Y}}(t_1,t_2) = 0 }[/math].

See also

References

  1. 1.0 1.1 Papoulis, Athanasios (1991). Probability, Random Variables and Stochastic Processes. MCGraw Hill. ISBN 0-07-048477-5. 
  2. 2.0 2.1 Kun Il Park, Fundamentals of Probability and Stochastic Processes with Applications to Communications, Springer, 2018, 978-3-319-68074-3
  3. Virtual Laboratories in Probability and Statistics: Covariance and Correlation, item 17.
  4. Bain, Lee; Engelhardt, Max (1992). "Chapter 5.5 Conditional Expectation". Introduction to Probability and Mathematical Statistics (2nd ed.). pp. 185–186. ISBN 0534929303. 
  5. Gubner, John A. (2006). Probability and Random Processes for Electrical and Computer Engineers. Cambridge University Press. ISBN 978-0-521-86470-1. 

Further reading

  • Probability for Statisticians, Galen R. Shorack, Springer (c2000) ISBN:0-387-98953-6