White test

From HandWiki
Revision as of 16:15, 6 February 2024 by John Stpola (talk | contribs) (linkage)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Short description: Statistical test

In statistics, the White test is a statistical test that establishes whether the variance of the errors in a regression model is constant: that is for homoskedasticity.

This test, and an estimator for heteroscedasticity-consistent standard errors, were proposed by Halbert White in 1980.[1] These methods have become widely used, making this paper one of the most cited articles in economics.[2]

In cases where the White test statistic is statistically significant, heteroskedasticity may not necessarily be the cause; instead the problem could be a specification error. In other words, the White test can be a test of heteroskedasticity or specification error or both. If no cross product terms are introduced in the White test procedure, then this is a test of pure heteroskedasticity. If cross products are introduced in the model, then it is a test of both heteroskedasticity and specification bias.

Testing constant variance

To test for constant variance one undertakes an auxiliary regression analysis: this regresses the squared residuals from the original regression model onto a set of regressors that contain the original regressors along with their squares and cross-products.[3] One then inspects the R2. The Lagrange multiplier (LM) test statistic is the product of the R2 value and sample size:

[math]\displaystyle{ \text{LM} = n R^2 . }[/math]

This follows a chi-squared distribution, with degrees of freedom equal to P − 1, where P is the number of estimated parameters (in the auxiliary regression).

The logic of the test is as follows. First, the squared residuals from the original model serve as a proxy for the variance of the error term at each observation. (The error term is assumed to have a mean of zero, and the variance of a zero-mean random variable is just the expectation of its square.) The independent variables in the auxiliary regression account for the possibility that the error variance depends on the values of the original regressors in some way (linear or quadratic). If the error term in the original model is in fact homoskedastic (has a constant variance) then the coefficients in the auxiliary regression (besides the constant) should be statistically indistinguishable from zero and the R2 should be “small". Conversely, a “large" R2 (scaled by the sample size so that it follows the chi-squared distribution) counts against the hypothesis of homoskedasticity.

An alternative to the White test is the Breusch–Pagan test, where the Breusch-Pagan test is designed to detect only linear forms of heteroskedasticity. Under certain conditions and a modification of one of the tests, they can be found to be algebraically equivalent.[4]

If homoskedasticity is rejected one can use heteroskedasticity-consistent standard errors.

Software implementations

  • In R, White's Test can be implemented using the white function of the skedastic package.[5]
  • In Python, White's Test can be implemented using the het_white function of the statsmodels.stats.diagnostic.het_white [6]
  • In Stata, the test can be implemented using the estat imtest, white function.[7]

See also

References

  1. "A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity". Econometrica 48 (4): 817–838. 1980. doi:10.2307/1912934. 
  2. Kim, E.H.; Morse, A.; Zingales, L. (2006). "What Has Mattered to Economics since 1970". Journal of Economic Perspectives 20 (4): 189–202. doi:10.1257/jep.20.4.189. https://www.nber.org/papers/w12526.pdf. 
  3. Verbeek, Marno (2008). A Guide to Modern Econometrics (Third ed.). Wiley. pp. 99–100. ISBN 978-0-470-51769-7. https://archive.org/details/guidetomoderneco0003verb. 
  4. Waldman, Donald M. (1983). "A note on algebraic equivalence of White's test and a variation of the Godfrey/Breusch-Pagan test for heteroscedasticity". Economics Letters 13 (2–3): 197–200. doi:10.1016/0165-1765(83)90085-X. 
  5. "skedastic: Heteroskedasticity Diagnostics for Linear Regression Models". CRAN. https://cran.r-project.org/web/packages/skedastic/index.html. 
  6. "statsmodels v0.12.1". https://www.statsmodels.org/stable/generated/statsmodels.stats.diagnostic.het_white.html. 
  7. Stata. "regress postestimation — Postestimation tools for regress". https://www.stata.com/manuals13/rregresspostestimation.pdf. 

Further reading