G-prior

From HandWiki

In statistics, the g-prior is an objective prior for the regression coefficients of a multiple regression. It was introduced by Arnold Zellner.[1] It is a key tool in Bayes and empirical Bayes variable selection.[2][3]

Definition

Consider a data set (x1,y1),,(xn,yn), where the xi are Euclidean vectors and the yi are scalars. The multiple regression model is formulated as

yi=xiβ+εi.

where the εi are random errors. Zellner's g-prior for β is a multivariate normal distribution with covariance matrix proportional to the inverse Fisher information matrix for β, similar to a Jeffreys prior.

Assume the εi are i.i.d. normal with zero mean and variance ψ1. Let X be the matrix with ith row equal to xi. Then the g-prior for β is the multivariate normal distribution with prior mean a hyperparameter β0 and covariance matrix proportional to ψ1(XX)1, i.e.,

β|ψN[β0,gψ1(XX)1].

where g is a positive scalar parameter.

Posterior distribution of beta

The posterior distribution of β is given as

β|ψ,x,yN[qβ^+(1q)β0,qψ(XX)1].

where q=g/(1+g) and

β^=(XX)1Xy.

is the maximum likelihood (least squares) estimator of β. The vector of regression coefficients β can be estimated by its posterior mean under the g-prior, i.e., as the weighted average of the maximum likelihood estimator and β0,

β~=qβ^+(1q)β0.

Clearly, as g →∞, the posterior mean converges to the maximum likelihood estimator.

Selection of g

Estimation of g is slightly less straightforward than estimation of β. A variety of methods have been proposed, including Bayes and empirical Bayes estimators.[3]

References

  1. Zellner, A. (1986). "On Assessing Prior Distributions and Bayesian Regression Analysis with g Prior Distributions". in Goel, P.; Zellner, A.. Bayesian Inference and Decision Techniques: Essays in Honor of Bruno de Finetti. Studies in Bayesian Econometrics and Statistics. 6. New York: Elsevier. pp. 233–243. ISBN 978-0-444-87712-3. 
  2. George, E.; Foster, D. P. (2000). "Calibration and empirical Bayes variable selection". Biometrika 87 (4): 731–747. doi:10.1093/biomet/87.4.731. 
  3. 3.0 3.1 Liang, F.; Paulo, R.; Molina, G.; Clyde, M. A.; Berger, J. O. (2008). "Mixtures of g priors for Bayesian variable selection". Journal of the American Statistical Association 103 (481): 410–423. doi:10.1198/016214507000001337. 

Further reading

  • Datta, Jyotishka; Ghosh, Jayanta K. (2015). "In Search of Optimal Objective Priors for Model Selection and Estimation". in Upadhyay, Satyanshu Kumar; Singh, Umesh; Dey, Dipak K. et al.. Current Trends in Bayesian Methodology with Applications. CRC Press. pp. 225–243. ISBN 978-1-4822-3511-1. 
  • Marin, Jean-Michel; Robert, Christian P. (2007). "Regression and Variable Selection". Bayesian Core : A Practical Approach to Computational Bayesian Statistics. New York: Springer. pp. 47–84. doi:10.1007/978-0-387-38983-7_3. ISBN 978-0-387-38979-0.