Dynamic unobserved effects model

From HandWiki
Revision as of 13:49, 18 August 2023 by WikiEditor (talk | contribs) (correction)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Short description: Statistical model used in econometrics


A dynamic unobserved effects model is a statistical model used in econometrics for panel analysis. It is characterized by the influence of previous values of the dependent variable on its present value, and by the presence of unobservable explanatory variables.

The term “dynamic” here means the dependence of the dependent variable on its past history; this is usually used to model the “state dependence” in economics. For instance, for a person who cannot find a job this year, it will be harder to find a job next year because her present lack of a job will be a negative signal for the potential employers. “Unobserved effects” means that one or some of the explanatory variables are unobservable: for example, consumption choice of one flavor of ice cream over another is a function of personal preference, but preference is unobservable.

Continuous dependent variable

Censored dependent variable

In a panel data tobit model,[1][2] if the outcome [math]\displaystyle{ Y_{i,t} }[/math] partially depends on the previous outcome history [math]\displaystyle{ Y_{i,0},\ldots,Y_{t-1} }[/math] this tobit model is called "dynamic". For instance, taking a person who finds a job with a high salary this year, it will be easier for her to find a job with a high salary next year because the fact that she has a high-wage job this year will be a very positive signal for the potential employers. The essence of this type of dynamic effect is the state dependence of the outcome. The "unobservable effects" here refers to the factor which partially determines the outcome of individual but cannot be observed in the data. For instance, the ability of a person is very important in job-hunting, but it is not observable for researchers. A typical dynamic unobserved effects tobit model can be represented as

[math]\displaystyle{ Y_{i,t}=Y_{i,t}^1[Y_{i,t}\gt 0]; }[/math]
[math]\displaystyle{ Y_{i,t}=z_{i,t}\delta+\rho y_{i,t-1}+c_{i}+u_{i,t}; }[/math]
[math]\displaystyle{ c_i\mid y_{i,0},\ldots, y_{i,t-1} \sim F (y_{i,0}x_i); }[/math]
[math]\displaystyle{ u_{i,t}\mid z_{i,t},y_{i,0},\ldots,y_{i,t-1}\sim N(0,1). }[/math]

In this specific model, [math]\displaystyle{ \rho y_{i,t-1} }[/math] is the dynamic effect part and [math]\displaystyle{ c_{i} }[/math] is the unobserved effect part whose distribution is determined by the initial outcome of individual i and some exogenous features of individual i.

Based on this setup, the likelihood function conditional on [math]\displaystyle{ \{y_{i,0}\}^N_{i-1} }[/math] can be given as

[math]\displaystyle{ \prod_{i=1}^N \int f_\theta(c_i \mid y_{i,0},x_i) \left[ \prod_{t=1}^T\Bigl(1[y_{i,t}=0][1-\Phi(z_{i,t}\delta+\rho y_{i,t-1}\gt 0] \frac{\varphi(z_{i,t}\delta+\rho y_{i,t-1}+c_i)}{\Phi(z_{i,t} \delta+\rho y_{i,t-1}+c_i)}\biggr) \right] \, dc_i }[/math]

For the initial values [math]\displaystyle{ \{y_{i,0}\}^N_{i-1} }[/math] ,there are two different ways to treat them in the construction of the likelihood function: treating them as constant, or imposing a distribution on them and calculate out the unconditional likelihood function. But whichever way is chosen to treat the initial values in the likelihood function, we cannot get rid of the integration inside the likelihood function when estimating the model by maximum likelihood estimation (MLE). Expectation Maximum (EM) algorithm is usually a good solution for this computation issue.[3] Based on the consistent point estimates from MLE, Average Partial Effect (APE)[4] can be calculated correspondingly.[5]

Binary dependent variable

Formulation

A typical dynamic unobserved effects model with a binary dependent variable is represented[6] as:

[math]\displaystyle{ P(y_{it} = 1 \mid y_{i,t-1}, \dots , y_{i,0}, z_i, c_i) = G (z_{it} \delta + \rho y_{i,t-1} + c_i) }[/math]

where ci is an unobservable explanatory variable, zit are explanatory variables which are exogenous conditional on the ci, and G(∙) is a cumulative distribution function.

Estimates of parameters

In this type of model, economists have a special interest in ρ, which is used to characterize the state dependence. For example, yi,t can be a woman's choice whether to work or not, zit includes the i-th individual's age, education level, number of children, and other factors. ci can be some individual specific characteristic which cannot be observed by economists.[7] It is a reasonable conjecture that one's labor choice in period t should depend on his or her choice in period t − 1 due to habit formation or other reasons. This dependence is characterized by parameter ρ.

There are several MLE-based approaches to estimate δ and ρ consistently. The simplest way is to treat yi,0 as non-stochastic and assume ci is independent with zi. Then by integrating P(yi,t , yi,t-1 , … , yi,1 | yi,0 , zi , ci) against the density of ci, we can obtain the conditional density P(yi,t , yi,t-1 , ... , yi,1 |yi,0 , zi). The objective function for the conditional MLE can be represented as: [math]\displaystyle{ \sum_{i=1}^N }[/math] log (P (yi,t , yi,t-1, … , yi,1 | yi,0 , zi)).

Treating yi,0 as non-stochastic implicitly assumes the independence of yi,0 on zi. But in most cases in reality, yi,0 depends on ci and ci also depends on zi. An improvement on the approach above is to assume a density of yi,0 conditional on (ci, zi) and conditional likelihood P(yi,t , yi,t-1 , … , yt,1,yi,0 | ci, zi) can be obtained. By integrating this likelihood against the density of ci conditional on zi, we can obtain the conditional density P(yi,t , yi,t-1 , … , yi,1 , yi,0 | zi). The objective function for the conditional MLE[8] is [math]\displaystyle{ \sum_{i=1}^N }[/math] log (P (yi,t , yi,t-1, … , yi,1 | yi,0 , zi)).

Based on the estimates for (δ, ρ) and the corresponding variance, values of the coefficients can be tested[9] and the average partial effect can be calculated.[10]

References

  1. Greene, W. H. (2003). Econometric Analysis. Upper Saddle River, NJ: Prentice Hall. 
  2. The model framework comes from Wooldridge, J. (2002). Econometric Analysis of Cross Section and Panel Data. Cambridge, Mass: MIT Press. p. 542. ISBN 9780262232197. https://archive.org/details/econometricanaly0000wool.  But the author revises the model more general here.
  3. For more details, refer to: Cappé, O.; Moulines, E.; Ryden, T. (2005). "Part II: Parameter Inference". Inference in Hidden Markov Models. New York: Springer-Verlag. ISBN 9780387289823. https://books.google.com/books?id=4d_oEYn8Fl0C&pg=PA347. 
  4. Wooldridge, J. (2002). Econometric Analysis of Cross Section and Panel Data. Cambridge, Mass: MIT Press. p. 22. ISBN 9780262232197. https://archive.org/details/econometricanaly0000wool. 
  5. For more details, refer to: Amemiya, Takeshi (1984). "Tobit models: A survey". Journal of Econometrics 24 (1–2): 3–61. doi:10.1016/0304-4076(84)90074-5. 
  6. Wooldridge, J. (2002): Econometric Analysis of Cross Section and Panel Data, MIT Press, Cambridge, Mass, pp 300.
  7. James J. Heckman (1981): Studies in Labor Markets, University of Chicago Press, Chapter Heterogeneity and State Dependence
  8. Greene, W. H. (2003), Econometric Analysis , Prentice Hall , Upper Saddle River, NJ .
  9. Whitney K. Newey, Daniel McFadden, Chapter 36 Large sample estimation and hypothesis testing, In: Robert F. Engle and Daniel L. McFadden, Editor(s), Handbook of Econometrics, Elsevier, 1994, Volume 4, Pages 2111–2245, ISSN 1573-4412, ISBN:9780444887665,
  10. Chamberlain, G. (1980), “Analysis of Covariance with Qualitative Data,” Journal of Econometrics 18, 5–46