Discrete-time proportional hazards

From HandWiki

Hazard rate models are widely used to model duration data in a wide range of disciplines, from bio-statistics to economics.[1]

Grouped duration data are widespread in many applications. Unemployment durations are typically measured over weeks or months and these time intervals may be considered too large for continuous approximations to hold. In this case, we will typically have grouping points [math]\displaystyle{ t_{a} }[/math], where [math]\displaystyle{ a=1,...,A. }[/math]. Models allow for time-invariant and time-variant covariates, but the latter require stronger assumptions in terms of exogeneity.[2] The discrete-time hazard function can be written as:

[math]\displaystyle{ \lambda_{d}(t_{a}| \chi)= Pr(t_{a-1}\leqslant T \lt t_{a} | T \geqslant t_{a-1},x[t_{a-1}])= \frac{S(t_{a-1} | \chi ) - S(t_{a} | \chi) }{S (t_{a-1}| \chi) } }[/math]

where [math]\displaystyle{ S(t_{a} |\chi) }[/math] is the survivor function. It can be shown that this can be rewritten as:

[math]\displaystyle{ \lambda_{d}(t_{a} | \chi) = 1-exp \biggl(-\int \lambda (s) ds \biggr)= 1-exp \Bigl(-exp (ln\lambda_{0s}+x (t_{s-1})'\beta)\biggl) }[/math]

These probabilities provide the building blocks for setting up the Likelihood function, which ends up being:[3]

[math]\displaystyle{ L(\beta,\lambda) = \textstyle \prod[\prod exp(-exp(ln\lambda_{0s}+x_{i}(t_{s}-1)'\beta)\bigr)]\times \bigl(1-exp\bigl(-exp (ln\lambda0_{ai}+ x_{i}(t_{a-1})'\beta)\bigr)\Bigr) }[/math]

This maximum likelihood maximization depends on the specification of the baseline hazard functions. These specifications include fully parametric models, piece-wise-constant proportional hazard models, or partial likelihood approaches that estimate the baseline hazard as a nuisance function.[4] Alternatively, one can be more flexible for the baseline hazard [math]\displaystyle{ \lambda^d_{0}(t) }[/math] and impose more structure for [math]\displaystyle{ \lambda^d_{i}(t)=\lambda^d_{0}(t)exp(-x_{i}'\beta). }[/math] This approach performs well for certain measures and can approximate arbitrary hazard functions relatively well, while not imposing stringent computational requirements.[5] When the covariates are omitted from the analysis, the maximum likelihood boils down to the Kaplan-Meier estimator of the survivor function.[6]

Another way to model discrete duration data is to model transitions using binary choice models.[7]

References

  1. Jenkins, Stephen P. Estimation of discrete time (grouped duration data) proportional hazards models: pgmhaz (Report). ESRC Research Centre on Micro-Social Change, University of Essex. https://www.iser.essex.ac.uk/files/teaching/stephenj/ec968/pdfs/STB-39-pgmhaz.pdf. 
  2. Wooldridge, J. (2002): Econometric Analysis of Cross Section and Panel Data, MIT Press, Cambridge, Mass.
  3. Cameron A. C. and P. K. Trivedi (2005): Microeconometrics: Methods and Applications. Cambridge University Press, New York.
  4. Wooldridge, J. (2002): Econometric Analysis of Cross Section and Panel Data, MIT Press, Cambridge, Mass.
  5. Han, A. K., and J. A. Hausman (1990): Flexible Parametric Estimation of Duration and Competing Risk Models. Journal of Applied Econometrics, 5, pp. 1-28
  6. Lancaster, T. (1990): The Econometric Analysis of Transition Data. Cambridge University Press, Cambridge.
  7. Cameron A. C. and P. K. Trivedi (2005): Microeconometrics: Methods and Applications. Cambridge University Press, New York.