Mixed-data sampling

From HandWiki
Revision as of 21:58, 8 February 2024 by MainAI6 (talk | contribs) (fix)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Econometric models involving data sampled at different frequencies are of general interest. Mixed-data sampling (MIDAS) is an econometric regression developed by Eric Ghysels with several co-authors. There is now a substantial literature on MIDAS regressions and their applications, including Ghysels, Santa-Clara and Valkanov (2006),[1] Ghysels, Sinko and Valkanov,[2] Andreou, Ghysels and Kourtellos (2010)[3] and Andreou, Ghysels and Kourtellos (2013).[4]

MIDAS Regressions

A MIDAS regression is a direct forecasting tool which can relate future low-frequency data with current and lagged high-frequency indicators, and yield different forecasting models for each forecast horizon. It can flexibly deal with data sampled at different frequencies and provide a direct forecast of the low-frequency variable. It incorporates each individual high-frequency data in the regression, which solves the problems of losing potentially useful information and including mis-specification.

A simple regression example has the independent variable appearing at a higher frequency than the dependent variable:

[math]\displaystyle{ y_t = \beta_0 + \beta_1 B(L^{1/m};\theta)x_t^{(m)} + \varepsilon_t^{(m)}, }[/math]

where y is the dependent variable, x is the regressor, m denotes the frequency – for instance if y is yearly [math]\displaystyle{ x_t^{(4)} }[/math] is quarterly – [math]\displaystyle{ \varepsilon }[/math] is the disturbance and [math]\displaystyle{ B(L^{1/m};\theta) }[/math] is a lag distribution, for instance the Beta function or the Almon Lag. For example [math]\displaystyle{ B(L^{1/m};\theta) = \sum_{k=0}^K B(k; \theta) L^{k/m} }[/math].

The regression models can be viewed in some cases as substitutes for the Kalman filter when applied in the context of mixed frequency data. Bai, Ghysels and Wright (2013)[5] examine the relationship between MIDAS regressions and Kalman filter state space models applied to mixed frequency data. In general, the latter involves a system of equations, whereas, in contrast, MIDAS regressions involve a (reduced form) single equation. As a consequence, MIDAS regressions might be less efficient, but also less prone to specification errors. In cases where the MIDAS regression is only an approximation, the approximation errors tend to be small.

Machine Learning MIDAS Regressions

The MIDAS can also be used for machine learning time series and panel data nowcasting.[6][7] The machine learning MIDAS regressions involve Legendre polynomials. High-dimensional mixed frequency time series regressions involve certain data structures that once taken into account should improve the performance of unrestricted estimators in small samples. These structures are represented by groups covering lagged dependent variables and groups of lags for a single (high-frequency) covariate. To that end, the machine learning MIDAS approach exploits the sparse-group LASSO (sg-LASSO) regularization that accommodates conveniently such structures.[8] The attractive feature of the sg-LASSO estimator is that it allows us to combine effectively the approximately sparse and dense signals.

Software packages

Several software packages feature MIDAS regressions and related econometric methods. These include:

  • MIDAS Matlab Toolbox[9]
  • midasr, R package[10]
  • midasml, R package for High-Dimensional Mixed Frequency Time Series Data[11]
  • EViews[12]
  • Python[13]
  • Julia[14]
  • Stata,midasreg

References

  1. Ghysels, Eric, Pedro Santa-Clara and Rossen Valkanov (2006) Predicting Volatility: How to Get Most Out of Returns Data Sampled at Different Frequencies, Journal of Econometrics, 131, 59-95
  2. Ghysels, Eric and Arthur Sinko and Rossen Valkanov (2006) MIDAS Regressions: Further Results and New Directions, Econometric Reviews, 26, 53-90.
  3. Andreou, Elena & Eric Ghysels & Andros Kourtellos "Regression Models with Mixed Sampling Frequencies", Journal of Econometrics, 158, 246-261.
  4. Andreou, Elena & Eric Ghysels & Andros Kourtellos "Should macroeconomic forecasters use daily financial data and how?", Journal of Business and Economic Statistics 31, 240-251.
  5. Bai, Jennie and Eric Ghysels and Jonathan Wright (2013) State Space Models and MIDAS Regressions, Econometric Reviews, 32, 779–813.
  6. Babii, Andrii; Ghysels, Eric; Striaukas, Jonas (2022-07-03). "Machine Learning Time Series Regressions With an Application to Nowcasting" (in en). Journal of Business & Economic Statistics 40 (3): 1094–1106. doi:10.1080/07350015.2021.1899933. ISSN 0735-0015. https://www.tandfonline.com/doi/full/10.1080/07350015.2021.1899933. 
  7. Babii, Andrii; Ball, Ryan T.; Ghysels, Eric; Striaukas, Jonas (2022-07-26). "Machine learning panel data regressions with heavy-tailed dependent data: Theory and application". Journal of Econometrics: 105315. doi:10.1016/j.jeconom.2022.07.001. ISSN 0304-4076. https://www.sciencedirect.com/science/article/pii/S0304407622001282. 
  8. Simon, N., J. Friedman, T. Hastie, and R. Tibshirani (2013): A sparse-group LASSO, Journal of Computational and Graphical Statistics, 22(2), 231-245.
  9. "MIDAS Matlab Toolbox maintained by Hang Qian". https://www.mathworks.com/matlabcentral/fileexchange/45150-midas-matlab-toolbox. 
  10. "midasr: Mixed Data Sampling Regression maintained by Virmantas Kvedaras and Vaidotas Zemlys-Balevicius". 23 February 2021. https://cran.r-project.org/web/packages/midasr/index.html. 
  11. "midasml: Estimation and Prediction Methods for High-Dimensional Mixed Frequency Time Series Data maintained by Jonas Striaukas". 29 April 2022. https://cran.r-project.org/web/packages/midasml/index.html. 
  12. "EViews 9.5 MIDAS Forecasting Demonstration". https://www.eviews.com/EViews9/ev95midas.html. 
  13. "MIDAS Python code". https://github.com/sapphire921/midas_pro. 
  14. "MIDAS Julia". https://github.com/mikemull/Midas.jl. 

See also