Time-dependent shrinkage of time-varying parameter regression models

Econometric Reviews, Vol. 43, No. 1

Abstract

This article studies the time-varying parameter (TVP) regression model in which the regression coefficients are random-walk latent states with time-dependent conditional variances. This TVP model is flexible enough to accommodate a wide variety of time-variation patterns, but it requires effective shrinkage of the state variances to avoid over-fitting. A Bayesian shrinkage prior is proposed, based on a reparameterization that translates the variance shrinkage problem into a variable shrinkage problem in a conditionally linear regression with fixed coefficients. The proposed prior allows strong shrinkage of the state variances while maintaining the flexibility to accommodate local signals. A Bayesian estimation method is developed that employs the ancillarity-sufficiency interweaving strategy (ASIS) to boost sampling efficiency. A simulation study and an empirical application to inflation forecasting illustrate the benefits of the proposed approach.

KEYWORDS:

ASIS
Bayesian shrinkage
horseshoe
MCMC
TVP

JEL Classification:

Acknowledgments

I would like to thank Professor Esfandiar Maasoumi (the editor), an associate editor and two referees for many invaluable comments that have greatly improved the article. All remaining errors are my own. The views in this article are solely the author’s responsibility and are not related to the author’s employer. The author reports that there are no competing interests to declare.

Notes

1 Other recent examples of shrinkage TVP models with homoskedastic latent states include Cadonna et al. (2020) and Chan et al. (2020).

2 Another strand of the literature allows heteroskedastic latent states by applying time-dependent spike-and-slab mixture priors to the state variances (e.g., Giordani and Kohn 2008; Chan et al. 2012; Hauzenberger 2021; Rockova and McAlinn 2021), but it faces a computational hurdle due to the combinatorial complexity of sampling the mixture indicators of the spike-and-slab priors.

3 See Hauzenberger et al. (2020) for similar strategies for versions of TVP models where the latent states follow independent Gaussian distributions rather than random walks.

4 To see this, let $β_{t}^{*} = β_{t} - β_{0}$. The TVP model can be rewritten as $y_{t} = x_{t}^{'} β_{0} + x_{t}^{'} β_{t}^{*} + ϵ_{t}$, $β_{t}^{*} = β_{t - 1}^{*} + η_{t}$, and $β_{0}^{*} = 0$.
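The centering in this note can be checked numerically. The following is a minimal sketch, not the article's code: the function names `simulate_tvp` and `centered_form` are hypothetical, and for simplicity the state innovations are given a constant standard deviation rather than the time-dependent variances studied in the article.

```python
import random

def simulate_tvp(T, beta0, state_sd, noise_sd, seed=0):
    """Simulate y_t = x_t' beta_t + eps_t with beta_t = beta_{t-1} + eta_t.

    Assumption for illustration: eta_t has a constant standard deviation.
    """
    rng = random.Random(seed)
    beta = list(beta0)
    xs, betas, eps, ys = [], [], [], []
    for _ in range(T):
        # Random-walk step starting from beta_0.
        beta = [b + rng.gauss(0.0, state_sd) for b in beta]
        x = [rng.gauss(0.0, 1.0) for _ in beta]
        e = rng.gauss(0.0, noise_sd)
        xs.append(x)
        betas.append(beta)
        eps.append(e)
        ys.append(sum(xi * bi for xi, bi in zip(x, beta)) + e)
    return xs, betas, eps, ys

def centered_form(xs, betas, eps, beta0):
    """Rebuild y_t as x_t' beta_0 + x_t' beta*_t + eps_t, beta*_t = beta_t - beta_0."""
    ys = []
    for x, b, e in zip(xs, betas, eps):
        b_star = [bi - b0 for bi, b0 in zip(b, beta0)]
        fixed = sum(xi * b0 for xi, b0 in zip(x, beta0))
        varying = sum(xi * bs for xi, bs in zip(x, b_star))
        ys.append(fixed + varying + e)
    return ys

if __name__ == "__main__":
    beta0 = [1.0, -0.5]
    xs, betas, eps, ys = simulate_tvp(50, beta0, 0.1, 0.2, seed=1)
    ys_centered = centered_form(xs, betas, eps, beta0)
    print(max(abs(a - b) for a, b in zip(ys, ys_centered)))
```

The two forms should agree up to floating-point rounding, which is the sense in which the initial state acts as a fixed regression coefficient.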

5 Alternative shrinkage priors for linear regressions include the spike-and-slab prior (George and McCulloch 1993; Ishwaran and Rao 2005) and the normal-gamma prior (Griffin and Brown 2010). A comprehensive comparison of the various shrinkage priors in the current TVP context is left for future research.

6 The density of an inverted beta distribution $IB (a, b)$ is $p (x) = \frac{x^{a - 1} {(1 + x)}^{- a - b}}{B (a, b)} I \{x > 0\}$, where $B (\cdot, \cdot)$ is the beta function and $a$ and $b$ are positive real numbers. If $x \sim IB (0.5, 0.5)$, then $\sqrt{x} \sim C^{+} (0, 1)$ and vice versa, where $C^{+} (0, 1)$ is the standard half-Cauchy distribution with density $p (z) = \frac{2}{π (1 + z^{2})} I \{z > 0\}$.
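The link between the inverted beta and half-Cauchy distributions can be illustrated by simulation. This is a rough sketch under stated assumptions: it uses the standard representation of an inverted beta draw as a ratio of two independent unit-scale gamma draws, and the helper names are hypothetical.

```python
import math
import random

def sample_inverted_beta(a, b, rng):
    """Draw from IB(a, b) as G1 / G2 with G1 ~ Gamma(a, 1), G2 ~ Gamma(b, 1)."""
    return rng.gammavariate(a, 1.0) / rng.gammavariate(b, 1.0)

def half_cauchy_cdf(z):
    """CDF of the standard half-Cauchy C+(0, 1) for z >= 0."""
    return 2.0 / math.pi * math.atan(z)

def empirical_cdf_gap(n=20000, seed=0):
    """Largest gap between the empirical CDF of sqrt(x) and the half-Cauchy CDF."""
    rng = random.Random(seed)
    roots = sorted(math.sqrt(sample_inverted_beta(0.5, 0.5, rng)) for _ in range(n))
    return max(abs((i + 1) / n - half_cauchy_cdf(z)) for i, z in enumerate(roots))

if __name__ == "__main__":
    print(empirical_cdf_gap())
```

A small gap (on the order of one over the square root of the sample size) is what one would expect if the square root of an $IB (0.5, 0.5)$ draw is indeed half-Cauchy.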

7 If $\pm \sqrt{x} \sim N (0, a)$, then $x \sim G (0.5, 2 a)$ and vice versa, where the gamma distribution $G (α, β)$ has the density $\frac{1}{Γ (α) β^{α}} x^{α - 1} \exp (- \frac{x}{β})$.
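As a quick sanity check of this note, the first two moments of a squared normal draw can be compared with those of $G (0.5, 2 a)$, whose mean is $a$ and variance is $2 a^{2}$ under the scale parameterization given above. The sketch below is illustrative only; the function name is hypothetical, and it relies on Python's `random.gammavariate(alpha, beta)` using the same shape and scale convention.

```python
import math
import random

def moment_check(a=2.0, n=50000, seed=0):
    """Return (mean, variance) of z**2 with z ~ N(0, a) and of G(0.5, 2a) draws."""
    rng = random.Random(seed)
    squares = [rng.gauss(0.0, math.sqrt(a)) ** 2 for _ in range(n)]
    gammas = [rng.gammavariate(0.5, 2.0 * a) for _ in range(n)]

    def mean_var(values):
        m = sum(values) / len(values)
        v = sum((x - m) ** 2 for x in values) / len(values)
        return m, v

    return mean_var(squares), mean_var(gammas)

if __name__ == "__main__":
    # Both pairs should be close to (a, 2 * a**2).
    print(moment_check())
```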

8 See Johndrow et al. (2020) and Hauzenberger et al. (2020) for methods that use approximations to the exact algorithm of Bhattacharya et al. (2016).

9 Alternative approaches to simulating the latent states from a linear Gaussian state space system include Frühwirth-Schnatter (1994), Rue (2001) and McCausland et al. (2011).

10 In further experiments, two additional data-generating processes were studied in which the ratio of $σ^{2}$ to the variance of the dependent variable is 0.5 and 0.8, respectively. The estimation results are qualitatively similar.

11 Generating 1,000 posterior draws from the GHS and DGHS prior specifications takes about 20 and 26 seconds respectively on a standard desktop computer with a 3.0 GHz Intel Core i5 CPU, running in MATLAB R2020b.

12 The effective sample size (ESS) is computed by the initial monotone sequence method of Geyer (1992) and is normalized by dividing by the number of posterior draws.
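For readers unfamiliar with this estimator, the following is a minimal sketch of a normalized ESS based on Geyer's initial monotone sequence idea. It is not the article's implementation: the function names are hypothetical, and the AR(1) chain is only a synthetic test case.

```python
import random

def autocov(x, lag, mean):
    """Biased sample autocovariance of x at the given lag."""
    n = len(x)
    return sum((x[i] - mean) * (x[i + lag] - mean) for i in range(n - lag)) / n

def normalized_ess(x):
    """ESS divided by the number of draws, via the initial monotone sequence."""
    n = len(x)
    mean = sum(x) / n
    gamma0 = autocov(x, 0, mean)
    if gamma0 <= 0.0:
        raise ValueError("chain has zero variance")
    pair_sums = []
    lag = 0
    while lag + 1 < n:
        # Sum of autocovariances at an even lag and the following odd lag.
        pair = autocov(x, lag, mean) + autocov(x, lag + 1, mean)
        if pair <= 0.0:
            break  # the initial positive sequence ends here
        if pair_sums:
            pair = min(pair, pair_sums[-1])  # enforce a non-increasing sequence
        pair_sums.append(pair)
        lag += 2
    asym_var = -gamma0 + 2.0 * sum(pair_sums)
    if asym_var <= 0.0:
        raise ValueError("non-positive asymptotic variance estimate")
    return gamma0 / asym_var

def ar1_chain(n, phi, seed=0):
    """Synthetic AR(1) draws x_t = phi * x_{t-1} + e_t, for illustration only."""
    rng = random.Random(seed)
    x, out = 0.0, []
    for _ in range(n):
        x = phi * x + rng.gauss(0.0, 1.0)
        out.append(x)
    return out

if __name__ == "__main__":
    print(normalized_ess(ar1_chain(5000, 0.5)))
```

For an AR(1) chain with coefficient 0.5 the theoretical normalized ESS is (1 - 0.5) / (1 + 0.5) = 1/3, so the printed value should be in that neighborhood, while independent draws should give a value near 1.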

13 The data source is the FRED database of the Federal Reserve Bank of St. Louis. The series names are CPILFESL, TB3MS and UNRATE for the consumer price index, the 3-month Treasury bill rate and the unemployment rate, respectively. Quarterly values are computed as the average of the monthly values within each quarter.
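The monthly-to-quarterly aggregation described in this note amounts to a grouped mean. A small sketch follows, assuming the monthly observations are available as (year, month, value) tuples; the function name and the sample values are hypothetical and are not FRED data.

```python
from collections import defaultdict

def quarterly_average(monthly):
    """Average monthly (year, month, value) observations within each quarter."""
    buckets = defaultdict(list)
    for year, month, value in monthly:
        quarter = (month - 1) // 3 + 1  # months 1-3 -> Q1, 4-6 -> Q2, ...
        buckets[(year, quarter)].append(value)
    return {key: sum(vals) / len(vals) for key, vals in sorted(buckets.items())}

if __name__ == "__main__":
    # Made-up values for illustration only.
    sample = [(2020, m, float(m)) for m in range(1, 7)]
    print(quarterly_average(sample))
```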

14 Sampling from the generalized inverse Gaussian (GIG) distribution adapts the MATLAB function gigrnd, written by Enes Makalic and Daniel Schmidt, which implements an algorithm from Devroye (2014).

15 A Gamma distribution $G (0.5, 2)$ is equivalent to a $χ^{2} (1)$ distribution.
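The equivalence can be seen by substituting $α = 0.5$ and $β = 2$ into the gamma density of note 7 and using the standard fact $Γ (1/2) = \sqrt{π}$, which yields the $χ^{2} (1)$ density:

```latex
\frac{1}{\Gamma(1/2)\, 2^{1/2}}\, x^{-1/2} \exp\!\left(-\frac{x}{2}\right)
= \frac{1}{\sqrt{2 \pi x}} \exp\!\left(-\frac{x}{2}\right), \qquad x > 0 .
```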

16 Sampling from the Pólya-gamma distribution uses the MATLAB function pgdraw, written by Enes Makalic and Daniel Schmidt, which implements an algorithm from Windle (2013).

References

Cadonna, A., Frühwirth-Schnatter, S., Knaus, P. (2020). Triple the gamma - a unifying shrinkage prior for variance and variable selection in sparse state space and TVP models. Econometrics 8(2):20. doi:10.3390/econometrics8020020

Chan, J. C., Eisenstat, E., Strachan, R. W. (2020). Reducing the state space dimension in a large TVP-VAR. Journal of Econometrics 218(1):105–118. doi:10.1016/j.jeconom.2019.11.006

Giordani, P., Kohn, R. (2008). Efficient Bayesian inference for multiple change-point and mixture innovation models. Journal of Business & Economic Statistics 26(1):66–77. doi:10.1198/073500107000000241

Chan, J. C., Koop, G., Leon-Gonzalez, R., Strachan, R. W. (2012). Time varying dimension models. Journal of Business & Economic Statistics 30(3):358–367. doi:10.1080/07350015.2012.663258

Hauzenberger, N. (2021). Flexible mixture priors for large time-varying parameter models. Econometrics and Statistics 20:87–108. doi:10.1016/j.ecosta.2021.06.001

Rockova, V., McAlinn, K. (2021). Dynamic variable selection with spike-and-slab process priors. Bayesian Analysis 16(1):233–269. doi:10.1214/20-BA1199

Hauzenberger, N., Huber, F., Koop, G. (2020). Dynamic shrinkage priors for large time-varying parameter regressions using scalable Markov chain Monte Carlo methods. econ.EM. arXiv:2005.03906v1.

George, E. I., McCulloch, R. E. (1993). Variable selection via Gibbs sampling. Journal of the American Statistical Association 88(423):881–889. doi:10.1080/01621459.1993.10476353

Ishwaran, H., Rao, J. (2005). Spike and slab variable selection: Frequentist and Bayesian strategies. Annals of Statistics 33:730–773.

Griffin, J., Brown, P. (2010). Inference with normal-gamma prior distributions in regression problems. Bayesian Analysis 5(1):171–188.

Johndrow, J., Orenstein, P., Bhattacharya, A. (2020). Scalable approximate MCMC algorithms for the horseshoe prior. Journal of Machine Learning Research 21(73):1–61.

Bhattacharya, A., Chakraborty, A., Mallick, B. K. (2016). Fast sampling with Gaussian scale-mixture priors in high-dimensional regression. Biometrika 103(4):985–991. doi:10.1093/biomet/asw042

Frühwirth-Schnatter, S. (1994). Data augmentation and dynamic linear models. Journal of Time Series Analysis 15:183–202. doi:10.1111/j.1467-9892.1994.tb00184.x

Rue, H. (2001). Fast sampling of Gaussian Markov random fields. Journal of the Royal Statistical Society Series B: Statistical Methodology 63(2):325–338. doi:10.1111/1467-9868.00288

McCausland, W. J., Miller, S., Pelletier, D. (2011). Simulation smoothing for state–space models: A computational efficiency analysis. Computational Statistics & Data Analysis 55(1):199–212. doi:10.1016/j.csda.2010.07.009

Geyer, C. (1992). Practical Markov chain Monte Carlo. Statistical Science 7:473–483. doi:10.1214/ss/1177011137

Devroye, L. (2014). Random variate generation for the generalized inverse Gaussian distribution. Statistics and Computing 24(2):239–246. doi:10.1007/s11222-012-9367-z

Windle, J. (2013). Forecasting high-dimensional, time-varying variance-covariance matrices with high-frequency data and sampling Pólya-gamma random variates for posterior distributions derived from logistic likelihoods. PhD thesis. Austin, TX: University of Texas at Austin.
