1,229
Views
38
CrossRef citations to date
0
Altmetric
Original Articles

Markov-Switching Three-Pass Regression Filter

, &
Pages 285-302 | Received 01 Dec 2016, Published online: 16 Oct 2018
 

ABSTRACT

We introduce a new approach for the estimation of high-dimensional factor models with regime-switching factor loadings by extending the linear three-pass regression filter to settings where parameters can vary according to Markov processes. The new method, denoted as Markov-switching three-pass regression filter (MS-3PRF), is suitable for datasets with large cross-sectional dimensions, since estimation and inference are straightforward, as opposed to existing regime-switching factor models where computational complexity limits applicability to few variables. In a Monte Carlo experiment, we study the finite sample properties of the MS-3PRF and find that it performs favorably compared with alternative modeling approaches whenever there is structural instability in factor loadings. For empirical applications, we consider forecasting economic activity and bilateral exchange rates, finding that the MS-3PRF approach is competitive in both cases. Supplementary materials for this article are available online.

ACKNOWLEDGMENTS

The authors thank the editor Todd Clark, an anonymous associate editor, three anonymous referees, Dalibor Stevanovic, James MacKinnon, seminar participants at Queen’s University, Banco de España and the 2016 conference of the Canadian Econometric Study Group held at Western University for helpful comments on a previous version of this article. The authors also thank Agustín Díaz for excellent research assistance. The views expressed in this article are those of the authors. No responsibility for them should be attributed to the Banco de España, the Eurosystem, the OECD or its member countries.

Notes

1 Groen and Kapetanios (Citation2016) showed that partial least squares (and Bayesian methods) perform better than principal components when forecasting based on a large dataset with a weak factor structure. As partial least squares is obtained as a special case of the 3PRF (see Kelly and Pruitt Citation2015 for details), our method can also be adopted to introduce Markov switching in partial least-square regressions.

2 For example, extracting one factor from the MS-3PRF approach using a panel of 130 macroeconomic and financial variables with gross domestic product (GDP) growth as a target proxy takes about 350 sec using a laptop with a 2.7 GHz processor and 16 GB RAM.

3 In a Bayesian context, Guérin and Leiva-Leon (Citation2017) developed an algorithm to estimate a high-dimensional factor-augmented VAR model with regime-switching parameters in the factor loadings to study the interactions between monetary policy, the stock market, and the connectedness of industry-level stock returns. See also Von Ganske (Citation2016), who introduced regime-switching parameters in partial least-square regressions from a Bayesian perspective so as to forecast industry stock returns. Using a Bayesian framework, Hamilton and Owyang (Citation2012) developed a framework for modeling common Markov-switching components in panel datasets with large cross-sectional and time-series dimensions to estimate turning points in U.S. state-level employment data.

4 Kelly and Pruitt (Citation2015) specified Equation (Equation1) as yt + 1 = β0 + β1ft + ηt + 1, we use yt as dependent variable to have a comparable timing of the Markov chains with the other model equations. Also, the lag structure in Equation (Equation1) corresponds to the estimation step to calculate forecasts for y; for ease of exposition, we present the case of one-step forecasts, but the framework is straightforward to extend to the case of multi-step forecasts as we explain below.

5 Note that since proxies are used to compute the factors ft, we need as many proxy variables as factors to be extracted, that is, kf.

6 The variances of the errors in Equations (Equation1) to (Equation3) are also allowed to be time-varying, with the time variation in each variance driven by the same variable-specific Markov chain that also drives changes in the parameters of the conditional mean of the corresponding variable.

7 This assumption is always satisfied if there is no Markov switching in (Equation2). Otherwise, its validity is an empirical issue that can be assessed by comparing the estimated Markov chains in regressions of each xit either on the (estimated) factors or on the proxy variables zt.

8 As it will be shown in Section 3, Monte Carlo experiments suggest a better performance, in finite samples, when using ϕ^B,it.

9 In a recursive forecasting exercise over a period T + 1, …, T + H, the forecast origin T would be recursively updated, as well as all the estimates in steps 1 to 3.

10 Forecasts for the predicted probabilities h-period ahead for h ∈ {1, 2, …} are calculated recursively as P(SyT + h = jT) = ∑Mi = 1py, ijP(SyT + h − 1 = jT).

11 The use of direct estimation for multi-step forecasting in nonlinear models is rather common, see, for example, Stock and Watson (Citation1999), as it leads to major computational gains. Yet, the resulting direct forecasts are in general suboptimal with respect to the iterated ones, as the type of nonlinearity linking yt and th dated explanatory variables is in general different from that linking yt and t − 1 dated explanatory variables. In our context, the error term ηt in (11) can be serially correlated if the true model for yt is Equation (Equation1) and the factors are serially correlated, and this can affect the properties of the ML estimates of the states. An alternative forecasting procedure requires the introduction of a parametric model for the factors (e.g., a VAR), used to produce iterated forecasts for the f^T+h-1. Forecasts for yT + h can be then obtained as y^T+h|T=j=1M(P(SyT+h=j|ΩT)β^0(SyT+h=j)+P(SyT+h=j|ΩT)β^(SyT+h=j)f^T+h-1). We leave an evaluation of this alternative forecasting approach for future research.

12 Ching, Fung, and Ng (Citation2002) proposed a multivariate Markov chain approach for modeling multiple categorical data sequences.

13 The three-letter country codes follow the convention from the International Olympic Committee except for Taiwan, labeled as TAI.

14 Data for the euro before January 1999 and Taiwan were obtained from the U.S. Federal Reserve G.5 table (monthly average of daily data).

16 The Diebold and Mariano (Citation1995) test of equal out-of-sample predictive accuracy is reported to give a sense of statistical significance of the point forecasting results. However, this test is based on the population MSFE (not the actual MSFE), so this test tends to reject the null of equal MSFEs too often.

17 Admittedly, in the case of the euro, the forecasting performance of the MS-3PRF (first and third passes) and the MSS-3PRF (first and third passes) approaches deteriorates as the forecast horizon lengthens, suggesting that it is not always relevant to model regime shifts in the forecasting equation.

18 Data descriptions and details on data transformation are available online at https://research.stlouisfed.org/econ/mccracken/fred-databases/Appendix_Tables_Update.pdf. The slight modifications we made to the original dataset are reported in the appendix.

19 Our results are qualitatively robust to the use of a different number of factors in the predictive equation. Detailed results are available on request.

20 When estimating the number of factors using information criteria, it is common to find a large number of factors summarizing the comovements of U.S. macroeconomic variables (e.g., McCracken and Ng (Citation2016) estimated eight factors in the FRED-MD monthly macroeconomic database). However, in the forecasting exercise, in line with the literature, we use the first factor in the predictive equation. This corresponds to a real economic activity factor that closely follows the U.S. business cycle dynamics (see ). Using the first two factors in the predictive equation led to little changes in the forecasting performance.

21 The Bry and Boschan (Citation1971) algorithm is a nonparametric method to estimate cycles in time series. We implemented the quarterly version of the Bry and Boschan (Citation1971) algorithm from Harding and Pagan (Citation2002), using the GAUSS code available at http://www.ncer.edu.au/resources/data-and-code.php.

22 We also compare the regime probabilities obtained with GDP growth as a target proxy with those estimated with the MS-3PRF factor as a target proxy. The regime probabilities are similar in both cases, which provides empirical support for our modeling assumption that the Markov chains in Equations (Equation3) and (Equation5) are the same.

23 This result still holds when doing PCA at a monthly frequency and then aggregating the factor at a quarterly frequency.

24 As a side note, the first pass of the 3PRF filter could possibly accommodate mixed-frequency data using the techniques outlined in Foroni, Guérin, and Marcellino (Citation2018); whereas, in the third pass of the filter, unrestricted mixed data sampling (MIDAS) polynomials could be used as in Hepenstrick and Marcellino (Citation2016), and regime-switching parameters in the mixed-frequency predictive equation could be modeled as in Guérin and Marcellino (Citation2013).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.