299
Views
1
CrossRef citations to date
0
Altmetric
Research Article

Estimation of high-dimensional seemingly unrelated regression models

, &
 

Abstract

In this article, we investigate seemingly unrelated regression (SUR) models that allow the number of equations (N) to be large and comparable to the number of the observations in each equation (T). It is well known that conventional SUR estimators, for example, the feasible generalized least squares estimator from Zellner (Citation1962) does not perform well in a high-dimensional setting. We propose a new feasible GLS estimator called the feasible graphical lasso (FGLasso) estimator. For a feasible implementation of the GLS estimator, we use the graphical lasso estimation of the precision matrix (the inverse of the covariance matrix of the equation system errors) assuming that the underlying unknown precision matrix is sparse. We show that under certain conditions, FGLasso converges uniformly to GLS even when T < N, and it shares the same asymptotic distribution with the efficient GLS estimator when T>NlogN. We confirm these results through finite sample Monte-Carlo simulations.

JEL Classification:

Notes

1 The core firms are linked to each other and to the periphery firms, while the periphery firms are not linked to each other directly but through the core firms.

2 Note smin(Ω)=1smax(Σ)1|Σ|.

3 More precisely, in each replication, we divide the T samples into five folds and use four of them as the training data set and one as the validation set. With each choice of λ, we estimate the β̂FGLasso estimators using the training data, then plug them into the validation set and calculate the mean squared error. We choose λn that minimizes the averaged MSE.

4 For four-nearest neighbor lattices design, N can only be square number, so we choose .N={49,100,196,289,400}.

5 Here, the l norm is the element-wise maximum norm ·, and the RMSE is defined as ·F/N.

6 In practice, the time for deriving estimator also depends on how the penalty parameter is chosen. For example, in five-fold cross validation, it requires 26 times of calculation before deriving the final estimator.

7 For simplicity, we omit the conditional sign in P(|X) throughout the proof of Lemma 3.1.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.