100
Views
0
CrossRef citations to date
0
Altmetric
Research Article

On the sparsity of synthetic control method

ORCID Icon &
Article: 2361184 | Received 26 Jul 2023, Accepted 23 May 2024, Published online: 03 Jun 2024
 

ABSTRACT

Synthetic Control Method (SCM) is a popular approach for causal inference in panel data, where the optimal weights for control units are often sparse. But the sparsity of SCM has received little attention in the literature except Abadie (2021), which explores the sparsity from the perspective of predictor space. In this paper, we make three contributions. First, we show that if there is a unique solution, then the number of positive weights is upper-bounded by the number of covariates. Second, we offer a simple alternative explanation about the sparsity of SCM from the perspective of parameter space. Third, we conduct a meta-analysis of empirical studies using SCM in the literature, which shows that the sparsity of SCM decreases with the relative number of covariates. A practical implication is that if the number of positive weights exceeds the number of covariates, there are multiple solutions and possibly unstable weights.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Notes

1 In this paper, we use the terms “covariate” and “predictor” interchangeably.

2 One may regress among x2,,xN+1 and drop the variable with the highest R2, and so on.

3 For illustration purpose, we do not use the options “nested allopt” for more accurate numerical computation, otherwise the algorithm would fail when there is only one covariate (K = 1).

4 Note that Abadie et al. (Citation2010) never estimates the one-covariate case.

5 Among these 30 studies with all positive weights, the number of covariates ranges from 3 to 14 with a mean of 9, and the number of control units ranges from 4 to 54 with a mean of 17.43.

6 See Appendix 2 for a complete list of these papers used in the meta-analysis.

7 Note that all 61 papers with multiple SCM studies used the same donor pools throughout, and 49 of them used the same covariates as well, but 12 of them used different covariates due to missing values.

8 The paper fixed effects term ui captures unobserved heterogeneity across papers, which is analogous to the individual fixed effects in the panel data setting. Since ui may be correlated with the regressors, its omission could result in inconsistent estimation. Moreover, since some paper dummies are significant in the regressions (unreported), the paper fixed effects specification is preferred over the pooled OLS.

9 The opposite specific-to-general approach starts from the smallest model possible (i.e., a model with only the constant term), and proceeds by adding the most significant regressor at each step. However, the specific-to-general approach may suffer from omitted variable bias when the model is too small.

Additional information

Notes on contributors

Qiang Chen

Qiang Chen is a professor at the School of Economics, Shandong University.

Wenjun Li

Wenjun Li is a PhD student at the School of Economics, Shandong University.