Research Article

A method to evaluate the rank condition for CCE estimators


Abstract.

We develop a binary classifier to evaluate whether the rank condition (RC) is satisfied or not for the Common Correlated Effects (CCE) estimator. The RC postulates that the number of unobserved factors, m, is not larger than the rank of the unobserved matrix of average factor loadings, ϱ. When this condition fails, the CCE estimator is inconsistent, in general. Despite its importance, to date this rank condition could not be verified. The difficulty lies in the fact that factor loadings are unobserved, such that ϱ cannot be directly determined. The key insight in this article is that ϱ can be consistently estimated with existing techniques through the matrix of cross-sectional averages of the data. Similarly, m can be estimated consistently from the data using existing methods. Thus, a binary classifier, constructed by comparing estimates of m and ϱ, correctly determines whether the RC is satisfied or not as (N,T)→∞. We illustrate the practical relevance of testing the RC by studying the effect of the Dodd-Frank Act on bank profitability. The RC classifier reveals that the rank condition fails for a subperiod of the sample, in which case the estimated effect of bank size on profitability appears to be biased upwards.

1. Introduction

In a seminal paper, Pesaran (2006) put forward the Common Correlated Effects (CCE) approach for √NT-consistent estimation of panel data models with a multifactor error structure. The method aims to control for the unobserved common factors by augmenting the regression model with cross-sectional averages (CSA) of the observables. The CCE estimator has been applied to a large range of fields,Footnote1 and it has also been extended to several theoretical settings.Footnote2 Such popularity of CCE can be attributed to the computational simplicity as well as the excellent finite-sample performance of the estimator in stylised setups.

Nevertheless, CCE comes at a cost. In particular, the CSA of the observables are valid proxies for the unobserved factors only if the number of factors, m, does not exceed the rank of the matrix of averaged factor loadings, ϱ. This so-called “rank condition” (RC) implies that there exist at least as many observables holding linearly independent information about the unobserved factors as there are factors. Westerlund and Urbain (2013) demonstrate that the CCE estimator is inconsistent when the RC fails and the factor loadings are correlated with the regressors. Furthermore, Karabiyik et al. (2019) and Juodis et al. (2021) show that even when the factor loadings are uncorrelated with the regressors, failure of the RC leads to a lower rate of consistency for the CCE estimator.

Despite the importance of the RC for the asymptotic properties of the CCE estimator, practitioners typically take this assumption for granted. The main reason is that the matrix of average factor loadings is unobserved and therefore its rank cannot be directly evaluated or estimated.

This article puts forward a binary classifier that evaluates the rank condition. The key insight is that the rank of the unobserved matrix of average factor loadings, ϱ, can be established from the matrix of CSA of the data. We shall show that ϱ can be estimated consistently using existing procedures developed for determining the true rank of an unknown matrix; see e.g., Camba-Mendez and Kapetanios (2009) and Al-Sadoon (2017) for an overview of this literature. Similarly, the number of factors, m, can be estimated from the data in a straightforward manner using existing methods, such as those developed by Onatski (2010), Ahn and Horenstein (2013), and Kapetanios (2010), among many others. Comparing consistent estimates of m and ϱ, denoted m^ and ϱ^, respectively, the rank condition is deemed to be satisfied when the classifier RC^ ≡ 1 − 𝟙{ϱ^ < m^} equals 1, where 𝟙{⋅} is an indicator function that returns 1 when the argument inside the curly brackets holds true and 0 otherwise. RC^ is shown to be consistent, i.e., it correctly determines whether the rank condition is satisfied or not, with probability 1 as (N,T)→∞.

When the RC is violated for the standard CCE approach, one can augment the model with additional CSA that contain new information about the factors. Several potential augmentations have been suggested (see e.g., Chudik and Pesaran, 2015; Juodis, 2022; Karabiyik et al., 2019). However, it is not always clear which set of additional CSA to choose, and whether the selected augmentation is sufficient to restore the RC.Footnote3 To address these issues, we put forward a strategy that combines the classifier proposed in this article and the IC criterion of Karabiyik et al. (2019). The resulting procedure enables consistent CCE estimation of panel data models with a multifactor error structure, even in cases where the rank condition fails for the original CCE estimator.

We illustrate the practical relevance of our RC classifier and augmentation strategy by studying the effect of the Dodd-Frank Act of 2010 on bank profitability. In particular, based on a random sample of 450 banks, we analyze bank profitability conditional on several potential drivers, controlling for macro-risk factors and common shocks. To examine the impact of the Dodd-Frank Act, we estimate the model separately over two subperiods, namely 2006:Q1-2010:Q4 and 2011:Q1-2019:Q4. The RC classifier reveals that the rank condition fails for the first subperiod. By augmenting the standard set of CSA using external variables, our procedure is able to restore the rank condition. This proves to be important because the estimated effect of bank size on profitability is significantly lower when the RC is restored.

In what follows, we will use 𝐀⁺ to denote the Moore-Penrose pseudo-inverse of the matrix 𝐀, rk(𝐀) for its rank, |𝐀| for the determinant and ‖𝐀‖ = [tr(𝐀′𝐀)]^{1/2} for its Euclidean (Frobenius) matrix norm. For an n×n matrix 𝐀, λ1(𝐀) ≥ λ2(𝐀) ≥ ⋯ ≥ λn(𝐀) denote its n ordered eigenvalues. vec(·) denotes the vectorization operation. Finally, ⌊a⌋ (⌈a⌉) is the floor (ceiling) function, which yields the largest (smallest) integer less than (greater than) or equal to a.

2. A multifactor panel data model and CCE

2.1. Model and assumptions

We study the following linear regression model with unobserved common factors: yi = Xiβ + Fλi + εi, (1) where yi = [yi1,…,yiT]′ denotes a T×1 vector of observations on the dependent variable for individual i, Xi = [xi1,…,xiT]′ denotes a T×K matrix of covariates, where 𝐱it is K×1, and 𝜷 is a K×1 vector of unknown parameters of interest with ‖𝜷‖ < ∞. The error term is composite, such that F = [f1,…,fT]′ denotes a T×m matrix of unobserved common factors, where 𝐟t is m×1, and 𝝀i denotes an m×1 vector of factor loadings. The dimension m is fixed and finite. Finally, εi = [εi1,…,εiT]′ is a T×1 vector of purely idiosyncratic disturbances.

Following Pesaran (2006), we assume that the covariates are also subject to a common factor structure, such that the data generating process (DGP) for 𝐗i is given by Xi = FΓi + Vi, (2) where 𝚪i denotes an m×K matrix of factor loadings, and Vi = [vi1,…,viT]′ is a T×K matrix of idiosyncratic errors.

Replacing 𝐗i in Eq. (1) by the expression in Eq. (2), and stacking the observables into a T×(1+K) matrix Zi = [yi, Xi] ≡ [zi1,…,ziT]′, yields Zi = FCi + Ui, (3) where Ci = [δi, Γi] is of order m×(1+K) with δi = λi + Γiβ, and Ui = [εi + Viβ, Vi]. In what follows, it is important to note that 𝐂i can be written as Ci = C̃iB, with C̃i = [λi, Γi]; B = [1, 0_{1×K}; β, I_K]. (4)

Therefore, since 𝐁 has full rank, the rank of 𝐂i is solely determined by the matrix of factor loadings C~i.
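As an illustration, the DGP in Eqs. (1)–(3) can be simulated in a few lines. The numpy sketch below uses hypothetical parameter values of our own choosing (they are not the article's Monte Carlo design); the loadings 𝚪i are drawn correlated with 𝝀i, as permitted by Assumption 3:

```python
import numpy as np

rng = np.random.default_rng(0)
T, N, K, m = 100, 50, 1, 2
beta = np.array([3.0])

F = rng.standard_normal((T, m))          # unobserved common factors
Z = []                                   # stacked observables Z_i = [y_i, X_i]
for i in range(N):
    lam = rng.standard_normal(m)                           # lambda_i, loadings in y_i
    Gam = lam.reshape(m, K) + rng.standard_normal((m, K))  # Gamma_i, correlated with lambda_i
    X = F @ Gam + rng.standard_normal((T, K))        # Eq. (2): X_i = F Gamma_i + V_i
    y = X @ beta + F @ lam + rng.standard_normal(T)  # Eq. (1): y_i = X_i beta + F lambda_i + eps_i
    Z.append(np.column_stack([y, X]))                # Eq. (3): Z_i = F C_i + U_i
```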

The following assumptions are made throughout the article:

Assumption 1.

(Idiosyncratic errors) εit and vit are mean zero, covariance-stationary, and independent across i, with E(εit⁴) < ∞ and E‖vit‖⁴ < ∞ for all i and t. Let U = [U1,…,UN] be such that λ1(U′U/M) = Op(1) and λ_{⌊dcM̃⌋}(U′U/M) ≥ c + op(1) for some real c > 0 and dc ∈ [0,1), where M = max{N,T} and M̃ = min{N,T}.

Assumption 2.

(Common factors) ft is covariance-stationary with E‖ft‖⁴ < ∞ and absolutely summable autocovariances. In addition, rk(F) = m and T^{-1}F′F → ΣF as T→∞, where ΣF is positive definite.

Assumption 3.

(Factor loadings) C̃i is generated according to C̃i = C̃ + Ξi; ξi ∼ i.i.d.(0_{m(K+1)}, Ωξ), (5)

where C̃ = E(C̃i) ≡ [λ, Γ] such that ‖C̃‖ < ∞, 𝛏i = vec(Ξi) and Ωξ = E(ξiξi′) with ‖Ωξ‖ < ∞. In addition, N^{-1}∑_{i=1}^N CiCi′ → ΣC as N→∞, with ΣC positive definite.

Assumption 4.

(Independence) ft, εis, vjl, and ξh are mutually independent for all t, i, s, j, l, h.

The setup described by the DGP in Eq. (3) together with Assumptions 1–4 is similar to that in Pesaran (2006) but deviates in the following respects. First, we focus on a model with homogeneous slope coefficients and without fixed effects. This is for ease of exposition only, as the results below also follow through under the assumption of independent random coefficients with a common mean, as in Pesaran (2006). See Section 3.5 for a discussion, and also Appendix C.2 for simulation evidence. Second, following Westerlund and Urbain (2013) and Karabiyik et al. (2019), Assumption 3 generalizes Pesaran (2006) by allowing 𝝀i and 𝚪i to be mutually correlated, although uncorrelated across cross-sectional units i. Third, we introduce more explicit regularity conditions on the innovations, factors and their loadings compared to what is typically the case in the CCE literature. In particular, Assumption 1 places restrictions on the eigenvalues of the innovation covariance matrix as in Ahn and Horenstein (2013). They exclude the presence of factors in the innovations and bound the ⌊dcM̃⌋ largest eigenvalues away from zero. For the factors and loadings, the non-central second moments are assumed to converge to a positive definite matrix. These regularity conditions are common in the factor literature (see e.g., Ahn and Horenstein, 2013; Bai and Ng, 2002) and allow us to consistently estimate m. Lastly, note that rk(𝐅) = m of Assumption 2 implies that T ≥ m.

2.2. CCE and the rank condition

Since 𝐅 enters into the data generating process of both 𝐲i and 𝐗i, and 𝝀i and 𝚪i are allowed to be mutually correlated, 𝐗i is endogenous. Therefore, standard panel data estimators, such as the two-way fixed effects estimator, fail to be consistent for the parameters of interest, 𝜷. The key idea of CCE is to replace 𝐅 with CSA of the observables in Eq. (3).

In particular, taking sample averages over i in Eq. (3), we obtain Z̄ = FC̄ + Ū, (6) where Z̄ and Ū are T×(K+1) and C̄ is m×(K+1), with Z̄ = [z̄1,…,z̄T]′, Ū = [ū1,…,ūT]′, and bars denoting CSA as in Z̄ = N^{-1}∑_{i=1}^N Zi.

Under Assumptions 1–4 it is easy to show that C̄ = C + Op(N^{-1/2}), where 𝐂 = E(𝐂i), and ūt = Op(N^{-1/2}) for all t = 1,…,T. As a result, the observed CSA converge to a linear combination of the m common factors at every t = 1,…,T: z̄t = C′ft + (C̄ − C)′ft + ūt = C′ft + Op(N^{-1/2}). (7)

Suppose that 𝐂 has full rank such that 𝐂𝐂′ is invertible and bounded by Assumption 3. Pre-multiplying Eq. (7) by 𝐂 and solving for 𝐟t yields ft = (CC′)^{-1}C z̄t + Op(N^{-1/2}). (8)

Hence, as N→∞, the common factor component at time t can be controlled for (or estimated) with the cross-section averages z̄t.
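Eq. (8) is easy to verify numerically. In the sketch below (a toy example with an arbitrary full-rank choice of C and noise scales of our own choosing), the sampling error in C̄ and Ū is injected directly at the Op(N^{-1/2}) rate, and the factor recovery error shrinks as N grows:

```python
import numpy as np

rng = np.random.default_rng(1)
T, m, K = 50, 2, 2
F = rng.standard_normal((T, m))
C = np.array([[1.0, 0.5, 0.2],
              [0.3, 1.0, 0.8]])          # full-rank m x (K+1) example for C = E(C_i)

def factor_recovery_error(N):
    # Zbar = F Cbar + Ubar, with Cbar - C and Ubar both O_p(N^{-1/2})
    Cbar = C + 0.5 * rng.standard_normal(C.shape) / np.sqrt(N)
    Ubar = 0.5 * rng.standard_normal((T, K + 1)) / np.sqrt(N)
    Zbar = F @ Cbar + Ubar
    # Eq. (8): f_t = (C C')^{-1} C zbar_t + O_p(N^{-1/2}), applied to all rows at once
    F_hat = Zbar @ C.T @ np.linalg.inv(C @ C.T)
    return np.abs(F_hat - F).max()

errors = {N: factor_recovery_error(N) for N in (100, 10_000)}
```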

The pooled CCE estimator for 𝜷 is the least-squares estimator given by β^ = (∑_{i=1}^N Xi′M̄Xi)^{-1} ∑_{i=1}^N Xi′M̄yi, (9) where M̄ = IT − Z̄(Z̄′Z̄)⁺Z̄′.Footnote4
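A direct transcription of Eq. (9) into numpy might look as follows. This is a minimal sketch under the homogeneous-slope model without fixed effects, with the Moore-Penrose inverse used inside the annihilator matrix M̄ as in its definition:

```python
import numpy as np

def cce_pooled(y_list, X_list):
    """Pooled CCE, Eq. (9): least squares after projecting out the CSA."""
    T = y_list[0].shape[0]
    ybar = np.mean(y_list, axis=0)        # cross-sectional average of y_i
    Xbar = np.mean(X_list, axis=0)        # cross-sectional average of X_i
    Zbar = np.column_stack([ybar, Xbar])  # T x (K+1) matrix of CSA
    M = np.eye(T) - Zbar @ np.linalg.pinv(Zbar.T @ Zbar) @ Zbar.T
    A = sum(X.T @ M @ X for X in X_list)
    b = sum(X.T @ M @ y for X, y in zip(X_list, y_list))
    return np.linalg.solve(A, b)
```

With data generated under a full-rank C, the estimates concentrate around the true 𝜷 as N and T grow.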

The above idea of estimating factors with CSA crucially relies upon the assumption that 𝐂 has full rank. This restriction, known as the “rank condition” (RC), corresponds to ϱ = m, (10) where ϱ = rk(𝐂). When ϱ < m, the RC fails and the CCE estimator is generally inconsistent. This is because the CSA do not contain enough information on 𝐟t, which implies that the factor estimator in Eq. (8) does not exist.

There are several cases where the RC may fail. To begin with, such failure occurs when m > K+1, i.e., the number of factors is larger than the number of CSA, in which case ϱ ≤ min{m, K+1} = K+1 < m. In addition, although K+1 ≥ m is a necessary condition for the RC to hold, it is by no means sufficient. For example, certain columns of Z̄ can be asymptotically uninformative because the corresponding observables: (i) do not load on the common factors (e.g., some of the columns in 𝚪i equal zero); (ii) have factor loadings that average out (e.g., Γ̄ = Op(N^{-1/2})); or (iii) do not contain information on the common factors that is distinct from that already provided by other observables. In all these cases, the number of columns that are informative to estimate 𝐟t, as measured by ϱ, can be lower than m.Footnote5

3. Evaluating the rank condition

Despite the importance of the RC for the properties of the CCE estimator, this assumption is typically taken for granted. The main reason is that the population mean of the matrix of factor loadings, 𝐂, is unobserved and therefore its rank cannot be directly evaluated or estimated. The key insight of this article is that ϱ can be determined by estimating the rank of Z̄ using existing techniques. Given a consistent estimate of ϱ, the RC is evaluated by direct comparison of that value with a consistent estimate for m. The latter can be determined from the observed data in a straightforward manner, based on e.g., Bai and Ng (2002), Alessi et al. (2010), Onatski (2010), and Ahn and Horenstein (2013).

The following two sections provide details for consistent estimation of ϱ and m as (N,T)→∞. Section 3.3 puts forward a binary classifier that evaluates the rank condition correctly with probability 1 as (N,T)→∞. Section 3.4 discusses a strategy for obtaining a consistent CCE estimator when the RC fails.

3.1. Consistent estimation of ϱ

We make use of the fact that the rank of an unobserved matrix 𝐀 can be determined through a √N-consistent estimator of that matrix, AN = A + Op(N^{-1/2}); see e.g., Robin and Smith (2000) and Kleibergen and Paap (2006). Noting that Z̄ = FC + Op(N^{-1/2}) is √N-consistent for 𝐅𝐂 (for T fixed), it follows from the rank equivalence rk(FC) = rk(C) = ϱ (11) that ϱ can be consistently estimated by applying a rank estimator to Z̄.Footnote6

Many popular rank estimators are based either on sequential testing procedures, as e.g., in Chen and Fang (2019), or on information criteria (IC), see e.g., Cragg and Donald (1997). Camba-Mendez and Kapetanios (2009) provide an overview and conclude that sequential testing procedures have an advantage over IC methods under several modeling scenarios. Therefore, in the remainder of this section, we closely follow the sequential testing procedure developed by Robin and Smith (2000). It is easy to implement and relies on relatively mild assumptions, in that it does not require the variance-covariance matrix of the estimator of the unknown matrix 𝐅𝐂 to be of full rank, or its rank to be known.

A major complication that arises in our setting, however, is that unlike Robin and Smith (2000), where the dimensions of the target matrix are fixed as the sample size grows, here 𝐅𝐂 and its estimator Z̄ are of order T×n.Footnote7 Therefore, the number of rows increases with the time dimension, such that Z̄ = FC + Op(T^{1/2}N^{-1/2}) is not √N-consistent when (N,T)→∞. To circumvent this issue, we introduce a narrow matrix 𝚿 of order n×T, such that ΨZ̄ is n×n and rk(𝚿𝐅𝐂) = rk(𝐅𝐂). That is, 𝚿 has the role of reducing the dimension of Z̄ without altering the rank of the matrix it estimates. The following assumption is imposed:

Assumption 5.

(Dimension reduction matrix) As (N,T)→∞, Ψ satisfies (i) ‖ΨF‖ = Op(1); (ii) ‖ΨŪ‖ = Op(N^{-1/2}); (iii) √N vec(ΨZ̄ − ΨFC) →L N(0, Ω).

Assumption 5 places additional restrictions on the potential choices for 𝚿, besides it being rank preserving. Assumption 5(i) requires that the entries of 𝚿 are sufficiently bounded. Assumption 5(ii) states that 𝚿 is asymptotically uncorrelated with Ū, the error term in Eq. (6). Assumption 5(iii) ensures that, by application of a suitable central limit theorem, ΨZ̄ remains a √N-consistent estimator for 𝚿𝐅𝐂 and is asymptotically normally distributed with variance 𝛀 as (N,T)→∞. This assumption is identical to Assumption 2.2 in Robin and Smith (2000), except that it is imposed on ΨZ̄ rather than Z̄ itself.

In practice, there exist several options for 𝚿 that satisfy the rank preservation condition and Assumption 5. One option is to set Ψ = T^{-1/2}Φ, with the entries of 𝚽 drawn from the standard normal distribution. The following theorem confirms that this choice is rank-preserving and satisfies the required consistency and boundedness conditions of Assumption 5. In turn, Assumption 5(iii) is easily seen to hold by application of a CLT.

Theorem 1.

Let T > n and 𝚽 be an n×T random matrix with i.i.d. standard normal entries. (i) For a T×n matrix A, it holds that Pr[rk(ΦA) = rk(A)] = 1.

(ii) Let Ψ = T^{-1/2}Φ. Under Assumptions 1–4, as (N,T)→∞, it follows that ΨZ̄ = ΨFC + Op(N^{-1/2}),

where ‖ΨFC‖ = Op(1).

The proof of Theorem 1 is in Appendix B.

Remark 3.1.

An alternative stochastic option would be Ψ = Z̄′/T. However, this is ruled out because even though ΨZ̄ = Z̄′Z̄/T is stochastically bounded and has the same rank as Z̄, it does not have an asymptotic normal distribution, i.e., it violates Assumption 5(iii).

The 𝚿 matrix can also be deterministic. Since n time periods contain the same information on the rank of 𝐂 as do T observations, an obvious candidate is Ψ = [0_{n×(T−n)}, In], which considers only the last n rows of Z̄. One can also take averages over every n-th row in Z̄ by setting Ψ = ⌊T/n⌋^{-1}[ι′_{⌊T/n⌋} ⊗ In] I_{⌊T/n⌋n,T}, where 𝜾a is an a×1 vector of ones.
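Theorem 1(i) is easy to check numerically: compressing a tall, rank-deficient matrix with the Gaussian choice Ψ = T^{-1/2}Φ leaves its rank unchanged. A toy check, with dimensions of our own choosing:

```python
import numpy as np

rng = np.random.default_rng(2)
T, n, true_rank = 200, 3, 2
# a T x n matrix of rank 2: product of two thin Gaussian factors
A = rng.standard_normal((T, true_rank)) @ rng.standard_normal((true_rank, n))
Psi = rng.standard_normal((n, T)) / np.sqrt(T)   # Psi = T^{-1/2} Phi, Phi i.i.d. N(0,1)
B = Psi @ A                                      # n x n compressed matrix
# the rank is preserved with probability one
print(np.linalg.matrix_rank(A), np.linalg.matrix_rank(B))
```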

Given the above, we propose estimating the rank of the n×n matrix ΨZ̄ by sequentially testing the null hypothesis H0: ϱ = ϱ* against the alternative Ha: ϱ > ϱ*, using the following statistic: τ_{ϱ*} = N ∑_{ℓ=ϱ*+1}^{n} λℓ(A), (12) where λ1(A) ≥ ⋯ ≥ λn(A) are the ordered eigenvalues of A ≡ ΨZ̄Z̄′Ψ′. The procedure is implemented sequentially for ϱ* = 0,…,n−1 and the estimated rank ϱ^ corresponds to the smallest value of ϱ* for which the null hypothesis is not rejected. Under the null, τ_{ϱ*} has a limiting distribution which is a weighted sum of independent χ²₁ variables, with weights given by the (n−ϱ)² largest eigenvalues of (Dϱ ⊗ Rϱ)′Ω(Dϱ ⊗ Rϱ), where Dϱ and Rϱ denote the eigenvectors corresponding to the n−ϱ smallest eigenvalues of Z̄′Ψ′ΨZ̄ and 𝐀, respectively. We shall assume in accordance with Robin and Smith (2000) that rk[(Dϱ ⊗ Rϱ)′Ω(Dϱ ⊗ Rϱ)] > 0.

The asymptotic variance 𝛀 of ΨZ̄ is unknown but can be estimated consistently byFootnote8: Ω^ = N^{-1}∑_{i=1}^N vec(ΨZi − ΨZ̄) vec(ΨZi − ΨZ̄)′. (13)

As discussed in Robin and Smith (2000), the estimator for ϱ obtained from the test sequence is consistent when the employed significance level αN vanishes at an appropriate rate with N. This is because αN is the probability of over-estimating the true rank, Pr(ϱ^ > ϱ), which must tend to zero for consistency. The authors show that αN = o(1) and N^{-1} ln αN = o(1) are sufficient for consistency. This is summarized in the following proposition:

Proposition 1.

Let Assumptions 1–5 hold and rk[(Dϱ ⊗ Rϱ)′Ω(Dϱ ⊗ Rϱ)] > 0. Provided that αN = o(1) and N^{-1} ln αN = o(1) as N→∞, it follows that ϱ^ − ϱ = op(1).

The proposition follows from the arguments of Theorem 5.2 in Robin and Smith (2000), mutatis mutandis. We omit the proof to save space.
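The sequential procedure can be sketched in code. The implementation below is a schematic rendering of the Robin and Smith (2000)-type test as described above, not the authors' own code: the weighted-χ² critical values are approximated by simulation, and all names and simulation settings are ours.

```python
import numpy as np

def estimate_rank(Z_list, Psi, alpha_N, n_sim=20_000, rng=None):
    """Sequential rank estimate of rho from the CSA (Section 3.1, sketch).
    Z_list: per-unit T x n data matrices Z_i; Psi: n x T reduction matrix."""
    rng = rng or np.random.default_rng(0)
    N = len(Z_list)
    PZ = np.stack([Psi @ Zi for Zi in Z_list])       # N stacked n x n matrices
    PZbar = PZ.mean(axis=0)                          # Psi @ Zbar
    n = PZbar.shape[0]
    # Eq. (13): cross-sectional covariance of vec(Psi Z_i), column-major vec
    v = np.stack([(Mi - PZbar).flatten('F') for Mi in PZ])
    Omega = v.T @ v / N
    A = PZbar @ PZbar.T                              # A = Psi Zbar Zbar' Psi'
    eigA, R_all = np.linalg.eigh(A)                  # ascending eigenvalues
    _, D_all = np.linalg.eigh(PZbar.T @ PZbar)
    for r in range(n):
        tau = N * eigA[: n - r].sum()                # Eq. (12): N * sum of n-r smallest
        R, D = R_all[:, : n - r], D_all[:, : n - r]  # null-space eigenvectors
        KDR = np.kron(D, R)                          # so that vec(R'MD) = KDR' vec(M)
        w = np.clip(np.linalg.eigvalsh(KDR.T @ Omega @ KDR), 0, None)
        draws = rng.chisquare(1, size=(n_sim, len(w))) @ w   # weighted chi-square draws
        if tau <= np.quantile(draws, 1 - alpha_N):   # fail to reject H0: rho = r
            return r
    return n
```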

Remark 3.2.

Clearly, αN has to vanish sufficiently fast with N to limit the over-estimation frequency, but not too fast, as this results in under-estimation when N is small. We suggest specifying the nominal level as αN = αcN^{-1/γ}. This way, for a given choice of α and γ, the small-N significance level is controlled through c > 1, whereas the speed at which αN decreases with N is governed by γ > 0. For instance, choosing α = 5% and setting c = 20, γ = 1 fixes the nominal level to 5% for N = 20 and lets it decrease at rate N. Given that over-estimating ϱ may lead to the false conclusion that the rank condition holds, we prefer a conservative estimator through a fast decrease with N (i.e., requiring strong evidence against the null before rejecting it in favor of a higher rank estimate).
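The suggested level schedule is straightforward to implement, for example as:

```python
def alpha_N(N, alpha=0.05, c=20.0, gamma=1.0):
    """Vanishing significance level alpha_N = alpha * c * N**(-1/gamma)."""
    return alpha * c * N ** (-1.0 / gamma)

# with the suggested settings, the level is 5% at N = 20 and decays at rate N
levels = [alpha_N(N) for N in (20, 100, 1000)]
```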

3.2. Consistent estimation of m

Existing methods to estimate the number of factors from observed data rely on one of the following three approaches: looking at differences or ratios of adjacent eigenvalues (Ahn and Horenstein, 2013; Onatski, 2010), specifying threshold functions to separate bounded from unbounded eigenvalues (Alessi et al., 2010; Bai and Ng, 2002), or sequential tests to determine which eigenvalues are unbounded (Kapetanios, 2010; Trapani, 2018). Preliminary simulation evidence conducted for this article shows that the Growth Ratio (GR) estimator of Ahn and Horenstein (2013) performs well in finite samples and outperforms other estimators.Footnote9 Therefore, in what follows we propose estimating m using the GR statistic.

In particular, let Z = [Z1,…,ZN] denote a T×(K+1)N matrix, where 𝐙i (defined in Eq. (3)) collects all observables for individual i in a T×(K+1) matrix. Also, let mmax denote the maximum value of m considered in estimation, such that mmax ≥ m. We define m^ = argmax_{j∈{1,…,mmax}} GR(j); GR(j) = ln[V(j−1)/V(j)] / ln[V(j)/V(j+1)], (14) where V(j) = ∑_{k=j+1}^{h} λk(ZZ′/NT) with h = min{T, N(K+1)}, and λk(ZZ′/NT) denotes the k-th largest eigenvalue of ZZ′/NT.

The GR statistic is easy to compute because it involves maximizing the “growth ratio” of two adjacent eigenvalues arranged in descending order. The main intuition is that the growth ratios of two adjacent eigenvalues of ZZ′/NT are asymptotically bounded, except for the growth ratio involving the m-th and (m+1)-th eigenvalues, which diverges to infinity.

Under regularity conditions implied by Assumptions 1–4, Ahn and Horenstein (2013) showFootnote10 lim_{min{N,T}→∞} Pr(m^ = m) = 1, (15) for any mmax ∈ [m, ⌊dc min{N,T}⌋ − m − 1], where dc ∈ (0,1].
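A compact implementation of the GR estimator in Eq. (14) might look as follows (a sketch with names of our own choosing; since the eigenvalue scaling cancels in the log-ratios, any positive normalisation of ZZ′ may be used in place of 1/NT):

```python
import numpy as np

def gr_num_factors(Z, m_max):
    """Growth Ratio estimator of the number of factors, Eq. (14).
    Z: T x (K+1)N matrix stacking the observables of all units."""
    T, ncols = Z.shape
    h = min(T, ncols)
    lam = np.linalg.eigvalsh(Z @ Z.T)[::-1] / (T * ncols)   # descending eigenvalues
    V = lambda j: lam[j:h].sum()            # V(j) = sum_{k=j+1}^{h} lambda_k
    gr = [np.log(V(j - 1) / V(j)) / np.log(V(j) / V(j + 1))
          for j in range(1, m_max + 1)]
    return 1 + int(np.argmax(gr))           # argmax over j = 1, ..., m_max
```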

Remark 3.3.

In exactly the same way as described above, the number of factors can also be estimated based on the T×T matrix YY′/NT, where Y = [y1,…,yN] is of dimension T×N. However, since both 𝐲i and 𝐗i share the same factors by assumption, it is natural to combine them in order to increase the information set used to construct proxies for 𝐅. This strategy is in line with the rationale behind the CCE approach, which involves solving a system of equations, such that Eq. (3) includes LHS variables (observables) that are solely driven by a common factor component and purely idiosyncratic noise. Moreover, this strategy is consistent with Westerlund and Urbain (2015), who also estimate factors based on ZZ′/NT.

3.3. A consistent classifier for the rank condition

Given consistent estimates ϱ^ and m^ of the rank of 𝐂 and the number of factors, the rank condition is deemed to be violated when ϱ^ < m^. We define the following classifier: RC^ ≡ 1 − 𝟙{ϱ^ < m^}, (16) where 𝟙{⋅} is an indicator function that returns 1 when the argument inside the curly brackets holds true, and 0 otherwise. Hence, if RC^ = 1 the rank condition is considered to be satisfied, whereas RC^ = 0 indicates that Eq. (10) may be violated. The definition in Eq. (16) shows that we also take ϱ^ > m^ as a sign that Eq. (10) is satisfied.Footnote11

Let RC ≡ 1 − 𝟙{ϱ < m} denote the indicator of the true state of the rank condition in the population. The following proposition summarizes the asymptotic properties of the classifier.

Proposition 2.

Let Assumptions 1–5 hold true. Suppose also that ϱ is determined based on the sequential testing procedure outlined in Section 3.1, with αN = o(1) and N^{-1} ln αN = o(1), and m is determined by Eq. (14) with mmax ≥ m. Then, Pr[RC^ = RC] → 1 as (N,T)→∞.

That is, the probability that the classifier correctly identifies whether the rank condition is satisfied or not converges to unity. The result follows directly from the consistency of ϱ^ as N→∞ under Assumptions 1–5, given an appropriate rate of decay for αN, and from the consistency of m^ as (N,T)→∞, given an appropriate specification of mmax.

3.4. What if the rank condition is violated?

When RC^=0, the standard CCE estimator is generally inconsistent unless the regressors are uncorrelated with the unobserved factor loadings. One may seek to restore the RC by augmenting the model with additional CSA (see Appendix A for several options). This brings about two important issues.

The first one is how to choose relevant additional CSA from a set of candidate expansions, as not all candidates are necessarily informative about 𝐅. The second one is whether the selected additional CSA are also able to restore the RC.

To tackle the first question, Karabiyik et al. (2019) have proposed an IC selection procedure. To illustrate, let Z̄₊ be the set of available expansions Z̄₊ = {Z̄₊^(1), Z̄₊^(2), Z̄₊^(3)}, (17) where (say) Z̄₊^(1) = Z̄^(e) contains CSA of new exogenous variables, Z̄₊^(2) = Z̄_{w1} contains a matrix of CSA arising from a new weighting variable w1, and similarly Z̄₊^(3) = Z̄_{w2} for a weight w2 (see Appendix A for details). The appropriate set of expansion CSA can be selected from Z̄₊ by minimizing ℓ* = argmin_ℓ IC(ℓ), (18) where, with CNT = min{N,T}, IC(ℓ) = ln|∑_{i=1}^N Zi′M_{A(ℓ)}Zi/NT| + g(ℓ); g(ℓ) = [cols(Z̄_{A(ℓ)}) − (K+1)] ln(CNT)/CNT, (19) where cols(⋅) denotes the number of columns of the matrix within brackets and ℓ = {1,2,…} gathers the indices of the considered expansions from Z̄₊. As such, for (say) ℓ = {1,3}, we have Z̄_{A(ℓ)} = [Z̄, Z̄₊^(1), Z̄₊^(3)] = [Z̄, Z̄^(e), Z̄_{w2}], and M_{A(ℓ)} = IT − Z̄_{A(ℓ)}(Z̄_{A(ℓ)}′Z̄_{A(ℓ)})⁺Z̄_{A(ℓ)}′.

A desirable property of the IC selection procedure is that it identifies the CSA that bring in new information about the factors in 𝐙i given what is already present in Z̄. Candidates that are uninformative, or informative about factors that do not feature in 𝐙i, will be excluded (asymptotically). However, the IC does not by itself signal whether the additional CSA are sufficient to restore the RC. For example, if the IC does not select additional CSA besides Z̄, this could be either because the rank condition is satisfied with Z̄, or because no further informative CSA are available in the proposal set Z̄₊. To overcome this problem, we propose combining the IC with our RC classifier, as outlined in Algorithm 1.
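The selection step can be sketched as follows. This is our schematic reading of Eqs. (17)–(19): the penalty is implemented as g(ℓ) = [cols(Z̄_{A(ℓ)}) − (K+1)]·ln(CNT)/CNT, and all names are ours, not the authors':

```python
import numpy as np
from itertools import combinations

def ic_select(Z_list, Zbar, candidates):
    """Select the subset of candidate CSA expansions minimising IC(l), Eq. (19).
    Z_list: per-unit T x (K+1) matrices; Zbar: T x (K+1) CSA; candidates: T x c arrays."""
    N, (T, K1) = len(Z_list), Z_list[0].shape
    C_NT = min(N, T)

    def ic(cols):
        ZA = np.column_stack([Zbar] + [candidates[j] for j in cols]) if cols else Zbar
        M = np.eye(T) - ZA @ np.linalg.pinv(ZA.T @ ZA) @ ZA.T
        S = sum(Zi.T @ M @ Zi for Zi in Z_list) / (N * T)   # residual covariance
        penalty = (ZA.shape[1] - K1) * np.log(C_NT) / C_NT  # extra columns are penalised
        return np.log(np.linalg.det(S)) + penalty

    subsets = [list(c) for r in range(len(candidates) + 1)
               for c in combinations(range(len(candidates)), r)]
    return min(subsets, key=ic)
```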

Remark 3.4.

An alternative strategy would be to combine our classifier with the regularization approach proposed by Juodis (Citation2022). The latter makes use of the Singular Value Decomposition in order to remove the asymptotically redundant singular values of appropriately normalized CSA. We leave this possibility for future research.

Algorithm 1.

CCEA algorithm

  1. Estimate the model parameters using the standard CCE approach and calculate IC0 = IC(∅) (no expansions). Proceed to step 2;

  2. Evaluate the rank condition for Z̄. If RC^ = 1, no further steps are required. If RC^ = 0, proceed to step 3;

  3. Employ the IC in Eq. (19) to select from Z̄₊ = {Z̄₊^(1), Z̄₊^(2), Z̄₊^(3),…} the set of CSA that are relevant for the factors in 𝐙i. That is, define ℓ* = argmin_ℓ IC(ℓ);

  4. If IC(ℓ*) ≤ IC0, evaluate the rank condition for Z̄_A = [Z̄, Z̄₊^(ℓ*)] and proceed to step 5, else proceed to step 6;

  5. If RC^(Z̄_A) = 1, estimate the model with the CCEA estimator based on Z̄_A. No further steps are required. If RC^(Z̄_A) = 0, proceed to step 6;

  6. Z̄₊ does not contain sufficient informative expansions to restore the rank condition in the model. Add new potential expansions to Z̄₊ and return to step 3.

The key benefit of Algorithm 1 is that it either returns a consistent CCE estimator or signals to the researcher that alternative CSA need to be sought. If Z̄ or the potential expansions Z̄₊ contain sufficient informative CSA to satisfy or restore the RC, they will (asymptotically) be identified and the CCE estimates emerging from Algorithm 1 are consistent. In this case, the algorithm will end in step 5 (or step 2). If, on the other hand, Z̄ does not satisfy the RC and Z̄₊ does not contain the right (or sufficient) expansions to restore it, then this will be signaled by landing in step 6. The researcher then knows that alternative CSA candidates need to be sought and fed into the algorithm before consistent CCE estimates can be obtained.
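Schematically, and with the RC classifier and the IC supplied as callables (rc_ok and ic below are placeholders of our own design, not functions defined in the article), Algorithm 1 amounts to:

```python
import numpy as np
from itertools import combinations

def cce_algorithm1(Zbar, candidates, rc_ok, ic):
    """Control-flow sketch of Algorithm 1. rc_ok(Z) -> bool evaluates the RC
    classifier on a CSA matrix; ic(subset) -> float is the IC of Eq. (19)."""
    if rc_ok(Zbar):                              # step 2: standard CCE is valid
        return Zbar, []
    subsets = [list(c) for r in range(1, len(candidates) + 1)
               for c in combinations(range(len(candidates)), r)]
    best = min(subsets, key=ic)                  # step 3: IC selection
    if ic(best) <= ic([]):                       # step 4: expansions must improve the IC
        ZA = np.column_stack([Zbar] + [candidates[j] for j in best])
        if rc_ok(ZA):                            # step 5: RC restored -> CCEA
            return ZA, best
    # step 6: no RC-restoring expansions in the candidate set
    raise ValueError("augment the candidate set and rerun")
```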

Remark 3.5.

It is also possible to evaluate the RC for all potential augmentations until RC^ = 1. However, this strategy bears the risk of selecting CSA that load on different factors than those in 𝐙i, and are therefore irrelevant for approximating the factor space. This is because such CSA will increase the rank of the augmented loading matrix, despite being irrelevant, and will therefore be incorrectly favored by the classifier. A preliminary pass through the IC selection, as in Algorithm 1, eliminates such irrelevant options.

Remark 3.6.

We refer to Appendix D for a discussion and results pertaining to inference with the augmented CCE estimator after the application of Algorithm 1.

3.5. Heterogeneous slopes

The preceding results and theorems apply similarly to heterogeneous random slope models as in Assumption 4 of Pesaran (2006), where βi = β + νi, with νi ∼ IID(0_{K×1}, Ων), 𝛀ν nonnegative definite, and 𝝂g independent of ft, εis, vjl, ξh for all t, i, s, j, l, g, h. Note that in this setting, the first column of Ci = [δi, Γi] changes to δi = λi + Γiβi, and similarly for the first column of Ui = [εi + Viβi, Vi]. However, since 𝝂i is an i.i.d. random variable with finite variance, expectation zero and independence of the other model components, it straightforwardly follows that Assumptions 1–5 listed above continue to hold. In particular, the critical eigenvalue bounds and the √N-consistency of ΨZ̄ for 𝚿𝐅𝐂 remain unaffected. The rank condition can thus be evaluated with the same procedures as outlined above. This is also confirmed by the Monte Carlo simulations reported in Appendix C.

4. Monte Carlo simulation

In this section, we investigate the small sample performance of the rank condition classifier proposed in Section 3 using Monte Carlo simulations.

4.1. Design

Data are generated from Eq. (3), broadly following Westerlund and Urbain (2013). We set m = 2, K = 1, β = 3 and sample the time series in 𝐅, 𝜺i and 𝐕i as independent autoregressive processes with a common AR coefficient ρ = 0.8 and normally distributed mean-zero innovations with variance (1−ρ²) for the factors and (1−ρ²)/2 for the idiosyncratic errors. For the factor loadings 𝝀i and 𝚪i, we specify the following three scenarios:

  • Experiment 1: λi = [3,2]′ + ηi, ηi ∼ N(0₂, I₂), and Γi = λi + [2,0]′.

  • Experiment 2: λi = [0,2]′ + ηi for i = 1,…,N/2 and λi = [2,0]′ + ηi for i = N/2+1,…,N, with ηi ∼ N(0₂, I₂) and 𝚪i = 𝝀i.

  • Experiment 3: λi ∼ N(0₂, I₂) and 𝚪i = 𝝀i.

Thus, in Experiment 1 the RC is satisfied for the simple CSA Z̄ (ϱ = m = 2). In Experiment 2, the basic CSA contain some information for estimating the factors (ϱ = 1), yet not sufficient to satisfy the RC. Since the loadings in 𝐲i and 𝐗i are (perfectly) correlated, the standard CCE estimator is not consistent. In Experiment 3 the standard CSA contain no information at all about the factors (ϱ = 0 < m), in which case consistent CCE estimation is also not possible with Z̄.

We evaluate the RC in each MC iteration, using Algorithm 1 of Section 3.4. The number of factors (m) is estimated by the GR statistic of Ahn and Horenstein (2013), setting mmax = 7. The rank of the loading matrix (ϱ) is estimated as in Section 3.1 with a random dimension reduction Ψ = T^{-1/2}Φ, 𝚽 containing i.i.d. standard-normal entries, and the nominal significance level given by αN = cαN^{-1/γ}, with c = 20, γ = 1 and α = 5%.

Additional CSA are constructed using the following weighting schemes: Z̄_{w,1} = ∑_{i=1}^N Zi wi,1, with wi,1 = 1/N1 for i = 1,…,N/2 and wi,1 = 0 for i = N/2+1,…,N, (20) and Z̄_{w,2} = ∑_{i=1}^N Zi wi,2, with wi,2 = 0 for i = 1,…,N/2 and wi,2 = 1/(N−N1) for i = N/2+1,…,N, (21) where N1 = N/2, which results in CSA calculated over the first (Z̄_{w,1}) and second (Z̄_{w,2}) group of N/2 cross-sectional units. This choice of weights presumes the existence of an exogenous grouping of the cross-sectional units, as in Experiment 2. It is not an appropriate RC-restoring expansion for Experiments 1 and 3, as no such grouping exists in those experiments. We also consider candidate CSA originating from the T×2 matrix of external variables Zi^(e) = FCi^(e) + ϵi^(e), where the columns of ϵi^(e) are generated as AR(1) processes with autoregressive coefficient ρ = 0.8 and mean-zero normally distributed innovations with variance (1−ρ²)/2, while Ci^(e) = [2.5, 1; 1, 2.5] + ηi^(e); vec(ηi^(e)) ∼ N(0₄, I₄).

As the Zi(e) load on the same factors as those in 𝐙i, the matrix Z̄(e) = (1/N) Σ_{i=1}^{N} Zi(e) is an informative, RC-restoring expansion in Experiments 2 and 3. We also accommodate in our simulations the fact that, in practice, not all external variables will load on the same factors as those in 𝐙i. These irrelevant candidates are generated from Zi(g) = GCi(g) + ϵi(g), where the factors 𝐆, loadings Ci(g) and innovations ϵi(g) follow the same DGP as 𝐅, Ci(e) and ϵi(e) but are generated independently of the latter. As such, Zi(g) is informative about 𝐆 but not 𝐅, and Z̄(g) is therefore not an appropriate expansion in any of the considered experiments. The total set of candidate expansions fed into Algorithm 1 is thus a mixture of relevant and uninformative candidates, and is given by

(22) Z̄+ = [Z̄w,1, Z̄w,2, Z̄(e), Z̄(g)].
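In code, the construction of the candidate set in Eqs. (20)-(22) amounts to a handful of weighted averages. A minimal sketch (Z_list, Ze_list and Zg_list are hypothetical lists holding the unit-level matrices Zi, Zi(e) and Zi(g)):

```python
import numpy as np

def weighted_csa(Z_list, weights):
    """Weighted cross-sectional average sum_i w_i * Z_i, as in Eqs. (20)-(21)."""
    return sum(w * Z for Z, w in zip(Z_list, weights))

def candidate_expansions(Z_list, Ze_list, Zg_list):
    """Build the proposal set Z+ = [Zw1, Zw2, Z(e), Z(g)] of Eq. (22)."""
    N = len(Z_list)
    n1 = N // 2
    w1 = np.array([1.0 / n1] * n1 + [0.0] * (N - n1))        # first-group weights
    w2 = np.array([0.0] * n1 + [1.0 / (N - n1)] * (N - n1))  # second-group weights
    Zw1 = weighted_csa(Z_list, w1)
    Zw2 = weighted_csa(Z_list, w2)
    Ze = sum(Ze_list) / N   # simple CSA of the relevant external variables
    Zg = sum(Zg_list) / N   # simple CSA of the irrelevant external variables
    return np.hstack([Zw1, Zw2, Ze, Zg])
```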

In accordance with Algorithm 1, the augmented estimator CCEA selects expansions from Z̄+ using the Information Criterion of Karabiyik et al. (2019) given in Eq. (19). The RC is re-evaluated whenever expansions are selected.

We generate 10,000 datasets for each combination of N ∈ {20, 50, 100, 200, 500, 1000} and T ∈ {20, 50, 100, 200}, and calculate the under-/over-estimation frequencies for ϱ^ and m^, and the classification accuracy of RC^, i.e., the percentage of MC draws in which the RC is correctly evaluated. When the RC is not satisfied for the standard CCE estimator (Experiments 2 and 3), we also consider the CCEA estimator and compute the ‘RC satisfied rate’ as the percentage of MC draws in which Algorithm 1 selects expansions that restore the rank condition.

4.2. Estimating ϱ and m

Results for the performance of the estimators for ϱ and m are presented in Table 1 in A/B format, with A and B the percentage of MC iterations in which ϱ or m is, respectively, under- and over-estimated. The left panel contains results for estimating the rank ϱ of the loading matrix and reveals that both the over- and under-estimation frequencies tend to zero as N→∞. This is consistent with the main result of the paper that ϱ can be estimated consistently from Z̄. As expected, we find that the rank estimator is somewhat sensitive to the size of the cross-section dimension, which needs to be sufficiently large (i.e., N of at least 50) to achieve an accuracy of 75%. In contrast, its performance is largely invariant to the size of T, which supports the projection strategy to guarantee computability of the estimator and large-N consistency when also T→∞. Note that the rank estimator is conservative in the sense that the true rank is more likely to be under-estimated than over-estimated. This is a consequence of our chosen significance level αN = 20αN^(−1), whose fast decay in N implies that strong evidence against the null ϱ = ϱ0 is required before it is rejected in favor of a higher rank ϱ > ϱ0. Yet, the observed under-estimation frequency is reasonable and vanishes sufficiently fast with N.

Table 1. Under/over-estimation frequency of the estimators for ϱ and m.

The right panel of Table 1 reports results for estimating the number of factors m = 2 with the GR estimator of Ahn and Horenstein (2013). The estimator performs very well despite the high serial dependence in the generated data, under which many of its competitors in the literature tend to behave more poorly. The finite-sample performance of the GR approach appears to be primarily driven by the time-series dimension T. Yet, its small-sample performance is more than adequate: the approach displays low error frequencies even when T = 20, and identifies m without error when T > 50.

4.3. Evaluating the rank condition

4.3.1. Experiment 1: rank condition satisfied

In Experiment 1, the RC is satisfied for the CCE estimator that uses the standard set of CSA in Z̄. The classification accuracy reported in Table 2 shows that the RC^ classifier is reasonably accurate in detecting that the rank condition is indeed satisfied. Even for smaller samples, the RC is correctly confirmed in at least 70% of the MC iterations, the only exception being the smallest N = 20 setting, where the lowest rate is 64%. As the sample size grows, the accuracy improves and tends to 1 as (N,T)→∞, as required. The results also show that the main determinant of finite-sample performance is the cross-section dimension N, rather than T. This is as expected from the results in Table 1, which show that (i) ϱ^ is more prone to finite-sample error than m^, the latter being practically error-free when T ≥ 50; and (ii) ϱ^ converges at a slower rate and only as N grows. Hence, ϱ^ is the main driver of the finite-sample performance of RC^. Therefore, in line with the properties of the CCE estimator itself, it is mainly N that needs to be sufficiently large to correctly assess the rank condition in practice. Finally, note that the samples in which we incorrectly obtained RC^ = 0 for the CCE estimator prompted the application of the augmentation strategy outlined in Algorithm 1 of Section 3.4. As shown in the appendix, an expansion was only selected in the smallest samples and in at most 2% of the MC iterations. Hence, the rank-evaluation results for the augmented CCEA estimator reported in the right panel of Table 2 are almost identical to those for the CCE estimator. This also confirms the effectiveness of the IC selector.

Table 2. Evaluating the rank condition: Experiment 1.

4.3.2. Experiment 2: rank condition violated for basic weights

In Experiment 2, the RC is violated when using the standard set of CSA Z̄, and since the factor loadings are (perfectly) correlated, the CCE estimator is inconsistent for β in this setting.Footnote12 The left panel of Table 3 shows that the RC classifier strongly signals that the RC is violated for the CCE estimator. The proportion of samples in which the classifier wrongly concludes that the RC holds quickly diminishes as (N,T)→∞.

Table 3. Evaluating the rank condition: Experiment 2.

When the RC is found to be violated, Algorithm 1 is applied by letting the IC search among the proposal expansions for additional CSA. In this experiment, this leads to the selection of at least one of the valid augmentations (Z̄w,1, Z̄w,2, Z̄(e)) in the majority of combinations of N and T (see Appendix B). Accordingly, the rank condition was successfully restored in around 90% of the MC iterations, even in the smallest samples with T = N = 20. The proportion of samples in which the RC is restored is given in the lower panel of Table 3 for the CCEA estimator, and can be seen to converge to 1 as (N,T)→∞. Hence, Algorithm 1 leads to a consistent CCEA estimator as (N,T)→∞, when provided with appropriate rank-increasing CSA.Footnote13 Note that the algorithm performs well in finite samples. The cases in which the RC is not satisfied for CCEA are mostly due to the mis-classification as RC^ = 1 in the ‘CCE’ panel, which vanishes as the sample size grows.

In practice, selecting expansion CSA with the IC does not guarantee that the rank condition is also satisfied, leaving the researcher unsure about the state of the RC. Hence, Algorithm 1 incorporates a re-evaluation with the classifier after expansions have been chosen. The top-right panel of Table 3 reveals that this re-evaluation confirms with good accuracy that the rank condition is satisfied in those cases where the right expansions have been selected. The overall classification accuracy is over 70% in the smallest samples and gradually converges to 1 as (N,T)→∞. Finite-T fluctuations in the table are induced by the estimator m^ (which we have shown to be sensitive to T) and by the variability of the averages selected by the IC estimator in finite-T settings. See to that end the selection probabilities reported in the Appendix.

4.3.3. Experiment 3: rank condition violated

In Experiment 3, the loading matrix 𝐂 for the standard CSA has rank zero. Intuitively, the effect of the factors is averaged out in Z̄, such that the CSA are uninformative for estimating the factor space. The top panel of Table 4 reveals that our RC evaluation method is highly accurate in this setting, even for very small N. This is due to the large discrepancy between m = 2 and ϱ = 0. Since we have specified a conservative estimator for ϱ, such a large over-estimation error almost never occurred (recall the estimation frequencies in Table 1).

Table 4. Evaluating the rank condition: Experiment 3.

Given the strong signal from the classifier that the RC is violated, Algorithm 1 in the ‘CCEA’ panel has led to a search for expansion CSA in nearly all MC samples. We find that the sole rank-restoring expansion Z̄(e) was selected with high probability, as indicated by the high proportion of samples for which the RC has been restored (see the bottom panel of Table 4). This again confirms the ability of the IC approach to select only relevant candidates (i.e., those loading on the same factors) for expansion. Note, however, that compared to Experiment 2, the classifier appears less capable of confirming that the rank condition is restored when N is very small. Accuracy is only 40% when N = 20. Closer analysis reveals that this is caused by a relatively large under-estimation rate (60%) of the true rank in N = 20 samples when the correct expansion was selected. A possible cause is that the expanded matrix Z̄A = [Z̄, Z̄(e)] has a potential rank (the number of columns, 4) that is twice the true rank (2). This suggests a relatively high level of estimation noise, and a cross-section dimension of N = 20 appears too small to estimate the rank accurately in such cases. Yet, the performance of the estimator improves quickly with N, and classification accuracy recovers to 85% or higher for N = 100.

We also consider the empirically relevant scenario in which the proposal set Z̄+ does not contain sufficient informative CSA to restore the rank condition. To that end, we report in the CCEA,sub panel of Table 4 the outcomes of Algorithm 1 when the set of proposal expansions is Z̄+,sub = {Z̄w,1, Z̄w,2, Z̄(g)} instead of Z̄+. Hence, Z̄+,sub contains insufficient valid expansions to restore the RC, and Algorithm 1 should signal that the RC remains violated even when expansions have been selected from it. It is furthermore important that the IC does not select Z̄(g), the CSA that load on factors other than those in 𝐙i, as this would lead the classifier to conclude falsely that the RC is satisfied (see Remark 3.5). This makes the setting particularly challenging. However, the results summarized in the CCEA,sub panel of Table 4 show that the classifier confirms with high accuracy that the RC fails even after expansions were chosen.

4.3.4. Experiments with heterogeneous slopes

To confirm the validity of the RC classifier in settings with heterogeneous slopes, we also run Experiments 1-3 with the slopes in Eq. (1) generated as βi = β + νi, where νi ∼ N(0, 1). We generally find that slope heterogeneity has little effect on the performance of the classifier or Algorithm 1, save that the rank estimator incurs a slightly higher error rate due to the additional noise (max +5%). Conclusions are thus practically identical to the common-slope results reported above, and in the interest of space we refer to Appendix C.2 for the results. This confirms the consistency of the procedures posited in Section 3.5.

5. Application: the impact of the Dodd-Frank Act on the profitability of U.S. banks

Studies on the profitability of banking institutions are vital for a better understanding of the causes of financial crises, economic recessions, and growth. Profits constitute the first line of defense against losses from credit impairment, since retained earnings are an important source of capital. When it comes to large banks, high profitability may also signal excessive market power through stronger brand image or implicit regulatory protection; this is the so-called “too-big-to-fail” (TBTF) hypothesis, which postulates that large financial institutions may be so widely interconnected with the rest of the economy that their failure would generate a disastrous domino effect for the whole economy. To the extent that governments effectively subsidize downside risk for financial institutions with TBTF status, large banks face artificially lower costs of capital, and thus reap more profits.

A large number of studies analyze drivers of bank profits (see e.g., Baker and Wurgler, 2015; Goddard et al., 2011; Iannotta et al., 2007; Lee and Hsieh, 2013; Staikouras and Wood, 2004). There is also a fairly substantial literature focusing on the TBTF hypothesis (see e.g., Gropp and Vesala, 2004; Hakenes and Schnabel, 2011; Morgan and Stiroh, 2005; Sironi, 2003; Stern and Feldman, 2009; Völz and Wedow, 2011). The bulk of this literature provides evidence that government bailout guarantees may distort market discipline, inducing excessive risk-taking and morally hazardous behavior (Mattana et al., 2015).

The present illustration contributes to this literature by examining the impact of the well-known “Dodd-Frank Act” (DFA) on profitability in the U.S. banking sector. The DFA is a U.S. federal law enacted in 2010 that instituted a new failure-resolution regime, which seeks to ensure that losses resulting from bad decisions by managers are absorbed by equity and debt holders, thus potentially reducing moral hazard. Existing empirical evidence on the extent to which the DFA has alleviated the TBTF problem is relatively sparse and mixed. For example, while Baily et al. (2020) find a positive influence of the DFA towards resolving moral hazard, other studies point in the opposite direction (see e.g., Bordo and Duca, 2018). In what follows, we apply the CCE estimator and the rank-test methodology developed in the present paper to shed further light on this important topic.

5.1. Data and model specification

We make use of a panel data set consisting of 450 U.S. banking institutions over the period 2006:Q1–2019:Q4.Footnote14 We analyze the impact of major drivers of bank profitability, with emphasis on bank size. Thus, we specify the following model:

ROAit = β1(ℓ)SIZEit + β2(ℓ)CARit + β3(ℓ)LIQUIDITYit + β4(ℓ)QUALITYit + β5(ℓ)RISKit + uit; uit = ηi + λi′ft + εit,

where i = 1, …, N, t = 1, …, T, and ℓ = τ1·1{t < 2011:Q1} + τ2·1{t ≥ 2011:Q1}, with 1{·} the indicator function. Essentially, the model is estimated for two sub-periods, namely 2006:Q1–2010:Q4 and 2011:Q1–2019:Q4. The first sub-period belongs to the Basel I-II period, whereas the second corresponds to the DFA and coincides with the international introduction of Basel III.Footnote15
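Operationally, the regime index ℓ simply splits the quarterly sample at 2011:Q1. A minimal sketch of this bookkeeping in plain Python (variable names are ours):

```python
# Quarterly index spanning the sample period 2006:Q1-2019:Q4.
quarters = [(y, q) for y in range(2006, 2020) for q in range(1, 5)]

# ell = tau1 before 2011:Q1 (Basel I-II), tau2 from 2011:Q1 onwards (DFA).
ell = ["tau1" if (y, q) < (2011, 1) else "tau2" for (y, q) in quarters]

pre = [t for t, l in zip(quarters, ell) if l == "tau1"]   # 2006:Q1-2010:Q4
post = [t for t, l in zip(quarters, ell) if l == "tau2"]  # 2011:Q1-2019:Q4
```

Estimating the model separately on `pre` (20 quarters) and `post` (36 quarters) yields the sub-period coefficients βk(τ1) and βk(τ2).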

The variables of the model are defined as follows: ROAit is the return on assets (annualized net income expressed as a percentage of average total assets on a consolidated basis); SIZEit denotes the natural logarithm of bank total assets; CARit stands for the “capital adequacy ratio” (ratio of Tier 1 capital over average total assets minus ineligible intangibles), with higher values implying higher levels of capitalization; LIQUIDITYit is proxied by the loan-to-deposit ratio, with higher values implying a lower level of liquidity; QUALITYit is computed as the total amount of loan-loss provisions expressed as a percentage of assets; and RISKit denotes the ratio of non-performing loans to total loans, with higher values indicating that banks took on higher lending risk ex ante and have therefore accumulated more bad loans ex post.

The error term uit is composite. In particular, ηi captures bank-specific effects, such as ownership and location. The m×1 vector 𝐟t denotes unobserved economy-wide factors that influence bank profits, albeit with heterogeneous intensities 𝝀i. Last, εit is an idiosyncratic error.

The above explanatory variables originate from bank accounts (balance sheets and/or profit-and-loss accounts) and are tied to management decisions. As such, they are viewed as “internal”. Bank profitability is also driven by “external” factors that lie beyond the control of management, such as business-cycle effects, monetary shocks, and financial innovation. These are absorbed in our model by the common factor component specified in the error term, λi′ft. Although in some cases external drivers can be measured and included directly in the model, the details of measurement are often difficult and/or contentious.Footnote16

We note that internal and external drivers of bank profitability are likely to be mutually correlated. For example, asset quality may depend on the position of the business cycle, since contractionary phases are typically associated with a higher level of default risk. Therefore, standard panel data approaches that fail to control for external drivers are likely to face an endogeneity problem and, hence, to yield inconsistent parameter estimates. The CCE approach allows for consistent estimation, provided that the rank condition is satisfied such that the external drivers are adequately controlled for.

For notational convenience, let 𝐙i denote the T×6 matrix with the observables

(23) Zi = [yi, xi(1), …, xi(5)],

where yi = [yi1, …, yiT]′ is a T×1 vector such that yit ≡ ROAit, and similarly for the remaining variables, with xi(k) denoting the covariate with coefficient βk(ℓ).

5.2. Evaluating the RC

Before looking at the CCE estimation results, it is important to examine whether the RC is satisfied. The number of factors m is estimated from the T×(K+1)N matrix Z = [Z1, …, ZN], using the Growth Ratio statistic of Ahn and Horenstein (2013). The rank of the matrices of CSA that we consider, to be defined shortly, is determined with the sequential testing procedure of Robin and Smith (2000). Since T is small in both sub-samples, there is no need to reduce the row dimensionality using a projection matrix.

We start with the standard CCE estimator based on the unweighted CSA Z̄ = (1/N) Σ_{i=1}^{N} Zi. Table 5 reports results for evaluating the RC. The first and second columns correspond to the standard CCE estimator applied to the periods 2006:Q1–2010:Q4 (Basel I-II) and 2011:Q1–2019:Q4 (Dodd-Frank Act), respectively. For the period under Basel I-II, m^ = 3. The standard CSA Z̄ appear unable to proxy these factors, as the RC is found to be violated, RC^ = 0. For the period under the Dodd-Frank Act, we obtain m^ = 2 and the rank condition now appears to hold for the standard CCE estimator.

Table 5. US bank profitability: Evaluating the rank condition.

Given that the RC is violated for the standard CCE approach in the first sub-period of the sample, we consider a set of potential expansions of the CSA, given by

(24) Z̄+ = {Z̄+(1), Z̄+(2), Z̄+(3), Z̄+(4)}.

Z̄+(1) ≡ [x̄(6), x̄(7)] is a T×2 matrix, where x̄(6) and x̄(7) denote the simple CSA of two external variables, namely the return on equity (ROE) and the Tier 1 risk-based capital ratio. ROE is defined as annualized net income expressed as a percentage of average total equity on a consolidated basis. The risk-based capital ratio is defined as Tier 1 (core) capital expressed as a percentage of risk-weighted assets. As these variables are alternative measures of profitability (𝐲i) and capitalization (xi(2)), respectively, they are expected to be driven by the same common factors as those entering the regression model.

Z̄+(2) and Z̄+(3) denote T×(K+1) matrices of weighted CSA, computed from 𝐙i in Eq. (23). Z̄+(2) is calculated using as aggregation weight the initial value of the bank-specific debt ratio (defined as total liabilities over total assets). This variable has been employed in the literature as a measure of the interconnectedness of banks (Fernandez, 2011). Thus, banks with similar debt ratios may be hit by common shocks in a similar manner and therefore receive a similar weight in the computation of the CSA of 𝐙i. Z̄+(3) uses the size of each bank at the beginning of the sample as averaging weight, which implies that banks of similar size get a similar weight in the computation of Z̄+(3).

Finally, Z̄+(4) is a T×2(K+1) matrix of CSA, obtained using two weights constructed by grouping banks according to their size. In particular, we take CSA over the bottom and the top quintile of banks.
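The quintile-based weights can be sketched as follows (a minimal illustration with hypothetical inputs: Z_list holds the unit-level T×(K+1) matrices and size the initial bank sizes; neither name is from the paper):

```python
import numpy as np

def quintile_group_csa(Z_list, size):
    """CSA over the bottom and top size quintiles, stacked side by side."""
    size = np.asarray(size)
    lo, hi = np.quantile(size, [0.2, 0.8])     # quintile cutoffs
    bottom = [Z for Z, s in zip(Z_list, size) if s <= lo]
    top = [Z for Z, s in zip(Z_list, size) if s >= hi]
    # Equal weights within each group, zero weight outside it.
    return np.hstack([sum(bottom) / len(bottom), sum(top) / len(top)])
```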

Table 6 reports IC results for each of the suggested additional CSA. Under Basel I-II, where the RC was found to be violated, the IC selects Z̄+(1) as a relevant expansion. The other expansions (Z̄+(2), Z̄+(3), Z̄+(4)) do not provide new information about the factor space. In the DFA period, none of the expansions is selected, since the RC was already satisfied.

Table 6. US bank profitability: IC for additional CSA.

Given the IC results, we consider the augmented CCEA estimator with CSA Z̄A = [Z̄, Z̄+(1)]. Whether this augmented set of CSA is also sufficient to restore the rank condition needs to be verified with the RC classifier. Results are reported in the right panel of Table 5. As we can see, the augmentation has restored the rank condition (RC^ = 1) for the first sub-period. As expected, the RC remains satisfied in the second sub-period should we also augment the CCE estimator with Z̄+(1).

5.3. CCE and CCEA estimation results

Table 7 reports CCE and CCEA estimates for the two sub-periods 2006:Q1–2010:Q4 and 2011:Q1–2019:Q4. The RC evaluation results imply that in the first sub-period the CCEA estimator is consistent, whereas CCE is not. This discrepancy is mainly noticeable in the estimated coefficients of SIZE and LIQUIDITY. In both cases, the inconsistent CCE estimator appears to overestimate the impact of these variables on bank profitability. For the period 2011:Q1–2019:Q4, the RC holds for both CCE and CCEA, so there is no need to augment the model with additional CSA. Notably, the estimates obtained by the two estimators are not statistically different.

Table 7. US bank profitability: CCE and CCEA estimation results

Turning to a comparison of the results across the two sub-periods, SIZE appears to be substantially less important in driving bank profitability in the DFA period.Footnote17 More specifically, the difference between β^1(τ1) = 0.647 and β^1(τ2) = 0.267 equals 0.38 and is statistically significant at the 10% level, with a p-value of roughly 0.061 (one-tailed test).Footnote18 That is, if large banks exercised market power and implicitly relied on regulatory protection based on a “too-big-to-fail” presumption, this type of behavior appears less prevalent after the introduction of the Dodd-Frank Act. Further, note that if we use the standard CCE estimator in both sub-periods, the difference between β^1(τ1) and β^1(τ2) amounts to 0.959 − 0.267 = 0.692. Hence, the impact of the DFA would be estimated to be almost twice as large as that obtained with our approach. This further highlights the importance of evaluating the rank condition for CCE-type estimators.
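The arithmetic behind this comparison (see footnote 18) can be verified directly; a quick check, assuming a zero covariance between the two estimates because they come from different samples:

```python
from math import sqrt
from statistics import NormalDist

b_pre, se_pre = 0.647, 0.196    # beta1(tau1) and its s.e.
b_post, se_post = 0.267, 0.149  # beta1(tau2) and its s.e.

# Different samples -> zero covariance between the two estimates.
t = (b_pre - b_post) / sqrt(se_pre**2 + se_post**2)
p_one_sided = 1.0 - NormalDist().cdf(t)

print(round(t, 2), round(p_one_sided, 3))   # 1.54 0.061
```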

6. Conclusion

It is well known that the so-called rank condition is crucial for the statistical properties of the CCE approach developed by Pesaran (2006). However, to date this rank condition could not be verified, as it relates to the rank of the unobserved matrix of factor loadings. Therefore, in practice the rank condition is typically assumed to hold.

In this article, we have outlined a procedure to evaluate whether the rank condition holds in the model of interest, given a chosen set of cross-sectional averages. If the rank condition is found to be violated, the procedure can be applied within an augmentation strategy, which combines our proposed classifier with an Information Criterion to determine a set of CSA that restores the rank condition. Our approach is therefore generally applicable for checking whether the chosen cross-sectional averages are sufficient to satisfy the rank condition, or whether additional variables should be explored.

Acknowledgments

The authors thank Alexander Chudik, Arturas Juodis, George Kapetanios and Joakim Westerlund for helpful comments and discussions. This paper has also benefited from presentations at the 2017 and 2018 International Panel Data Conferences, the 2018 Asian and European Meetings of the Econometric Society, and the 2019 Panel Data Workshop in Amsterdam.

Additional information

Funding

Ignace De Vos acknowledges financial support from the Ghent University BOF research fund. Ignace De Vos and Gerdie Everaert further acknowledge financial support from the National Bank of Belgium.

Notes

1. A recent search on Google Scholar indicated that the number of empirical applications based on CCE estimation currently exceeds one thousand.

2. See e.g., Kapetanios et al. (2011), Su and Jin (2012), Chudik and Pesaran (2015), De Vos and Everaert (2021), Juodis and Sarafidis (2022b) and Cui et al. (2022), to mention a few.

3. The strategy of choosing all possible additional CSA at hand, in the hope of satisfying the RC, can lead to a different problem. In particular, as shown by Karabiyik et al. (2017) and Juodis et al. (2021), when too many CSA are employed, the CCE estimator can suffer from bias or it may have a reduced convergence rate.

4. When the model contains fixed effects, one may set M = IT − H(H′H)⁺H′, where H = [ιT, Z̄] and 𝜾T is a T×1 vector of ones.

5. Strictly speaking, the dimension of the vector space spanned by Z̄ will be lower than m in these cases.

6. Note that the first equality in (11) follows from the fact that 𝐅 has full column rank by Assumption 2. See Exercise 4.25 on p. 85 of Abadir and Magnus (2005).

7. Hereafter, we change the notation for the number of columns of the CSA matrix Z̄ from K+1 to a general n, in order to accommodate the need to augment Z̄ with additional CSA when the rank condition fails, as described in Section 3.4.

8. 𝛀 can also be estimated using bootstrap techniques. When the model contains fixed constants, 𝐙i should be time-demeaned, i.e., pre-multiplied by Q = IT − ιTιT′/T.

9. Juodis and Sarafidis (2022b) provide additional evidence that confirms the good performance of the GR statistic in finite samples.

10. Note that our Assumptions 2-3 and the independence in Assumption 4 imply Assumptions A and B in Ahn and Horenstein (2013), and our eigenvalue condition in Assumption 1 is sufficient for Assumptions C and D in that paper (see in particular Eqs. (2)–(3) and the surrounding discussion in Ahn and Horenstein, 2013).

11. ϱ^>m^ can only occur in finite samples due to estimation error but not at the population level.

12. This can also be seen from the estimation results reported in Appendix B.

13. This is also confirmed by the estimation results for β in Appendix B.

14. All data are publicly available and they have been downloaded from the Federal Deposit Insurance Corporation (FDIC) website. See https://www.fdic.gov/.

15. Basel III is an international regulatory framework for capital standards, which incorporates a set of reforms within the banking sector, designed to improve the regulation, supervision and risk management. It requires banks to maintain proper leverage ratios and meet certain minimum capital requirements.

16. For example, how does one measure monetary shocks? Does one look at interest rates or monetary aggregates? Which monetary aggregates? Similarly, how does one proxy financial innovation? For instance, how does one measure embedded leverage in new financial instruments?

17. In practice, we note that β1(ℓ) captures not only the effect of market power and implicit regulatory protection via TBTF, but also the effect of economies of scale. Such scale effects will be positive (negative) if there exist economies (diseconomies) of scale. However, to the extent that the degree of returns to scale in the banking sector has remained unaltered during the sampling period of the analysis, the difference in the coefficient of SIZEit between the two sub-periods, i.e., β1(τ2) − β1(τ1), will measure the impact of the Dodd-Frank Act on TBTF, conditional on the remaining covariates.

18. The t-statistic is t = (0.647 − 0.267)/√(0.196² + 0.149²) = 1.54. Note that since the CCEA and CCE estimates are based on different samples, it is natural to assume that their covariance equals zero.

19. For example, Karabiyik et al. (2019) estimate a gravity equation of bilateral trade flows and construct weights based on different measures of trade cost.

20. This property is also utilized by Juodis and Sarafidis (2022a), who propose the use of aggregation weights in the context of GMM estimation in panels with T fixed or large.

21. See Section 2 in Karabiyik et al. (2019) for the formal set of assumptions required to ensure the validity of such weights.

References

  • Abadir, K. M., Magnus, J. R. (2005). Matrix Algebra. Cambridge: Cambridge University Press.
  • Ahn, S. C., Horenstein, A. R. (2013). Eigenvalue ratio test for the number of factors. Econometrica 81(3):1203–1227.
  • Al-Sadoon, M. M. (2017). A unifying theory of tests of rank. Journal of Econometrics 199(1):49–62.
  • Alessi, L., Barigozzi, M., Capasso, M. (2010). Improved penalization for determining the number of factors in approximate factor models. Statistics & Probability Letters 80(23):1806–1813.
  • Bai, J., Ng, S. (2002). Determining the number of factors in approximate factor models. Econometrica 70(1):191–221.
  • Baily, M. N., Klein, A., Schardin, J. (2020). The impact of the Dodd-Frank Act on financial stability and economic growth. The Russell Sage Foundation Journal of the Social Sciences 3:20–47.
  • Baker, M., Wurgler, J. (2015). Do strict capital requirements raise the cost of capital? Bank regulation, capital structure, and the low-risk anomaly. American Economic Review 105(5):315–320. doi:10.1257/aer.p20151092
  • Bordo, M. D., Duca, J. V. (2018). The impact of the Dodd-Frank Act on small business. Staff Reports 1806:1–40.
  • Camba-Mendez, G., Kapetanios, G. (2009). Statistical tests and estimators of the rank of a matrix and their applications in econometric modelling. Econometric Reviews 28(6):581–611. doi:10.1080/07474930802473785
  • Chen, Q., Fang, Z. (2019). Improved inference on the rank of a matrix. Quantitative Economics 10(4):1787–1824. doi:10.3982/QE1139
  • Chudik, A., Pesaran, M. H. (2015). Common correlated effects estimation of heterogeneous dynamic panel data models with weakly exogenous regressors. Journal of Econometrics 188(2):393–420. doi:10.1016/j.jeconom.2015.03.007
  • Cragg, J. G., Donald, S. G. (1997). Inferring the rank of a matrix. Journal of Econometrics 76(1-2):223–250. doi:10.1016/0304-4076(95)01790-9
  • Cui, G., Norkutė, M., Sarafidis, V., Yamagata, T. (2022). Two-stage instrumental variable estimation of linear panel data models with interactive effects. The Econometrics Journal 25(2):340–361. doi:10.1093/ectj/utab029
  • De Vos, I., Everaert, G. (2021). Bias-corrected common correlated effects pooled estimation in dynamic panels. Journal of Business & Economic Statistics 39(1):294–306. doi:10.1080/07350015.2019.1654879
  • Fan, J., Liao, Y. (2020). Learning latent factors from diversified projections and its applications to over-estimated and weak factors. Journal of the American Statistical Association 117(538):909–924.
  • Fernandez, V. (2011). Spatial linkages in international financial markets. Quantitative Finance 11(2):237–245. doi:10.1080/14697680903127403
  • Goddard, J., Liu, H., Molyneux, P., Wilson, J. O. (2011). The persistence of bank profit. Journal of Banking & Finance 35(11):2881–2890. doi:10.1016/j.jbankfin.2011.03.015
  • Gropp, R., Vesala, J. (2004). Deposit insurance, moral hazard and market monitoring. Review of Finance 8(4):571–602. doi:10.1093/rof/8.4.571
  • Hakenes, H., Schnabel, I. (2011). Bank size and risk-taking under basel II. Journal of Banking & Finance 35(6):1436–1449. doi:10.1016/j.jbankfin.2010.10.031
  • Iannotta, G., Nocera, G., Sironi, A. (2007). Ownership structure, risk and performance in the European banking industry. Journal of Banking & Finance 31(7):2127–2149. doi:10.1016/j.jbankfin.2006.07.013
  • Juodis, A. (2022). A regularization approach to common correlated effects estimation. Journal of Applied Econometrics 37(4):788–810. doi:10.1002/jae.2899
  • Juodis, A., Karabiyik, H., Westerlund, J. (2021). On the robustness of the pooled CCE estimator. Journal of Econometrics 220(2):325–348. doi:10.1016/j.jeconom.2020.06.002
  • Juodis, A., Sarafidis, V. (2022). An incidental parameters free inference approach for panels with common shocks. Journal of Econometrics 229(1):19–54. doi:10.1016/j.jeconom.2021.03.011
  • Juodis, A., Sarafidis, V. (2022). A linear estimator for factor-augmented fixed-T panels with endogenous regressors. Journal of Business & Economic Statistics 40(1):1–15. doi:10.1080/07350015.2020.1766469
  • Kapetanios, G. (2010). A testing procedure for determining the number of factors in approximate factor models with large datasets. Journal of Business & Economic Statistics 28(3):397–409. doi:10.1198/jbes.2009.07239
  • Kapetanios, G., Pesaran, M. H., Yamagata, T. (2011). Panels with non-stationary multifactor error structures. Journal of Econometrics 160(2):326–348. doi:10.1016/j.jeconom.2010.10.001
  • Karabiyik, H., Reese, S., Westerlund, J. (2017). On the role of the rank condition in CCE estimation of factor-augmented panel regressions. Journal of Econometrics 197(1):60–64. doi:10.1016/j.jeconom.2016.10.006
  • Karabiyik, H., Urbain, J., Westerlund, J. (2019). CCE estimation of factor-augmented regression models with more factors than observables. Journal of Applied Econometrics 34(2):268–284. doi:10.1002/jae.2661
  • Kleibergen, F., Paap, R. (2006). Generalized reduced rank tests using the singular value decomposition. Journal of Econometrics 133(1):97–126. doi:10.1016/j.jeconom.2005.02.011
  • Lee, C.-C., Hsieh, M.-F. (2013). The impact of bank capital on profitability and risk in Asian banking. Journal of International Money and Finance 32:251–281. doi:10.1016/j.jimonfin.2012.04.013
  • Mattana, P., Petroni, F., Rossi, S. P. S. (2015). A test for the too-big-to-fail hypothesis for European banks during the financial crisis. Applied Economics 47(4):319–332. doi:10.1080/00036846.2014.959654
  • Morgan, D. P., Stiroh, K. J. (2005). Too big to fail after all these years. Federal Reserve Bank of New York Staff Reports, 220:1–30.
  • Onatski, A. (2010). Determining the number of factors from empirical distribution of eigenvalues. Review of Economics and Statistics 92(4):1004–1016. doi:10.1162/REST_a_00043.
  • Pesaran, M. H. (2006). Estimation and inference in large heterogeneous panels with a multifactor error structure. Econometrica 74(4):967–1012. doi:10.1111/j.1468-0262.2006.00692.x
  • Robin, J.-M., Smith, R. J. (2000). Tests of rank. Econometric Theory 16(2):151–175. doi:10.1017/S0266466600162012
  • Sironi, A. (2003). Testing for market discipline in the European banking industry: Evidence from subordinated debt issues. Journal of Money, Credit, and Banking 35(3):443–472. doi:10.1353/mcb.2003.0022
  • Staikouras, C. K., Wood, G. E. (2004). The determinants of European bank profitability. International Business & Economics Research Journal 3(6):57–68.
  • Stern, G. H., Feldman, R. J. (2009). Too Big to Fail: The Hazards of Bank Bailouts, Washington, DC: Brookings Institution Press. ISBN: 0-8157-8152-0.
  • Su, L., Jin, S. (2012). Sieve estimation of panel data models with cross section dependence. Journal of Econometrics 169(1):34–47. doi:10.1016/j.jeconom.2012.01.006
  • Trapani, L. (2018). A randomized sequential procedure to determine the number of factors. Journal of the American Statistical Association 113(523):1341–1349. doi:10.1080/01621459.2017.1328359
  • Völz, M., Wedow, M. (2011). Market discipline and too-big-to-fail in the CDS market: Does banks’ size reduce market discipline? Journal of Empirical Finance 18(2):195–210. doi:10.1016/j.jempfin.2011.01.001
  • Westerlund, J., Urbain, J.-P. (2013). On the estimation and inference in factor-augmented panel regressions with correlated loadings. Economics Letters 119(3):247–250. doi:10.1016/j.econlet.2013.03.022
  • Westerlund, J., Urbain, J.-P. (2015). Cross-sectional averages versus principal components. Journal of Econometrics 185(2):372–377. doi:10.1016/j.jeconom.2014.09.014

Appendix A

Rank condition not satisfied: potential CSA for expansions

Chudik and Pesaran (Citation2015) advocate expanding Z̄ by adding cross-sectional averages of external variables. This practice requires that these variables load on the same set of factors F that operate in Z_i, but otherwise have no relation to the dependent variable. To illustrate, consider a setting where m > K + 1, so that the rank condition is violated for Z̄. Let Z_i^(e) be the T × K_e matrix gathering the exogenous covariates, given by

(A.1) Z_i^(e) = F C_i^(e) + ε_i^(e),

where C_i^(e) denotes an m × K_e matrix of factor loadings with finite mean C^(e), and ε_i^(e) is the T × K_e matrix of errors. Assuming that the components of this DGP also satisfy Assumptions 1–3 and 4, the augmented matrix of CSA, Z̄_A = [Z̄, Z̄^(e)], may satisfy the rank condition, because it can be written as

(A.2) Z̄_A = F[C̄, C̄^(e)] + [Ū, ε̄^(e)] = F C_A + O_p(N^(−1/2)),

where C_A = [C, C^(e)]. Given that Z_i^(e) loads on the same set of factors F, the augmented loading matrix C_A is of order m × (1 + K + K_e). Therefore, the augmentation can restore the RC provided that m ≤ 1 + K + K_e and C^(e) is also sufficiently distinct from C.
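As a quick numerical illustration of this mechanism (a sketch with arbitrary illustrative dimensions, not part of the formal argument), the snippet below draws random loading matrices with m = 3 and K = K_e = 1 and checks that the augmented mean loading matrix attains full row rank:

```python
import numpy as np

rng = np.random.default_rng(0)
m, K, Ke = 3, 1, 1                 # m > K + 1: the RC fails for Z-bar alone

C = rng.normal(size=(m, K + 1))    # mean loadings of the model variables
C_e = rng.normal(size=(m, Ke))     # mean loadings of the external covariates

print(np.linalg.matrix_rank(C))                    # 2 = K + 1 < m
print(np.linalg.matrix_rank(np.hstack([C, C_e])))  # 3 = m: RC restored
```

A random Gaussian m × (K + 1) matrix has rank K + 1 almost surely, so the augmentation generically adds the missing rank, exactly as the condition m ≤ 1 + K + K_e suggests.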

An alternative idea is to make use of external variables as additional weights, in order to construct weighted CSA. Such an approach has been recently advocated by Juodis and Sarafidis (Citation2022b), Fan and Liao (Citation2020), Juodis and Sarafidis (Citation2022a) and, in the present context of CCE estimation, by Karabiyik et al. (Citation2019).

To illustrate, let w_i denote an external, time-invariant variable.Footnote19 Multiplying Eq. (Equation3) by w_i and summing over i yields

(A.3) Z̄_w = F C̄_w + Ū_w,

where Z̄_w = Σ_{i=1}^N Z_i w_i is T × (K + 1), C̄_w = Σ_{i=1}^N C_i w_i is m × (K + 1), and Ū_w = Σ_{i=1}^N U_i w_i is T × (K + 1). As shown by Karabiyik et al. (Citation2019), when C_i and w_i are correlated, but U_i and w_i are not, then Z̄_w = F C_w + O_p(N^(−1/2)) and C̄_w converges to a nonzero matrix C_w.Footnote20 If C_w is also sufficiently distinct from C, the resulting Z̄_w provides new (i.e., rank-increasing) information on F, and the rank of the augmented matrix Z̄_A = [Z̄, Z̄_w] is increased. As the authors point out, w_i effectively acts as an instrument for C_i, and multiple w_i can be combined in an attempt to restore the RC.Footnote21
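The rank-increasing role of loading-correlated weights can be checked numerically. In the sketch below (all dimensions, the loading matrix, and the weight construction are illustrative assumptions, not taken from the paper), the mean loading matrix C is rank-deficient, and weights that correlate with the loading deviations in the "missing" direction yield a weighted average C̄_w that adds rank, whereas equal weights merely reproduce C:

```python
import numpy as np

rng = np.random.default_rng(2)
N, m, K = 10_000, 3, 1

# Mean loading matrix with rank 2 < m = 3: the rank condition fails for Z-bar.
C = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [0.0, 0.0]])
eta = rng.normal(size=(N, m, K + 1))      # loading deviations, C_i = C + eta_i
C_i = C + eta

# Weights correlated with the loadings (here: with the deviation in the
# third row, the direction missing from col(C)), but not with the errors.
w = 1.0 / N + eta[:, 2, 0] / N
C_w = np.einsum('i,ijk->jk', w, C_i)      # C-bar_w = sum_i C_i w_i

# Uninformative benchmark: equal weights reproduce the simple average of C_i.
C_w0 = np.einsum('i,ijk->jk', np.full(N, 1.0 / N), C_i)

print(np.linalg.matrix_rank(np.hstack([C, C_w]), tol=0.1))   # 3: RC restored
print(np.linalg.matrix_rank(np.hstack([C, C_w0]), tol=0.1))  # 2: no new info
```

The tolerance 0.1 separates the O(1) rank-increasing component contributed by the correlated weights from the O_p(N^(−1/2)) sampling noise that is present in both weighted averages.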

Lastly, one can also employ deterministic averaging weights, such as binary indicators that give rise to group-specific cross-sectional averages. For example, in a panel of countries, individual units may be classified as developed, emerging and developing economies; in a panel of firms, units may be grouped according to their size or sector; and so on. In many cases, such group memberships are known and the group-specific averages can be more informative factor proxies than the simple (overall) average.
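To see why group-specific averages can be more informative than the overall average, consider the following sketch (illustrative dimensions and randomly drawn group loadings; the group structure is an assumption for the example). When two known groups have distinct mean loadings, stacking the group-specific averages spans more of the factor space than pooling them:

```python
import numpy as np

rng = np.random.default_rng(3)
m, K = 3, 1

# Known group memberships: two groups with distinct mean loading matrices.
C_g1 = rng.normal(size=(m, K + 1))
C_g2 = rng.normal(size=(m, K + 1))
C_pooled = 0.5 * (C_g1 + C_g2)    # loadings behind the overall (simple) CSA

print(np.linalg.matrix_rank(C_pooled))                  # 2 < m
print(np.linalg.matrix_rank(np.hstack([C_g1, C_g2])))   # 3 = m
```

The pooled average can support at most K + 1 factor proxies, while the group-specific averages supply up to G(K + 1) columns, here enough to cover all m factors.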

Appendix B

Proofs of theoretical results

B.1. Proof of Theorem 1

Let M be a given T × n matrix (T > n) and let Φ = [φ_1, …, φ_n]′ be an n × T random matrix, where φ_1, …, φ_n are i.i.d. N(0, I_T) random vectors in R^T.

Part (i). We wish to prove that Pr[rank(ΦM) = rank(M)] = 1.

Case 1. 𝐌 has full column rank, i.e., rank(𝐌) = n.

Consider the row representation of the matrix product, whose kth row is φ_k′M:

ΦM = [φ_1′M; φ_2′M; …; φ_n′M].

It suffices to show that

(B.1) Pr[{φ_1′M, …, φ_n′M} are linearly independent] = 1 ⟺ Pr[{M′φ_1, …, M′φ_n} are linearly independent] = 1.

Let z_i = M′φ_i denote an n × 1 vector, for i = 1, …, n. Since φ_1, …, φ_n are i.i.d. N(0, I_T), it follows that z_1, …, z_n are i.i.d. N(0, M′M), with M′M nonsingular because M has full column rank. Let z̃_i denote a specific realization of z_i, where z̃_i ∈ R^n. Define y = vec(z_1, …, z_n) and ỹ = vec(z̃_1, …, z̃_n), both n² × 1 vectors. Let A = {ỹ ∈ R^(n²) : z̃_1, …, z̃_n are linearly dependent}.

We have

(B.2) Pr[z_1, …, z_n are linearly dependent] = E(I[y ∈ A]) = E(E(I[y ∈ A] | z_1, …, z_{n−1})) = 0,

because E(I[y ∈ A] | z_1, …, z_{n−1}) = 0 almost surely: conditional on z_1, …, z_{n−1}, the vector z_n is continuously distributed on R^n, while the span of z_1, …, z_{n−1} is a proper subspace of R^n and hence has Lebesgue measure zero.

Therefore, we have proved that Pr[z1,,zn are linearly dependent]=0, and thereby Pr[rank(ΦM)=rank(M)=n]=1.

Case 2. M has less than full column rank, i.e., rank(M) = n_1 < n. Partition M = [M_1 ⋮ M_2], where M_1 is T × n_1 and M_2 is T × (n − n_1), such that rank(M_1) = n_1. Similarly, partition Φ = [Φ_1′, Φ_2′]′, where Φ_1 and Φ_2 are n_1 × T and (n − n_1) × T, respectively, with rank(Φ_1) = n_1 with probability 1. Thus, the product of Φ and M can be written as the n × n block matrix

(B.3) ΦM = [Φ_1 M_1, Φ_1 M_2; Φ_2 M_1, Φ_2 M_2].

Based on exactly the same arguments as in Case 1, it can be shown that rank(Φ_1 M_1) = n_1 with probability 1. Since Φ_1 M_1 is a submatrix of ΦM, rank(ΦM) ≥ n_1. Therefore,

n_1 = rank(Φ_1 M_1) ≤ rank(ΦM) ≤ min{rank(Φ), rank(M)} = n_1.

Hence, rank(ΦM)=n1= rank(𝐌) with probability 1. This completes part (i) of the theorem.
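Part (i) is easy to verify by simulation. The sketch below (an illustrative Monte Carlo check, with arbitrary dimensions) builds a rank-deficient M and confirms that multiplication by a random Gaussian matrix preserves its rank in every draw:

```python
import numpy as np

rng = np.random.default_rng(4)
T, n = 50, 4

# Build a T x n matrix with deficient column rank: rank(M) = 3 < n = 4.
B = rng.normal(size=(T, 3))
M = np.hstack([B, B[:, [0]] + B[:, [1]]])   # last column is a linear combination

# Multiplying by random Gaussian matrices preserves the rank with probability 1.
ranks = [np.linalg.matrix_rank(rng.normal(size=(n, T)) @ M, tol=1e-8)
         for _ in range(100)]
print(set(ranks))   # {3}: rank(Phi M) = rank(M) in every draw
```

The explicit tolerance simply separates the exactly-zero singular value from floating-point noise; the theoretical statement holds exactly, with probability 1.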

Part (ii). By simple addition and subtraction we can write

Ψ′Z̄ = T^(−1/2) Φ Z̄ = T^(−1/2) (Φ F C̄ + Φ Ū) = T^(−1/2) Φ F C + T^(−1/2) Φ F (C̄ − C) + T^(−1/2) Φ Ū.

Recall that Φ = [φ_1, …, φ_n]′, with rows φ_k′, where φ_k ~ i.i.d. N(0_{T×1}, I_T) for k = 1, …, n. Consider then the kth row of T^(−1/2) Φ Ū and write it as T^(−1/2) Σ_{t=1}^T φ_{kt} ū_t′, with φ_k = [φ_{k1}, …, φ_{kT}]′ and Ū = [ū_1, …, ū_T]′. By the independence of Φ and Ū we have, for every k = 1, …, n, that E(Σ_{t=1}^T φ_{kt} ū_t′) = 0 and

Var(T^(−1/2) Σ_{t=1}^T φ_{kt} ū_t′) = (1/T) Σ_{t=1}^T Σ_{s=1}^T E(φ_{kt} φ_{ks}) E(ū_t ū_s′) = (1/T) Σ_{t=1}^T E(ū_t ū_t′) = O(N^(−1)),

because E(φ_{kt} φ_{ks}) = 0 for s ≠ t, E(φ_{kt} φ_{kt}) = E(φ_{kt}²) = 1, and E(‖ū_t‖²) = O(N^(−1)) by (A.4) of Lemma 1 in Pesaran (Citation2006) under Assumptions 1 and 4. Hence, T^(−1/2) Φ Ū = O_p(N^(−1/2)) as (N, T) → ∞.

Consider next T^(−1/2) Φ F. By the independence of Φ and F we have E(Φ F) = 0_{n×m}, and since F = [f_1, …, f_T]′ we can write the kth row of Φ F as Σ_{t=1}^T φ_{kt} f_t′. Therefore,

Var(T^(−1/2) Σ_{t=1}^T φ_{kt} f_t′) = (1/T) Σ_{t=1}^T E(φ_{kt}²) E(f_t f_t′) = (1/T) Σ_{t=1}^T E(f_t f_t′) = O(1),

because E(f_t f_t′) = O(1) for every t (Assumption 2). Hence, we have T^(−1/2) Φ F = O_p(1). Noting then that C̄ − C = O_p(N^(−1/2)) under Assumption 3, it follows that ‖T^(−1/2) Φ F (C̄ − C)‖ ≤ ‖T^(−1/2) Φ F‖ ‖C̄ − C‖ = O_p(N^(−1/2)).

Thus, combining the results above yields, as (N, T) → ∞,

Ψ′Z̄ = T^(−1/2) Φ F C + T^(−1/2) Φ F (C̄ − C) + T^(−1/2) Φ Ū = T^(−1/2) Φ F C + O_p(N^(−1/2)),

where also ‖T^(−1/2) Φ F C‖ ≤ ‖T^(−1/2) Φ F‖ ‖C‖ = O_p(1), since ‖C‖ < ∞ under Assumption 3. Hence, the proof of part (ii) of the theorem is complete.
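The O_p(N^(−1/2)) approximation in part (ii) can also be illustrated by simulation. The sketch below (an illustrative DGP with arbitrary dimensions; all names are assumptions for the example) computes ‖Ψ′Z̄ − T^(−1/2) Φ F C‖ for one simulated panel and shows that the residual shrinks as N grows, holding F, C, and Φ fixed across the two panel sizes via a common seed:

```python
import numpy as np

def psi_residual(N, T=200, m=2, K=1, n=3, seed=5):
    """Norm of Psi'Z-bar minus its leading term T^{-1/2} Phi F C (illustrative)."""
    rng = np.random.default_rng(seed)
    F = rng.normal(size=(T, m))               # common factors
    C = rng.normal(size=(m, K + 1))           # mean factor loadings
    Phi = rng.normal(size=(n, T))             # Gaussian projection matrix
    C_i = C + rng.normal(size=(N, m, K + 1))  # unit-specific loadings
    U_i = rng.normal(size=(N, T, K + 1))      # idiosyncratic errors
    # Z_i = F C_i + U_i, averaged over i to obtain Z-bar:
    Zbar = (np.einsum('tj,ijk->itk', F, C_i) + U_i).mean(axis=0)
    return np.linalg.norm(Phi @ Zbar / np.sqrt(T) - Phi @ F @ C / np.sqrt(T))

# The residual shrinks at roughly the N^{-1/2} rate as N grows:
print(psi_residual(100), psi_residual(10_000))
```

Since the remainder T^(−1/2) Φ (F(C̄ − C) + Ū) is driven entirely by cross-sectional averaging error, increasing N by a factor of 100 should shrink the printed residual by roughly a factor of 10.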

Appendix C

Additional simulation results

C.1. Homogeneous slopes

Table C.1. Algorithm 1: Selection percentages for expansion CSA.

Table C.2. Algorithm 1: Sensitivity and specificity.

C.1.1. Estimation results

Table C.3. Estimation results for β in Experiment 1.

Table C.4. Estimation results for β in Experiment 2.

Table C.5. Estimation results for β in Experiment 3.

C.2. Heterogeneous slopes

Table C.6. Algorithm 1: Selection percentages for expansion CSA (heterogeneous slopes).

Table C.7. Algorithm 1: Sensitivity and Specificity (heterogeneous slopes).

C.2.1. Evaluating the rank condition

Table C.8. Under/over-estimation frequency of the estimators for ϱ and m (heterogeneous slopes).

Table C.9. Evaluating the rank condition: Experiment 1 (heterogeneous slopes).

Table C.10. Evaluating the rank condition: Experiment 2 (heterogeneous slopes).

Table C.11. Evaluating the rank condition: Experiment 3 (heterogeneous slopes).

C.2.2. Estimation results

Table C.12. Estimation results for heterogeneous β in Experiment 1.

Table C.13. Estimation results for heterogeneous β in Experiment 2.

Table C.14. Estimation results for heterogeneous β in Experiment 3.

Appendix D

Post-selection inference

One remaining question is how to perform inference with the CCEA estimator after application of Algorithm 1. To address this, let β̂_A be the estimated parameter vector that follows from Algorithm 1, calculated with the (potentially) augmented set of cross-section averages Z̄_A. We shall assume that Z̄^+ contains sufficient valid augmentations to restore the rank condition through Z̄_A in case the RC is violated with Z̄. By the law of total probability, the distribution of β̂_A is given by

Pr(√(NT)(β̂_A − β) ≤ δ) = Pr(√(NT)(β̂_A − β) ≤ δ | ϱ(Z̄_A) = m) Pr(ϱ(Z̄_A) = m) + Pr(√(NT)(β̂_A − β) ≤ δ | ϱ(Z̄_A) ≠ m)(1 − Pr(ϱ(Z̄_A) = m)),

where ϱ(Z̄_A) denotes the rank of the loading matrix implied by Z̄_A, such that Pr(ϱ(Z̄_A) = m) is the probability that the classifier returns R̂C = 0 when the RC was not satisfied (so that the augmentation sequence is triggered) and an RC-restoring set of augmentations was selected from Z̄^+ by the IC criterion.

Since the classifier correctly evaluates the rank condition as (N, T) → ∞ by Proposition 2, and hence leads to the IC selection step in stage (3) and beyond of Algorithm 1 if ϱ(Z̄) ≠ m, we have, by the consistency of (Equation19) for selecting the correct rank-restoring averages, established in Karabiyik et al. (Citation2019), that Pr(ϱ(Z̄_A) = m) → 1 as (N, T) → ∞. Hence, asymptotically,

Pr(√(NT)(β̂_A − β) ≤ δ) = Pr(√(NT)(β̂_A − β) ≤ δ | ϱ(Z̄_A) = m),

such that the distribution of the augmented estimator asymptotically equals that of the CCE estimator when the rank condition is satisfied. As is well known from the CCE literature, this asymptotic distribution is independent of the specific choice of CSA provided that T/N → 0 (see, e.g., Theorem 1 in Karabiyik et al. (Citation2019)). Hence, the distribution of √(NT)(β̂_A − β) is asymptotically unaffected by pre-testing and augmentation, and inference can proceed as for the original CCE approach, with the rank condition satisfied, provided T/N → 0.