Full article: Improved inference for the panel data model with unknown unit-specific heteroscedasticity: A Monte Carlo evidence

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

For a panel data model (PDM), it is common that the error terms of panel regression model are heteroscedastic. In the available literature, the heteroscedastic consistent covariance matrix estimators (HCCMEs) have been used for adequate testing of the coefficients of PDM. Usually, these HCCMEs are based on the residuals derived from ordinary least square (OLS) estimator which is considerably inefficient in the presence of heteroscedasticity. To get efficient estimation, the existing literature proposes some adaptive estimators for the PDM. This paper presents the HCCMEs, derived from some adaptive estimator, while considering the panel data-set with unit-specific heteroscedasticity. Through the Monte Carlo simulations, we present the numerical evaluation and attractive findings.

Keywords:

AMS subject classification:

Public Interest Statement

Panel data are multi-dimensional data consisting of measurements over time. The observations of multiple phenomena, obtained over multiple time periods for the same companies, firms, countries or individuals etc., constitute the panel data. Panel data have several advantages over purely crosssectional or purely time series data. To model such data, a panel data model (PDM) is used that provides information on individual behaviour, both across individuals and over time. In spite of its many advantages, a PDM may pose several estimation and inference problems due to several reasons and heteroscedasticity in cross-sectional units at the same point in time (i.e. unit-specific heteroscedasticity) is one of them. The present article addresses the same issue and suggests how one can improve in the inferential issue in the presence of unit-specific heteroscedasticity.

1. Introduction

In econometrics, one important type of data is known as the panel data. Panel data are based on various observations, collected from same individuals over several time periods. A regression model that fits panel data is known as the panel data model (PDM). In econometrics, analysis of PDM is the most dynamic assortment somewhat in light of the fact that panel data-sets give a rich domain to advancement of estimation methods and hypothetical results. In more useful terms, researchers have possessed the capacity to utilize time-series and cross-sectional data to inspect issues which could not be handled in either setting alone.

An important assumption of classical linear regression model (CLRM) is homoscedasticity that the variance of error term remains constant and thus, the error term is identically distributed. If this assumption is not met, there exists the issue of heteroscedasticity and the OLS results are inadequate in this case. Heteroscedasticity is a common problem in the PDM and it is desirable to concentrate on it for making robust inference. The ordinary technique used for estimation of PDM like the OLS does not lead to efficient estimation and correct inference in the presence of heteroscedasticity. The OLS estimator is not biased and inconsistent but does not remain best linear unbiased estimator (BLUE) when the assumption of homoscedasticity is violated. Furthermore, usual t and F statistic are unable to construct precise confidence interval and to perform correct testing of hypothesis. Moreover, presence of the high leverage points in given data-set may also lead to incorrect inference. Therefore, focus of this study is to bring improvement in inference of linear PDM suffering from heteroscedasticity, namely the unit-specific heteroscedasticity (USH).

Mazodier and Trognon (Citation1978) were the first who studied the problem of heteroscedasticity in the PDM, and later, Baltagi and Griffin (Citation1988) and Randolph (Citation1988) considered it. For the efficient estimation of the PDM under heteroscedasticity, some adaptive estimators are available in the literature. Li and Stengos (Citation1994) developed an adaptive estimator for the unit time-varying heteroscedasticity (UTVH) and Roy (Citation2002) proposed an adaptive estimator for the USH. Baltagi, Bresson, and Pirotte (Citation2004) studied performance of both of these estimators and found that Roy’s estimator performed well in terms of relative MSE and was not dependent on selection of bandwidth. However, the estimator proposed by Li and Stengos for UTVH showed loss in efficiency for smaller bandwidth but performed well under higher bandwidth.

To tackle the problem of heteroscedasticity, Eicker (Citation1963) and White (Citation1980) proposed HCCME for non-panel data which made it conceivable to draw asymptotically robust inference. In the existing literature, it can be seen that Arellano (Citation1987) built White’s estimator for the PDM. For common regression models, Ahmed, Aslam, and Pasha (Citation2011) and Aslam, Riaz, and Altaf (Citation2013) used the adaptive HCCME (AHCCME). Cribari-Neto (Citation2004) proposed a variant of the HCCME for common regression models to take into account the effect of leverage points. This estimator is known as HC4. Cribari-Neto, Souza, and Vasconcellos (Citation2007) proposed another version of the HCCME for linear regression models to study the effect of maximal leverage on associated inference. It is termed as the HC5. Adaptive versions of the HC4 and HC5 have been used for common regression models by Aslam et al. (Citation2013). It has been noticed in the available literature that the AHCCMEs are not used for the PDM. Therefore, this is the main concern of the current study.

This article unfolds as follows. Section 2 describes the PDM with USH and adaptive estimator. Section 3 describes the AHCCME. In Section 4, the quasi-t test statistic and computation of confidence interval and power of test are discussed. Empirical results are presented in Section 5. An illustrative example has been given in Section 6 and, finally, Section 7 concludes the said work.

2. Adaptive estimator for USH

Following Li and Stengos (Citation1994) and Roy (Citation2002), consider the standard one-way error component model(1) $y_{it} = x_{it} β + u_{it}; i = 1, 2, \dots, N; t = 1, 2, \dots, T with u_{it} = μ_{i} + v_{it},$ (1)

where x_it is 1 × q, $v_{it} \sim i.i.d. (0, σ_{v}^{2})$ is a unit-time varying error component (UTVEC) and $μ_{i}$ is the unit-specific error (USE) component assumed to be i.i.d. with that $E (μ_{i} | {\bar{x}}_{i .}) = 0,$ , $v a r (μ_{i} | {\bar{x}}_{i .}) = ω ({\bar{x}}_{i .}) = ω_{i},$ where ${\bar{x}}_{i .} = \frac{1}{T} \sum_{t = 1}^{T} x_{it}$ . In other words, the conditional variance of USE is suffering from heteroscedasticity of unknown form. Throughout the paper, we assume T is small and N is large.

Matrix form of Model (1) is presented by Li and Stengos (Citation1994) as(2) $y = x β + Z μ + v,$ (2)

where $Z = I_{n} \otimes e_{T}$ , e_T is a T-dimensional column vector of ones, ⊗ denotes the Kronecker product, $μ = {[μ_{1}, μ_{2}, \dots, μ_{N}]}^{'}, y$ , and v are the NT × 1 column vectors of dependent variable and UTVEC, respectively, while x is an NT × q matrix of regressors.

Following the work of Baltagi and Griffin (Citation1988), Roy (Citation2002) presented inverse of the conditional variance–covariance matrix of error term (Zμ + v) in (2), which is denoted by W⁻¹,(3) $W^{- 1} = diag (\frac{1}{σ_{i}^{2}}) \otimes (\frac{J_{T}}{T}) + diag (\frac{1}{σ_{v}^{2}}) \otimes (I_{T} - \frac{J_{T}}{T}),$ (3)

where $σ_{i}^{2} = T ω_{i} + σ_{v}^{2} \forall i and J_{T} is T \times T$ matrix of ones. For Model (2), the true generalized least square (TGLS) estimator of β is(4) $\hat{β} = {(x^{'} W^{- 1} x)}^{- 1} x^{'} W^{- 1} y,$ (4)

The estimation of (4) involves covariance matrix of order NT × NT. So for large data-set, Roy proposed following version of (4)(5) $\hat{β} = {(\sum_{i = 1}^{N} x_{i}^{'} A_{i}^{- 1} x_{i})}^{- 1} \sum_{i = 1}^{N} x_{i}^{'} A_{i}^{- 1} y_{i},$ (5)

where x_i is a T × q matrix of regressors for the ith individual, y_i is T × 1 and $A_{i}^{- 1}$ is T × T covariance matrix(6) $A_{i}^{- 1} = \frac{1}{γ_{i} (1 - ρ_{i})} [I_{T} - \frac{e_{T} e_{T}^{'} ρ_{i}}{1 - ρ_{i} + T ρ_{i}}],$ (6)

where $ρ_{i} = \frac{ω_{i}}{γ_{i}}$ and $γ_{i} = ω_{i} + σ_{v}^{2} .$ To find estimates of $ω_{i}, σ_{v}^{2}$ and γ_i need to be estimated which are unknown parameters in (6). Roy (Citation2002) estimated $σ_{v}^{2}$ as

${\hat{σ}}_{v}^{2} = \frac{\sum_{i = 1}^{N} \sum_{t = 1}^{T} {[(y_{it} - {\bar{y}}_{i .}) - {\hat{β}}_{w}^{'} (x_{it} - {\bar{x}}_{i .})]}^{2}}{N (T - 1) - q},$

where ${\bar{y}}_{i .}$ is similarly defined as ${\bar{x}}_{i .}$ and ${\hat{β}}_{w}$ is the within group estimator (WGE) (for more details; see (Greene, Citation1997)).

Roy (Citation2002) defined $γ_{i} = E (u_{it}^{2} | {\bar{x}}_{i .}) = ω_{i} + σ_{v}^{2}$ and proposed the following kernel estimator for $γ_{i}$ (7) ${\hat{γ}}_{i} = \frac{\sum_{j = 1}^{N} \sum_{t = 1}^{T} {\hat{u}}_{jt}^{2} K \{\frac{({\bar{x}}_{i .} - {\bar{x}}_{j .})}{d}\}}{\sum_{j = 1}^{N} \sum_{t = 1}^{T} K \{\frac{({\bar{x}}_{i .} - {\bar{x}}_{j .})}{d}\}}; i = 1, 2, \dots, N,$ (7)

where ${\hat{u}}_{jt}^{2}$ is the OLS residual from the regression of y_jt on x_jt, $K {\cdot}$ is the kernel function with d as the smoothing parameter. Using (7), the estimates of ω_i can be found as ${\hat{ω}}_{i} = {\hat{γ}}_{i} - {\hat{σ}}_{v}^{2}$ and hence an estimator of $A_{i}^{- 1}$ can be obtained as

${\hat{A}}_{i}^{- 1} = \frac{1}{{\hat{γ}}_{i} (1 - {\hat{ρ}}_{i})} [I_{T} - \frac{e_{T} e_{T}^{'} {\hat{ρ}}_{i}}{1 - {\hat{ρ}}_{i} + T {\hat{ρ}}_{i}}] .$

The AGLS estimator of β is then obtained as

$\hat{β} = {(\sum_{i = 1}^{N} x_{i}^{'} A_{i}^{- 1} x_{i})}^{- 1} \sum_{i = 1}^{N} x_{i}^{'} A_{i}^{- 1} y_{i} .$

3. The AHCCME

For common regression models (i.e. models for cross-sectional data), Ahmed et al. (Citation2011) used the AHCCME (AHC0-AHC3). For common regression models, Cribari-Neto (Citation2004) proposed HC4 and Cribari-Neto et al. (Citation2007) proposed HC5. The AHC4 and AHC5 have been used for common regression models by Aslam et al. (Citation2013). However, such covariance estimators have not been studied yet by any author for the PDM. According to the above cited studies, the HC3, HC4, and HC5 give attractive performance to improve testing. Therefore, in the present study, we skip HC1 and HC2 but HC0 is included as a standard estimator.

The usual covariance matrix of $\hat{β}$ is $Ψ = {(\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} x_{i})}^{- 1} (\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} E (u_{i} u_{i}^{'}) {\hat{A}}_{i}^{- 1} x_{i}) {(\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} x_{i})}^{- 1} .$

Following White (Citation1980), Ahmed et al. (Citation2011), and Aslam et al. (Citation2013), we define a consistent estimator of the PDM as follows:(8) ${\hat{Ψ}}^{(0)} = {(\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} x_{i})}^{- 1} (\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} φ_{i}^{(0)} {\hat{A}}_{i}^{- 1} x_{i}) {(\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} x_{i})}^{- 1},$ (8)

where ${\hat{ϕ}}_{i}^{(0)} = ({\hat{u}}_{i} {\hat{u}}_{i}^{'})$ and ${\hat{u}}_{i} = {({\hat{u}}_{i 1}, {\hat{u}}_{i 2}, \dots, {\hat{u}}_{iT})}^{'}$ is the AGLS residual given as $\hat{u} = y - x \hat{β}$ .

The estimator in (8) is termed as AHC0.

Consider $h_{i} = {(h_{i 1}, h_{i 2}, \dots, h_{iT})}^{'}$ and h_it as the itth diagonal element of hat matrix $H = \sum_{i = 1}^{N} x_{i} {(\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} x_{i})}^{- 1} {\hat{A}}_{i}^{- 1} x_{i},$ then the AHC3 can be defined as $AHC 3 = {\hat{Ψ}}^{(3)} = {(\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} x_{i})}^{- 1} (\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} {\hat{φ}}_{i}^{(3)} {\hat{A}}_{i}^{- 1} x_{i}) {(\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} x_{i})}^{- 1},$

where ${\hat{ϕ}}_{i}^{(3)} = ({\hat{u}}_{i} {\hat{u}}_{i}^{'}) d i a g {(1 - h_{i})}^{- 2}$ (see similar construction of the HCCME in Uchôa, Cribari-Neto, and Menezes (Citation2014)). An observation with $h_{it} \geq \frac{2 q}{NT}$ is declared as a high leverage point by Hoaglin and Welsch (Citation1978). A general rule-of-thumb, cited in Cribari-Neto (Citation2004), is that the values of h_it in excess of two or three times the average (i.e. $\frac{2 q}{NT}$ and $\frac{3 q}{NT}$ ) are regarded as influential. The adaptive versions of Cribari-Neto (Citation2004) and Cribari-Neto et al. (Citation2007) estimator have not been studied in context of the PDM yet by any author. Therefore, we propose to use AHC4 and AHC5 for the PDM. The AHC4 and AHC5 are $AHC 4 = {\hat{Ψ}}^{(4)} = {(\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} x_{i})}^{- 1} (\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} {\hat{φ}}_{i}^{(4)} {\hat{A}}_{i}^{- 1} x_{i}) {(\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} x_{i})}^{- 1},$

where ${\hat{φ}}_{i}^{(4)} = ({\hat{u}}_{i} {\hat{u}}_{i}^{'}) diag {(1 - h_{i})}^{- π_{i}}$ , $π_{i} = min (4, \frac{h_{i 1}}{{\bar{h}}_{i 1}}), \dots, min (4, \frac{h_{iT}}{{\bar{h}}_{iT}})$ , ${\bar{h}}_{it}$ being the average

leverage (i.e. the average value of all leverages). Since 0 < h_i < 1 and $π_{i} > 0,$ hence $0 < {(1 - h_{i})}^{π_{i}} < 1 .$ $AHC 5 = {\hat{Ψ}}^{(5)} = {(\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} x_{i})}^{- 1} (\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} {\hat{φ}}_{i}^{(5)} {\hat{A}}_{i}^{- 1} x_{i}) {(\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} x_{i})}^{- 1},$

where ${\hat{φ}}_{i}^{(5)} = ({\hat{u}}_{i} {\hat{u}}_{i}^{'}) d i a g {(1 - h_{i})}^{- δ_{i}},$ $δ_{i} = m i n [(\frac{h_{i 1}}{{\bar{h}}_{i 1}}, \dots, \frac{h_{iT}}{{\bar{h}}_{iT}}), m a x (\{4, \frac{c h_{\max}}{{\bar{h}}_{i 1}}\}, \dots, \{4, \frac{c h_{\max}}{{\bar{h}}_{iT}}\})],$

0 < c < 1, h_max is the maximum value of leverages. Since 0 < h_i < 1 and δ_i > 0, hence $0 < {(1 - h_{i})}^{δ_{i}} < 1$ .

Generally, the estimators presented above can be written in unified fashion as(9) ${\hat{Ψ}}^{(s)} = {(\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} x_{i})}^{- 1} (\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} {\hat{φ}}_{i}^{(s)} {\hat{A}}_{i}^{- 1} x_{i}) {(\sum_{i = 1}^{N} x_{i}^{'} {\hat{A}}_{i}^{- 1} x_{i})}^{- 1}, s = 0, 3, 4, 5 .$ (9)

4. Adaptive heteroscedasticity consistent interval estimators (AHCIE), test statistic, and power of test

Cribari-Neto and Lima (Citation2009) considered heteroscedasticity consistent interval estimators (HCIE) based on $\hat{β}$ and HCCMEs for common regression models. Aslam et al. (Citation2013) used the AHCIE in their work for non-panel regression models. But, we are going to consider the AHCIE for the PDM.

Let $θ = h (β) : R^{k} \to R$ be a function of parameter of interest, $\hat{θ}$ is its estimate and $s (\hat{θ})$ is the asymptotic standard error. Consider the studentized statistic, $t_{n} = \frac{\hat{θ} - θ}{s (\hat{θ})} .$ It is quite easy to show that $t_{n} \to^{d} N (0, 1) .$

Consider the hypothesis, $H_{0} : β_{r} = β_{r}^{0}$ against $H_{1} : β_{r} = β_{r}^{1}$ , where $β_{r}^{0}$ is a hypothesized value of β_r under H₀.

Under homoscedasticity, the test statistic for above given hypothesis is(10) $\hat{t} = \frac{({\hat{β}}_{r} - β_{r}^{0})}{\sqrt{{\hat{Ψ}}_{(r r)}}},$ (10)

where ${\hat{Ψ}}_{(r r)}$ is the rth diagonal element of Ψ and r = 0, 1, 2,…, q−1. Then $\hat{t}$ is likely to follow a Student’s t distribution with degree of freedom (NT − tr(H)), such that $\hat{t} > t_{1 - \frac{α}{2}, N T - t r (H)}$ . For large sample size, the quantity above converges in distribution to the standard normal distribution (for more details; see (Cribari-Neto & Lima, Citation2009)). Thus, a test of asymptotic significance α rejects H₀ if $|\hat{t}| > Z_{1 - \frac{α}{2}},$ where $Z_{1 - \frac{α}{2}}$ is the $(1 - \frac{α}{2})$ quantile of standard normal distribution. Thus, the true size of test can be computed as(11) $P (reject H_{0} | H_{0}) = P (|\hat{t}| > Z_{1 - \frac{α}{2}} | β_{r} = β_{r}^{0}) .$ (11)

Similar, the power of test can be measured as(12) $P (reject H_{0} | H_{1}) = P (|\hat{t}| > Z_{1 - \frac{α}{2}} | β_{r} = β_{r}^{1}) .$ (12)

when the errors are heteroscedastic, the statistic in (10) can be re-defined as follows:

$\hat{t} = \frac{({\hat{β}}_{r} - β_{r}^{0})}{\sqrt{{\hat{Ψ}}_{(r r)}^{(s)}}}$ , s = 0, 3, 4 and 5.

In a similar manner, the confidence interval can be constructed. A (1−α)×100% (two tailed) confidence interval based on the AHCCME is(13) ${\hat{β}}_{r} \pm Z_{1 - \frac{α}{2}} \sqrt{{\hat{Ψ}}_{(r r)}^{(s)}}; s = 0, 3, 4, 5 .$ (13)

5. Empirical results

For the empirical results, we use the same Monte Carlo scheme as used in some previous studies like Li and Stengos (Citation1994), Roy (Citation2002), and Rilstone (Citation1991)

The considered model is $y_{it} = β_{0} + β_{1} x_{it} + μ_{i} + v_{it}; i = 1, 2, \dots, N; t = 1, 2, \dots, T,$

where x_it = 0.5w_i,t-1 + w_it and $w_{it} \sim i.i.d. E x p (N (0, 0 . 4^{2}))$ , i.e. w_it is generated from lognormal distribution. The values assigned to β₀ and $β_{1}$ are 5 and 0.5, respectively. The v_it and $μ_{i}$ can be generated as $v_{it} \sim i . i . d . N (0, σ_{v}^{2})$ , $μ_{i} \sim i . i . d . N (0, ω_{i})$ , with $ω_{i} = ω ({\bar{x}}_{i}) = α^{2} {(1 + {\bar{x}}_{i})}^{2}$ . It is supposed that heteroscedasticity is of additive form. Let the total variance $γ_{i}$ and the expected variance of μ_i is $γ_{i} = ω_{i} + σ_{v}^{2}$ and $\bar{ω}$ , respectively. For comparison across different data generating process, the expected total variance is set to be $ω_{i} + σ_{v}^{2} = 8$ . The values of $λ$ are 0, 1, 2, and 3, where 0 indicates the homoscedastic USE and other shows different levels of heteroscedasticity for the fixed value of $σ_{v}^{2}$ and the values assigned to $σ_{v}^{2}$ are 2, 4, and 6. Increase in $λ$ cause increase in degree of heteroscedasticity. Moreover, the value of $\bar{ω}$ can be obtained using different values of $λ$ for each value of $σ_{v}^{2}$ and α is obtained using the additive heteroscedastic design specified above for given $λ$ . Thus, the values of ω_ifor each $σ_{v}^{2}$ under the four different values of $λ$ are obtained. The Gaussian kernel cited in Roy (Roy, Citation2002) is used and defined as $k (x) = \frac{1}{\sqrt{2 π}} e^{- \frac{x^{2}}{2}}$ .

Roy (Citation2002) used 0.5, 1, and 1.5 as bandwidth. In the present work, we used 0.5 as bandwidth.

The simulations are 5000 with two schemes for fixed small T but large N:

(1)	Scheme I: N = 50; T = 3; NT = 150
(2)	Scheme II: N = 100; T = 3; NT = 300

For empirical investigation, the concerned estimators are given below:

(1)	The pooled OLSE
(2)	The WGE
(3)	The AGLS estimator (AGLSE)
(4)	The AHCCME

The numerical evaluation in this section has been divided into four parts; the first part compares the efficiency of the estimators in terms of mean squares error (MSE), the second part presents evaluation of the covariance estimators for their performance in hypothesis testing in terms of null rejection rate (NRR), the third part is reserved for performance of the AHCIEs, and the fourth part compares performance of the estimators in terms of power of test. Empirical size and coverage is given in percentage form. Empirical size is studied at 1, 5, and 10% nominal level of significance (LOS) and nominal coverage is taken to be 95%. The estimators have been studied under different degrees of heteroscedasticity, namely

λ

= 0, 1, 2, and 3 as used by Roy (Citation2002).

Tables and show mean and MSE for Schemes I and II, respectively. Intercept is excluded in the WG estimation, therefore it does not appear in these tables and discussion is concentrated only on the slope estimates. Table shows that all the estimators remain almost unbiased and there is no issue of bias under heteroscedasticity. But the OLSE is inefficient for smaller UTVH ( $σ_{v}^{2} = 2$ ) as it yields higher MSEs than the AGLSE. For $σ_{v}^{2} = 2$ , the MSE of OLSE is more than twice of AGLSE and WGE. The WGE performs better than OLSE in terms of MSE but outperformed by AGLSE for $σ_{v}^{2} = 4$ . Due to gain in efficiency, the AGLSE remains an attractive choice. Such results are actually due to Roy (Citation2002). Performance of the OLSE improves for larger UTVH ( $σ_{v}^{2} = 6$ ). For $σ_{v}^{2} = 6$ and $λ = 1$ , the MSE of OLSE is identical to AGLSE. The similar behavior of all the estimators is observed in Table as noticed in Table . The MSE of OLSE decreases with the increase of sample size but it is still less efficient than the AGLSE and WGE for smaller UTVH ( $σ_{v}^{2} = 2$ ).

Table 1. Mean and MSE (N = 50, T = 3)

Display Table

Table 2. Mean and MSE (N = 100, T = 3)

Display Table

Empirical sizes are displayed in Figure at 5% LOS and $σ_{v}^{2} = 2$ . The OLSE curve shows high over-rejection. While the curves produced by the AGLSE and WGE are closer to nominal LOS (5%). The curve of AHC0 shows deviation from nominal level (5%) under mild and moderate heteroscedasticity but becomes closer to 5% under severe heteroscedasticity for small sample. However, the AHC0 gets improvement in performance for large sample. Similar results have been reported by Long and Ervin (Citation2000) for cross-sectional data. The AHC4 and AHC5 curves are closer to the nominal LOS (5%).

Figure 1. Empirical size (%) for β.

Tables and display empirical sizes for Schemes I and II, respectively. In Table , the test based on the OLS variance estimator is largely liberal under smaller UTVH ( $σ_{v}^{2} = 2$ ). It expresses high size distortion for the cases of heteroscedasticity. Under severe heteroscedasticity ( $λ$ = 3), the NRR produced by the OLS variance estimator based quasi-t test is 8.30% at 5% LOS for smaller UTVH ( $σ_{v}^{2} = 2$ ). However, the quasi-t test, based on the OLS variance estimator, gives better NRR for the larger UTVH ( $σ_{v}^{2} = 6$ ). The quasi-t test, based on the AGLS variance estimator, performs better than the test based on the OLS variance estimator. For instance, for $λ = 3$ , the NRR yields by AGLS variance estimator based quasi-t test is 5.14% at 5% LOS for smaller UTVH ( $σ_{v}^{2} = 2$ ). It verifies the reported results of Roy. The quasi-t tests that employ AHCCMEs yield good NRR from smaller UTVH ( $σ_{v}^{2} = 2$ ) to the larger UTVH ( $σ_{v}^{2} = 6$ ). The best NRR among the AHCCMEs, is observed by the tests based on AHC4 and AHC5. In case of severe heteroscedasticity ( $λ = 3$ ), the AHC4 yields exact NRR for $σ_{v}^{2} = 6$ at 5% LOS. The results given by the AHC4 and AHC5 confirm the findings made by Cribari-Neto (Cribari-Neto, Citation2004) and Cribari-Neto et al. (Cribari-Neto et al., Citation2007) for the non-panel data and also justify their formulation for the PDM.

Table 3. NRR of quasi-t test for N = 50, T = 3

Display Table

Table 4. NRR of quasi-t test for N = 100, T = 3

Display Table

In Table , behavior of all the estimators is similar as presented in Table . The tests, based on the AGLSE variance estimator, perform well in terms of NRR as reported by Roy (Roy, Citation2002). Performance of the AHC4 and AHC5 remains attractive and justifies our proposal for the PDM.

Estimation of confidence interval is done as illustrated in Equation (13). For $σ_{v}^{2} = 2$ , empirical coverage is presented in Figure . The OLSE curve exhibits under-coverage, while the curve of AGLSE is closer to the nominal coverage (95%). The curve of AHC0 shows under-coverage for small sample but coverage rate is closer to the nominal coverage (95%) for the large samples. On the other side, the curves of AHC4 and AHC5 are closer to the nominal coverage (95%).

Figure 2. Empirical coverage (%) for β.

For the above-mentioned estimators, Tables and carry empirical coverage and average length for Schemes I and II, respectively. Performance of the OLSE is not satisfactory in for smaller UTVH as it shows under-coverage. However, the OLSE gets improvement in performance from smaller ( $σ_{v}^{2} = 2$ ) to larger UTVH ( $σ_{v}^{2} = 6$ ). While the AGLSE shows remarkable performance for homoscedastic as well as for all types of heteroscedastic cases. The empirical coverage of the WGE is closer to nominal coverage (95%) for all degrees of heteroscedasticity and it outperforms the OLSE. It is noticed that the best empirical coverage among AHCCMEs are produced by our AHC4 and AHC5. The AHC4 exhibits exact coverage for $σ_{v}^{2} = 2$ in case of mild heteroscedasticity ( $λ = 1$ ) and also for larger UTVH ( $σ_{v}^{2} = 6$ ) when $λ = 3$ . The AHC5-based confidence intervals display coverage that is close to the nominal coverage (95%).

Table 5. 95% Confidence interval: coverage (%) and length (N = 50, T = 3)

Display Table

Table 6. 95% Confidence interval: coverage (%) and length (N = 100, T = 3)

Display Table

Performance of the estimators in Table is similar to that observed in Table . For the large samples, Roy’s estimator outperforms the OLSE. Among the AHCCME, AHC4 and AHC5 express very good coverage and average interval length and they remain attractive choice.

Figures – show empirical power curves, built upon all the above mentioned estimators for Scheme I. For $σ_{v}^{2} = 2$ , Figure gives indication that for homoscedastic ( $λ = 0$ ) and heteroscedastic situations ( $λ = 1$ , 2 and 3), all the estimators show identical power of test to that of the AGLSE except OLSE. However, as $σ_{v}^{2}$ increases, the OLSE gets improvement in such a way that for $σ_{v}^{2} = 6$ , all the estimators become near to identical in power of test. Table gives numerical values of the empirical power for a specific case i.e. $σ_{v}^{2} = 2, N = 50, T = 3 .$ Aslam (Citation2006) presented power curve analysis of the above mentioned estimators in context of the PDM and our results verify his findings.

Figure 3. Empirical power at 5% LOS (N = 50, T = 3; $σ_{v}^{2} = 2$ ).

Figure 4. Empirical power at 5% LOS (N = 50, T = 3; $σ_{v}^{2} = 4$ ).

Figure 5. Empirical power at 5% LOS (N = 50, T = 3; $σ_{v}^{2} = 6$ ).

Table 7. Power results (%) for $σ_{v}^{2} = 2, N = 50, T = 3$

Display Table

For all the estimators under consideration, empirical power curves for Scheme II are displayed in Figures –. In case of larger sample, it is noticed that power curves of all the estimators get slumber. Performance of the OLSE is not good for smaller UTVH ( $σ_{v}^{2} = 2$ ) but it becomes closer to the AGLSE for larger UTVH ( $σ_{v}^{2} = 6$ ).

Figure 6. Empirical power at 5% LOS (N = 100, T = 3; $σ_{v}^{2} = 2$ ).

Figure 7. Empirical power at 5% LOS (N = 100, T = 3; $σ_{v}^{2} = 4$ ).

Figure 8. Empirical power at 5% LOS (N = 100, T = 3; $σ_{v}^{2} = 6$ ).

6. Illustrative example

We take an example of panel data of productivity of USA (Munnell, Citation1990) with T = 17 and N = 48. The model of interest is(14) $y_{it} = β_{0} + β_{1} x_{1 i t} + β_{2} x_{2 i t} + β_{3} x_{3 i t} + β_{4} x_{4 i t} + β_{5} x_{5 i t} + β_{6} x_{6 i t} + u_{it}; (i = 1, 2, \dots, 48, t = 1, 2, \dots, 17),$ (14)

where y denotes gross production, x₁ is high way capital, x₂ is water utility capital, x₃ is utility capital, x₄ is private capital, x₅ is employed capital, and x₆ is unemployed capital.

In order to evaluate the testing performance of all the stated estimators, following Cribari-Neto (Citation2004), an extra variable (e.g. $x_{5}^{2}$ ) is being added as an explanatory variable. Thus, Model (14) is reformulated as(15) $y_{it} = β_{0} + β_{1} x_{1 i t} + β_{2} x_{2 i t} + β_{3} x_{3 i t} + β_{4} x_{4 i t} + β_{5} x_{5 i t} + β_{6} x_{6 i t} + β_{7} x_{5 i t}^{2} + u_{it} .$ (15)

The USH is found after the Wald test (with p-value < 0.01). Table displays the comparative statistics obtained from Model (14). The results obtained from fitting of Model (15) are presented in Table . All the regression coefficients are found to be statistically significant while referring to Table . In Model (15), we include square of employed capital as an extra explanatory variable with the expectedly no impact on determining the gross production. Thus, it should be statistically non-significant. We perform the inference again. In this situation, the attractive estimator would be one that does not reject the null hypothesis of β₇ = 0. Table shows that the tests based on only AHC4 and AHC5 do not reject the hypothesis of β₇ = 0 at 1% LOS.

Table 8. Comparative statistic of model (14)

Display Table

Table 9. Comparative statistic of model (15)

Display Table

7. Conclusion

To improve the testing of coefficients of the PDM with the problem of USH, we have used the HCCMEs, based on Roy’s (Citation2002) adaptive estimator. It is found that the AHC4 and AHC5 perform better than all the competing estimators in terms of NRR, power of tests and empirical coverage of interval estimators. On the basis of our findings, the adaptive versions of HCCME are found to be as attractive choice for the testing of PDM as they are for the linear regression models with heteroscedastic errors.

Funding

The authors received no direct funding for this research.

Additional information

Notes on contributors

Afshan Saeed

Muhammad Aslam is a tenured associate professor at the Department of Statistics, Bahauddin Zakariya University, Multan, Pakistan. He acquires a teaching experience of more than 20 years at postgraduate level. His main area of interest runs around the inference of linear regression models with issues of heteroscedasticity and multicollinearity. He has conducted a number of researches in the stated area while leading a research team, comprising of his research students and few colleagues. Recently, this team has developed three R packages, namely “mctest”, “lmridge”, and “liureg” which are available on the R CRAN. These are comprehensive packages for detection of multicollinearity, estimation of the ridge and Liu regression models with different choices of penalties. The present article is a part of PhD research project of Afshan Saeed (the Principal author) under the supervision of Aslam. This article, primarily addresses the inference of panel data model with the issue of unit-specific heteroscedasticity.

References

Ahmed, M., Aslam, M., & Pasha, G. R. (2011). Inference under heteroscedasticity of unknown form using an adaptive estimator. Communications in Statistics – Theory and Methods, 40, 4431–4457.10.1080/03610926.2010.513793
Web of Science ®Google Scholar
Arellano, M. (1987). Computing robust standard errors for within-group estimators. Oxford Bulletin of Economics and Statistics, 49, 431–434.
Web of Science ®Google Scholar
Aslam, M. (2006). Adaptive procedures for estimation of linear regression models with known and unknown heteroscedastic errors ( dissertation). Pakistan: Bahauddin Zakariya University.
Google Scholar
Aslam, M., Riaz, T., & Altaf, S. (2013). Efficient estimation and robust inference of linear regression models in the presence of heteroscedastic errors and high leverage points. Communications in Statistics – Simulation and Computation, 42, 2223–2238.10.1080/03610918.2012.695847
Web of Science ®Google Scholar
Baltagi, B. H., & Griffin, M. (1988). A generalized error component model with heteroscedastic disturbances. International Economic Review, 29, 745–753.10.2307/2526831
Web of Science ®Google Scholar
Baltagi, B. H., Bresson, G., & Pirotte, A. (2004). Adaptive estimation of heteroskedastic error component models. USA: Texas A & M University, Working Paper.
Google Scholar
Cribari-Neto, F. (2004). Asymptotic inference under heteroskedasticity of unknown form. Computational Statistics & Data Analysis, 45, 215–233.10.1016/S0167-9473(02)00366-3
Web of Science ®Google Scholar
Cribari-Neto, F., & Lima, M. G. A. (2009). Heteroskedasticity-consistent interval estimators. Journal of Statistical Computation and Simulation, 79, 787–803.10.1080/00949650801935327
Web of Science ®Google Scholar
Cribari-Neto, F., Souza, T. C., & Vasconcellos, K. L. P. (2007). Inference under heteroskedasticity and leveraged data. Communications in Statistics – Theory and Methods, 36, 1877–1888.10.1080/03610920601126589
Web of Science ®Google Scholar
Eicker, F. (1963). Asymptotic normality and consistency of the least squares estimators for families of linear regressions. The Annals of Mathematical Statistics, 34, 447–456.10.1214/aoms/1177704156
Google Scholar
Greene, W. H. (1997). Econometric analysis (3rd ed.). Upper Saddle River, NJ: Prentice Hall.
Google Scholar
Hoaglin, D. C., & Welsch, R. E. (1978). The hat matrix in regression and ANOVA. American Statistical, 32, 17–22.
Web of Science ®Google Scholar
Li, Q., & Stengos, T. (1994). Adaptive estimation in the panel data error component model with heteroscedasticity of unknown form. International Economic Review, 35, 981–1000.10.2307/2527006
Web of Science ®Google Scholar
Long, J. S., & Ervin, L. H. (2000). Using heteroscedasticity consistent standard errors in the linear regression model. American Statistical, 54, 217–224.
Web of Science ®Google Scholar
Mazodier, P., & Trognon, A. (1978). Heteroscedasticity and stratification in error components models. Anna. de l’Insee, 30–31, 451–482.
Google Scholar
Munnell, A. H. (1990). Why has productivity growth declined? Productivity and public investment. New England: Economic Review.
Google Scholar
Randolph, W. C. (1988). A transformation for heteroksedastic error components regression models. Economics Letters, 27, 349–354.10.1016/0165-1765(88)90161-9
Web of Science ®Google Scholar
Rilstone, P. (1991). Some Monte Carlo evidence on the relative efficiency of parametric and semi parametric EGLS estimators. Journal of Business & Economic Statistics, 9, 179–187.
Web of Science ®Google Scholar
Roy, N. (2002). Is adaptive estimation useful for panel models with heteroscedasticity in the individual specific error component? Some monte carlo evidence Economic Review, 21, 189–203.
Google Scholar
Uchôa, C. F. A., Cribari-Neto, F., & Menezes, T. A. (2014). Testing inference in heteroscedastic fixed effects models. European Journal of Operational Research, 235, 660–670.10.1016/j.ejor.2014.01.032
Web of Science ®Google Scholar
White, H. (1980). A heteroscedasticity-consistent covariance matrix estimator and a direct test for heteroscedasticity. Econometrica, 48, 817–838.10.2307/1912934
Web of Science ®Google Scholar

Improved inference for the panel data model with unknown unit-specific heteroscedasticity: A Monte Carlo evidence