2,732
Views
2
CrossRef citations to date
0
Altmetric
ORIGINAL ARTICLE

Almon-KL estimator for the distributed lag model

ORCID Icon & ORCID Icon
Pages 406-412 | Received 18 May 2021, Accepted 01 Oct 2021, Published online: 14 Oct 2021

Abstract

The Almon technique is widely used to estimate the parameters of the distributed lag model (DLM). The technique suffers a setback from the challenge of multicollinearity because the explanatory variables and their lagged values are often correlated. The Almon-Ridge estimator (A-RE) and Almon-Liu estimator (A-LE) were introduced as alternative estimators for efficient modelling. We developed a new method of estimating the coefficients of the DLM using the Almon-KL estimator (A-KLE). A-KLE dominates the other estimators considered in this study via theoretical findings, simulation design and two numerical examples. The estimators’ performance was compared using the mean squared error.

AMS SUBJECT CLASSIFICATION:

1. Introduction

Distributed lag model (DLM) is a model that is commonly adopted to predict the current values of a response variable based on both the current values of a regressor (explanatory variable) and its lagged values (Cromwell, Hannan, Labys, & Terraza, Citation1994; Judge, Griffiths, Hill, & Lee, Citation1980). For instance, the effect of public investment such as road and highways construction on growth in the gross national product (GNP) will show up with a lag, and this effect will likely linger on for some years (Maddala, Citation1974). Under certain assumptions, the ordinary least squares estimator (OLSE) is the easiest method of estimating the distributed lags’ parameters. The assumptions include assuming a fixed maximum lag; the error should be normally distributed and unrestricted relationship among the lagged explanators. However, multicollinearity arises from the relationship between the explanatory variables and their lagged explanators, resulting in a large variance of the coefficient estimates. The OLSE becomes inefficient when there is multicollinearity. Also, the method suffers a breakdown when the number of observations does not sufficiently exceed the number of lags (Almon, Citation1965; Fisher, Citation1937; Frost, Citation1975; Maddala, Citation1974). Almon technique proposed by Almon (Fisher, Citation1937) is widely used to estimate the parameters of DLM because the methods reduced the effect of multicollinearity (Fair & Jaffee, Citation1971; Majid, Aslam, Altaf, & Amanullah, Citation2019). The assumption under the Almon lag model is that there must be a finite lag length and the lag weight approximated by a polynomial of suitable degree.

The Almon estimator (AE) is the best linear unbiased estimator (BLUE) for the DLMs (Vinod & Ullah, Citation1981). Therefore, this estimator has widespread usage in applied econometrics due to its ease of estimation. However, the estimator still suffers a breakdown when there is a high correlation among the explanatory variables. Biased estimators were developed as alternatives to the OLS estimator in the linear regression model (LRM) (Baye & Parker, Citation1984; Hoerl & Kennard, Citation1970; Kaçıranlar & Sakallıoğlu, Citation2001; Kibria & Lukman, Citation2020; Li & Yang, Citation2012; Liu, Citation1993; Lukman, Ayinde, Kibria, & Adewuyi, Citation2020; Massy, Citation1965; Swindel, Citation1976). Some of these estimators have been adapted to AE to mitigate the effect of multicollinearity.

The ridge estimator was suggested as an alternative to the AE (Chanda & Maddala, Citation1984; Lindley & Smith, Citation1972; Lukman, Ayinde, Binuomote, & Onate, Citation2019; Maddala, Citation1974; Vinod & Ullah, Citation1981; Yeo & Trivedi, Citation2009). The limitation of the ridge regression estimator for the DLMs was pointed out in the following study (Lindley & Smith, Citation1972; Maddala, Citation1974; Yeo & Trivedi, Citation2009). According to Maddala (Citation1974) and Yeo and Trivedi (Citation2009), the empirical findings do not fulfil expectations despite adopting different ridge parameters, k. Alternative estimators include the Almon modified ridge and Liu estimators (Gültay & Kaçıranlar, Citation2015). Özbay & Kaçıranlar (Citation2017) introduced the Almon two-parameter estimator to deal with the problem of multicollinearity for the DLM. The estimator extends the two-parameter estimation procedure by Özkale and Kaçıranlar (Citation2007) to the DLM. The Almon two-parameter ridge was also developed to handle multicollinearity in DLM (Lipovetsky, Citation2006; Lipovetsky & Conklin, Citation2005; Ozbay, Citation2019). Kibria and Lukman (Citation2020) developed the Kibria–Lukman estimator (KLE) to cushion the effect of multicollinearity in the LRM. The estimator gain advantage over the ridge and the Liu estimator. Thus, the focus of this study is to develop the Almon-KL estimator (A-KLE) based on KLE.

The other part of the article is structured as follows: we introduced the DLM, its estimation methods, and the proposed estimator in Section 2. Section 3 deals with the theoretical comparisons and the choice of biasing parameters. We examined the estimators’ performance using a simulation study and two examples in Sections 4 and 5, respectively. Finally, we provide a concluding remark.

2. Distributed lag model (DLM) and method of estimation

The finite DLM is defined as (2.1) yt=β0xt+β1xt1+β2xt2++βpxtp+ut=i=0pβixti+ut,t=p+1,,T,(2.1) where βi are the current and lagged coefficients of xt, yt denotes the t-th observation on the response variable y, xtp denotes the (t-p)th observation on the explanatory variable x, ut is the disturbance term corresponding to the t-th observation and is assumed to be IN(0, σ2) and p denotes the lag length. Model Equation(2.1) can be written in matrix form as (2.2) y=Xβ+u,(2.2) where X=[xp+1xpx1xp+2xp+1x2xTxT1xTp],y=(yp+1yp+2yT),β=[β0β1βp]andu=(up+1up+2uT).

The OLSE of β is (2.3) β̂=(XX)1Xy.(2.3)

The OLS estimator of β is BLUE when there is no violation of the model assumptions (Lukman et al., Citation2019). As earlier mentioned, there is high tendency of multicollinearity among the regressors since lags of the same explanatory variable appear in the model (Güler, Gültay, & Kaçiranlar, Citation2015; Gültay & Kaçıranlar, Citation2015). It becomes difficult to estimate with the conventional OLS when the lag length (p) is known and large (Güler et al., Citation2015; Gültay & Kaçıranlar, Citation2015; Ozbay, Citation2019; Özbay & Kaçıranlar, Citation2017). The Almon polynomial lag distribution was recommended as a replacement to the OLS estimator (Almon, Citation1965). The Almon polynomial lag is defined as (2.4) βi=α0+α1i+α2i2++αsis,ps0.(2.4)

The above equation can simply be written as (2.5) β=Aα,(2.5) where α=[α0α1αs] and A=[1000111112222s1pp2ps] are α:(s+1)×1 vector and A:(p+1)×(r+1) matrix, respectively. The ranks of matrices X and A are assumed to be (p+1)<(Tp) and (s+1)<(p+1), respectively. If s<p then the rank of A is s + 1. We substitute Equation(2.5) into Equation(2.2) and obtained the new equation as, (2.6) y=XAα+u=Zα+u,(2.6) where Z=XA. The OLS estimator of Equation(2.6) becomes the AE in Equation(2.7): (2.7) α̂A=(ZZ)1Zy,(2.7) such that (2.8) β̂A=Aα̂A.(2.8)

EquationEquation (2.8) is referred to as AE of β which is still BLUE. A notable problem with this technique centered on the choice of lag length and the degree of the polynomial, respectively because the two are unknown in practice. One of the suggested means of selecting the lag length is to select a reasonable lag and check if it fit the model Equation(2.1) using any of the following criteria: Akaike information criterion (AIC), Bayesian information criterion (BIC) and adjusted coefficient of determination (R¯2) (Davidson & MacKinnon, Citation1993). The ridge estimator was suggested as an alternative approach due to the limitation of the AE. The Almon-Ridge estimator (A-RE) of β is defined as (2.9) β̂Ak=Aα̂k,k>0(2.9) where α̂k=(ZZ+kI)1Zy. Özbay and Toker (Citation2021) examined the predictive performance of A-RE. According to Gültay & Kaçıranlar (Citation2015), the Almon-Liu estimator (A-LE) is defined as: (2.10) β̂Ad=Aα̂d,(0 <d <1)(2.10) where α̂d=(ZZ+I)1(ZZ+dI)α̂A.

The Liu estimator overcomes the limitation in the ridge estimator for the LRM. Recently, Kibria and Lukman (Citation2020) developed the KL estimator (KLE) to mitigate multicollinearity in the LRM. The estimator is a modification of the Liu estimator; it gained an advantage over the Liu estimator using the mean squared error. The bias of KLE is two times lower than the bias of the ridge estimator. Thus, the KL estimator performs better than the ridge and the Liu estimators. Following this merit, the estimator is adapted to the AE to handle multicollinearity in this study. The A-KLE is defined as (2.11) β̂AKL=Aα̂KL,(2.11) where α̂KL=(ZZ+kI)1(ZZkI)α̂A.

Suppose Z=XQ,α=Qβ, and ZZ=QXXQ=E=diag(e1,,es+1) such that e1e2es+1 are the ordered eigenvalues of ZZ and Q is the matrix whose columns are the eigenvectors of ZZ. and the canonical form of the estimators are as follows: (2.12) α̂A=E1Zy,(2.12) (2.13) α̂k=(E+kI)1Zy,(2.13) (2.14) α̂d=(E+I)1(E+dI)α̂A,(2.14) (2.15) α̂KL=(E+kI)1(EkI)α̂A.(2.15)

The statistical properties of A-KLE are as follows:

The bias of the A-KLE is defined as: (2.16) B(α̂KL)=2kEkα,(2.16) where Ek=(E+kI)1. The variance of the A-KLE is defined as: (2.17) D(α̂KL)=σ2Ek(EkI)E1Ek(EkI).(2.17)

Therefore, the mean square error matrix (MSEM) and the scalar mean square error (SMSE) of A-KLE, respectively, are (2.18) MSEM(α̂KL)=σ2Ek(EkI)E1Ek(EkI)+4k2EkEkαα,(2.18) and (2.19) SMSE(α̂KL)=σ2i=1s+1((eik)2ei(ei+k)2)+4k2i=1s+1(αi2(ei+k)2).(2.19)

3. Theoretical comparisons among the estimators

In this section, we adopted Lemma 3.1 to evaluate the estimators’ performance.

Lemma 3.1.

Let β̂1 and β̂2 be two linear estimators of β. The difference in the covariance of the two estimators D is positive definite if and only if bias(β̂2)[σ2D+bias(β̂1)bias(β̂1)]1bias(β̂2)<1.

Consequently, MSEM(β̂1)MSEM(β̂2)=σ2D+bias(β̂1)bias(β̂1)bias(β̂2)bias(β̂1)>0

(R Core Team, Citation2020; Trenkler & Toutenburg, Citation1990).

3.1. Comparison between α̂A and α̂KL

Theorem 3.1.

The estimator α̂KL is preferred to α̂A in the MSEM sense, if and only if, b[σ2(E1Ek(EkI)E1Ek(EkI))]1b<1 where b=2kEkα.

Proof.

(3.1) D(α˜̂A)D(α̂KL)=[σ2E1σ2Ek(EkI)E1Ek(EkI)]=σ2diag{1ei(eik)2ei(ei+k)2}i=1s+1.(3.1)

It is noted that E1Ek(EkI)E1Ek(EkI) is non-negative (nn) because ei+k>eik for k>0.

3.2. Comparison between α̂k and α̂KL

Theorem 3.2.

The estimator α̂KL is preferred to α̂k in the MSEM sense, if and only if, b[σ2(EkEEkEk(EkI)E1Ek(EkI))+b˜1b1]1b<1 where b=2kEkα and b1=kEkα.

Proof.

(3.2) D(α̂k)D(α̂KL)=[σ2(EkEEkEk(EkI)E1Ek(EkI))]=σ2diag{ei(ei+k)2(eik)2ei(ei+k)2}i=1s+1.(3.2)

It is noted that EkEEkEk(EkI)E1Ek(EkI) is non-negative because ei2(eik)2>0 for k<2ei.

3.3. Comparison between α̂d and α̂KL

Theorem 3.3.

The estimator α̂KL is preferred to α̂d in MSEM sense if and only if, b[σ2(EdE1EdEk(EkI)E1Ek(EkI))+b2b2]1b<1 where b=2kEkα and b˜2=(1d)(E+I)1α.

Proof.

(3.3) D(α̂d)D(α̂KL)=[σ2(EdE1EdEk(EkI)E1Ek(EkI))]=σ2diag{(ei+d)2ei(ei+1)2(eik)2ei(ei+k)2}i=1s+1.(3.3)

It is noted that EdE1EdEk(EkI)E1Ek(EkI) is non-negative since (ei+k)2(ei+d)2(eik)2(ei+1)2>0 for k>ei(1d)d+ei+1 and d>ei(1k)kei+k.

4. Monte Carlo simulation study

We conducted a simulation study for DLM to explore the estimators’ performance. The dependent variable and explanatory variables are generated according to the following study (Güler et al., Citation2015; Özbay & Kaçıranlar, Citation2017; Özbay & Toker, Citation2021). (4.1) yt=β0xt+β1xt1+βpxtp+ut,(4.1) (4.2) x1=v1,(4.2) (4.3) xt=τxt1+1τ2vtt2,(4.3) whereτ denotes the correlation between the regressors and its values are chosen as follows: 0.8, 0.9, 0.99, and 0.999. The MSE is minimized subject to the constraint ββ=1 (Güler et al., Citation2015; Özbay & Kaçıranlar, Citation2017; Özbay & Toker, Citation2021). ut and vt are generated such that utN(0,σ2) and vtN(0,1), respectively. The choices of σ2 are 1, 5 and 10. The data for each replication are determined in accordance with the following studies (Güler et al., Citation2015; Özbay & Kaçıranlar, Citation2017; Özbay & Toker, Citation2021). An observation of T-p = 60 and 100 with the lag length of 8 and 16 was evaluated. The experiment is repeated 2000 times using the RStudio programming language (R Core Team, Citation2020). The model (4.1) is transformed back to the Almon model Equation(2.6) after generating the observations. The mean squared error is obtained as follows: (4.4) MSE(β̂)=12000j=12000(β̂ijβi)(β̂ijβi)(4.4) where β̂ij denotes the estimate of the ith parameter in jth replication and βi is the true parameter values. The simulated MSE values are presented in and for lag length are 8 and 16, respectively.

Table 1. Estimated MSE’s when lag length is 8.

Table 2. Estimated MSE’s when lag length is 16.

From and , we observed the following trend about the mean squared error. The mean squared error increases as the following increases: level of multicollinearity, lag length, and the error variance. The mean squared error decreases as the sample size increase. This trend is more feasible considering . The results show that AE suffers setback when the regressors are correlated. The biased estimators provide a more consistent estimate when there is multicollinearity. However, the A-KLE produces the best result by producing smaller mean squared error among all the estimators under study.

Figure 1. Graphical illustration of the simulation result.

Figure 1. Graphical illustration of the simulation result.

5. Applications

To illustrate the theoretical findings of the paper, we consider the two real datasets.

5.1. Application example I

The Almon dataset is employed to illustrate the performances of each of the estimators (Almon, Citation1965). These data are quarterly data that spanned from 1953 to 1967 with expenditures as the response variable and appropriations as the independent variable. Following Güler et al. (Citation2015), the lag length is taken to be 8 and the lag weight approximated by a polynomial of order 2. The eigenvalues of ZZ are computed to be 2.388929e + 13, 1.628323e + 09, 4.312742e + 07, and 4.782067e + 00, respectively. The condition index of CI=max(ei)/min(ei) is determined to be 2235083, which revealed the presence of severe multicollinearity among the explanatory variable and its lagged variables. The library dLagM in RStudio is used to analyse the data. The biasing parameter for A-RE and A-KLE is computed using: (5.1) k̂=(s+1)σ̂2i=1s+1α̂i2.(5.1)

The biasing parameter for the A-LE is obtained using the following: (5.2) d̂=min(σ̂2α̂i2+σ̂2ei).(5.2)

The regression parameter before transformation is presented in as follows:

By canonical transformation, the 4×4 matrix Q˜ is the matrix of the eigenvectors, E is a 4×4 diagonal matrix of eigenvalues of ZZ such that ZZ=QEQ. The Almon-estimators of α are presented in . The result in supports the fact that the Almon-estimator suffers set back when there is multicollinearity. The regression coefficient of the biased estimators appears very similar. The A-KLEr has the lowest mean squared error followed by the A-RE.

Table 3. Estimates and t-tests for beta coefficients.

Table 4. Estimates for the Almon coefficients.

Table 5. Estimates and t-tests for beta coefficients.

5.2. Application example II

This dataset is composed of monthly global mean sea level (GMSL) (compared to 1993–2008 average) series by CSIRO, land ocean temperature anomalies (1951–1980 as a baseline period) by GISS, NASA, and monthly Southern Oscillation Index (SOI) by Australian Government Bureau of Meteorology (BOM) between July 1885 and June 2013 (Church & White, Citation2011; Demirhan, Citation2020). In this study, the response and the explanatory variables were chosen to be GMSL and land ocean temperature anomalies, respectively. The dataset is available in the R package data (seaLevelTempSOI). More detail on the data description is available in the study of Demirhan (Citation2020). The lag length is taken to be 4 and the lag weight approximated by a polynomial of order 2 (Demirhan, Citation2020). The eigenvalues of ZZ are computed to be 6607.86, 1590.99, 67.12 and 6.96, respectively. The condition number is 948.82 which shows that the explanatory variable and its lagged variables are correlated. The regression parameter before canonical transformation is presented in Table 5.

By canonical transformation, the Almon-estimators of α are in . From , the Almon-estimator for the second lag has a positive sign as opposed a negative sign by other estimators. As mentioned earlier, one of the effects of multicollinearity on the AE is that the coefficients can occasionally exhibit a wrong sign. The results show that the new estimator has the lowest mean squared error. This result agrees with the theoretical and simulation result.

Table 6. Estimates for the Almon coefficients.

6. Some concluding remarks

The DLM is often adopted to predict the current values of a response variable based on both the current values of a regressor and its lagged values. Multicollinearity arises from the relationship between the regressors and their lagged explanators. The AE is popularly used to estimate the parameters of the models. However, the estimator suffers a setback when there is multicollinearity. The A-RE and A-LE estimators were suggested as alternative techniques. In this study, we developed the A-KLE to mitigate the threat of multicollinearity for the DLM. We theoretically proved that A-KLE dominates AE, A-RE and A-LE. The theoretical findings were supported by the results of the simulation study and two numerical examples.

Acknowledgements

This paper was written and completed while the first author visited Professor B. M. Golam Kibria at Florida International University from April 1 to 30, 2021. The International Mathematical Union sponsored this trip.

Disclosure statement

No potential conflict of interest was reported by the authors.

References