206
Views
0
CrossRef citations to date
0
Altmetric
Research Articles

Nonparametric estimation of additive models with errors-in-variables

, & ORCID Icon
 

Abstract

In the estimation of nonparametric additive models, conventional methods, such as backfitting and series approximation, cannot be applied when measurement error is present in a covariate. This paper proposes a two-stage estimator for such models. In the first stage, to adapt to the additive structure, we use a series approximation together with a ridge approach to deal with the ill-posedness brought by mismeasurement. We derive the uniform convergence rate of this first-stage estimator and characterize how the measurement error slows down the convergence rate for ordinary/super smooth cases. To establish the limiting distribution, we construct a second-stage estimator via one-step backfitting with a deconvolution kernel using the first-stage estimator. The asymptotic normality of the second-stage estimator is established for ordinary/super smooth measurement error cases. Finally, a Monte Carlo study and an empirical application highlight the applicability of the estimator.

Acknowledgment

The authors would like to thank anonymous referees for helpful comments.

Notes

1 If ϕX* is treated as the true underlying covariate and ϕ0, we have Yμm1(Z1)mD(ZD)U=g˜(ϕX*), with g˜(·)=g(·/ϕ). Then the normalization required in Assumption 1 (6) becomes I˜g˜(w)dw=0, with I˜={w˜=ϕw:wI}. Since I is the range of X* chosen by researchers, instead of I˜ (which is unknown without a specific value of ϕ), we may directly decide the range of interest of ϕX* based on limited knowledge of ϕ (for example, a set of possible values of ϕ).

2 Even though it seems somewhat different, the Sobolev condition imposed here is essentially equivalent to the one used in Meister (Citation2009, eq. (2.30)), which imposes |fft(t)|2|t|2αdt<c. First, it is easy to see that |fft(t)|2(1+|t|2)αdt<c implies |fft(t)|2|t|2αdt<c. For the other direction, we have |fft(t)|2(1+|t|2)αdt2α|t|1|fft(t)|2dt+2α|fft(t)|2|t|2αdt<c, where the first inequality follows by 2α|t|2α(1+|t|2)α|t|1, and the second inequality follows by fL2(R) and Meister (Citation2009, eq. (2.30)).

3 We set α = 2 because Han and Park (Citation2018) assumed that fj is twice continuously differentiable in their Assumption K4. See Meister (Citation2009, Section A.2) for the relationship between order of differentiability and choice of α.

4 Even though Han and Park (Citation2018) considered a different setup where all covariates are mismeasured, the convergence rate of their smoothed backfitting method would remain the same when only one covariate is mismeasured, as it is independent of their number of covariates d. This is a natural result when the regression function has an additive structure. Therefore, the uniform convergence rate presented in Han and Park (Citation2018, Corollary 3.5) can be directly compared to Theorem 2 when the smoothing parameters α and β are the same.

5 We note that if β>1/2, then fϵ is bounded and continuous.

6 Similarly to the previous case, here we set α = 2 because Horowitz and Mammen (Citation2004) assume that mj is twice continuously differentiable in their Assumption A2.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.