Statistical Theory and Related Fields Volume 3, 2019 - Issue 2

Nearest neighbour imputation under single index models

Jun Shao, School of Statistics, East China Normal University, Shanghai, People's Republic of China; Department of Statistics, University of Wisconsin, Madison, WI, USA

Lei Wang, School of Statistics and Data Science & LPMC, Nankai University, Tianjin, People's Republic of China (corresponding author)

Pages 208-212 | Received 14 Sep 2019, Accepted 30 Sep 2019, Published online: 11 Oct 2019

https://doi.org/10.1080/24754269.2019.1675409

ABSTRACT

A popular imputation method used to compensate for item nonresponse in sample surveys is the nearest neighbour imputation (NNI) method, which uses a covariate to define neighbours. When the covariate is multivariate, however, NNI suffers from the well-known curse of dimensionality and gives unstable results. As a remedy, we propose a single-index NNI when the conditional mean of the response given covariates follows a single index model. For estimating the population mean or quantiles, we establish the consistency and asymptotic normality of the single-index NNI estimators. Some limited simulation results are presented to examine the finite-sample performance of the proposed estimator of the population mean.

KEYWORDS:

Asymptotic normality
curse of dimensionality
imputation
mean
quantiles
SAVE

1. Introduction

Let $P$ be a finite population containing N units indexed by i, $y_{i}$ be a univariate outcome or response of interest from unit $i \in P$ , $x_{i}$ be a covariate vector associated with $y_{i}$ , and let $S \subset P$ be a sample of size n taken from $P$ according to some sampling design. We consider the situation where $x_{i}$ is always observed if $i \in S$ but $y_{i}$ is subject to nonresponse, i.e., $y_{i}$ is observed if and only if $i \in R \subset S$ . In sample surveys, imputation is commonly applied to compensate for nonresponse (Kalton & Kasprzyk, 1986; Rubin, 1987; Sedransk, 1985). The nearest neighbour imputation (NNI) method imputes a missing $y_{j}$ by $y_{l}$ , where $l \in R$ is the nearest neighbour of j in the sense that $d (x_{j}, x_{l}) = \min_{i \in R} d (x_{i}, x_{j})$ and $d (x_{i}, x_{j})$ is a distance between $x_{i}$ and $x_{j}$ , e.g., the Euclidean distance. It is a popular method in many survey agencies and has a long history of applications in surveys such as the Census 2000 and the Current Population Survey conducted by the U.S. Census Bureau (Farber & Griffin, 1998; Fay, 1999), the Job Openings and Labor Turnover Survey and the Employee Benefits Survey conducted by the U.S. Bureau of Labor Statistics (Montaquila & Ponikowski, 1993), and the Unified Enterprise Survey, the Survey of Household Spending, and the Financial Farm Survey conducted by Statistics Canada (Rancourt, 1999).

The NNI method has some nice features. First, imputed values are actually occurring y-values, not constructed values; they may not be perfect substitutes, but are unlikely to be nonsensical values. Second, the NNI method may be more efficient than imputation not using x-values, such as mean imputation or random imputation, when x provides useful auxiliary information. Third, the NNI method does not assume a parametric regression model between y and x and, hence, it is more robust against model violations than ratio or regression imputation based on a linear regression model. Finally, under some conditions NNI estimators (i.e., estimators calculated using standard formulas and treating nearest neighbour imputed values as observed data) are asymptotically valid not only for moments of $y_{i}$ but also for the distribution and quantiles of $y_{i}$ , which is an advantage over other non-random imputation methods (such as mean, ratio or regression imputation) that lead to valid moment estimators only.

For a univariate covariate $x_{i}$ , some asymptotic properties of NNI are established in Chen and Shao (2000, 2001) and Shao and Wang (2008). When $x_{i}$ is multivariate, however, NNI runs into the curse of dimensionality problem. The purpose of this paper is to propose a single-index NNI method for multivariate $x_{i}$ and derive its asymptotic properties, under the following single index model assumption:

(A1) The population $P$ can be partitioned into K sub-populations, $P_{1}, \dots, P_{K}$ , such that within each $P_{k}$ , the $(x_{i}, y_{i})$ 's are independent and identically distributed (i.i.d.) from a superpopulation with $E (y_{i} | x_{i}) = μ_{k} (β_{k}^{'} x_{i}),$ where $β_{k}^{'}$ is the transpose of an unknown parameter vector $β_{k}$ with the same dimension as $x_{i}$ and $μ_{k} (\cdot)$ is an unspecified function, $k = 1, \dots, K$ .

Imputation for nonrespondents is typically done within each $P_{k}$ and, hence, the $P_{k}$ 's are often referred to as imputation classes. They are usually constructed using a categorical variable whose values are observed for all sampled units; for example, under stratified sampling, strata or unions of strata are often used as imputation classes. Each imputation class should contain a large number of sampled units. When there are many strata of small sizes, imputation classes are often obtained through poststratification (Valliant, 1993) and/or combining small strata. The superpopulation assumption on $(x_{i}, y_{i})$ within each imputation class ensures exchangeability of units within each $P_{k}$ . The single index model assumption is a semiparametric assumption, since $μ_{k}$ is unspecified.

Details of the proposed method are presented in Section 2, where we also show that estimators based on single-index NNI are consistent and asymptotically normal under some limiting process as the sample size n increases to infinity. To complement the theory, some simulation results are presented in Section 3 to examine the finite sample performance of proposed estimators.

2. Method and theory

We consider one stage sampling without clusters. Let $w_{i}$ be the survey weight for unit $i \in P$ , which is equal to the inverse of the probability that unit i is selected, a known quantity from the sampling design. When there is no nonresponse, a simple and popular estimator of the unknown population total $Y = \sum_{i \in P} y_{i}$ is the Horvitz–Thompson estimator $\hat{Y} = \sum_{i \in S} w_{i} y_{i}$ , which has the unbiasedness property

$E_{s} (\hat{Y}) = E_{s} (\sum_{i \in S} w_{i} y_{i}) = \sum_{i \in P} y_{i} = Y, \qquad (1)$

where $E_{s}$ is the expectation with respect to sampling. If the total number of units in $P$ , N, is known, then the population mean $Y / N$ is estimated by $\hat{Y} / N$ . If N is unknown, then Y/N is estimated by $\hat{Y} / \hat{N}$ , where $\hat{N} = \sum_{i \in S} w_{i}$ satisfies $E_{s} (\hat{N}) = N$ .
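As a small illustration of the estimators just described, the following sketch computes the Horvitz–Thompson total and the two versions of the mean estimator. The function names and the toy numbers are hypothetical and are not taken from the paper.

```python
import numpy as np

def horvitz_thompson_total(y, w):
    """Horvitz-Thompson estimator of the total: sum of w_i * y_i over the sample."""
    y = np.asarray(y, dtype=float)
    w = np.asarray(w, dtype=float)
    return float(np.sum(w * y))

def estimated_mean(y, w, N=None):
    """Estimate Y/N by Y_hat/N if N is known, otherwise by Y_hat/N_hat with N_hat = sum of w_i."""
    total = horvitz_thompson_total(y, w)
    denom = float(N) if N is not None else float(np.sum(w))
    return total / denom

# Toy sample (made-up numbers): n = 4 units, each selected with probability 4/20
y = [2.0, 4.0, 6.0, 8.0]
w = [5.0, 5.0, 5.0, 5.0]                  # w_i = 1 / (4/20)
total_hat = horvitz_thompson_total(y, w)
mean_known_N = estimated_mean(y, w, N=20)
mean_unknown_N = estimated_mean(y, w)     # uses N_hat = 20 here
```

With equal weights the two mean estimators coincide; they differ in general when the weights do not sum to N.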

The most important population parameter in a survey study concerning a variable y is the population mean. Estimation of population quantiles has also become more and more important in modern survey studies. For income variables, for example, the median income or other quantiles could be as important as the mean income. In children with cystic fibrosis, the 10th percentiles of height and weight are important clinical boundaries between healthy and possibly nutritionally compromised patients (Kosorok, 1999). Let $I (y_{i} \leq t)$ be the indicator of $y_{i} \leq t$ for any fixed value t. Using property (1) with $y_{i}$ replaced by $I (y_{i} \leq t)$ , we obtain an approximately unbiased estimator $\sum_{i \in S} w_{i} I (y_{i} \leq t) / \hat{N}$ of the population cumulative distribution of $y_{i}$ at t, which further leads to an approximately unbiased estimator of any quantile of the distribution of $y_{i}$ .
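The weighted distribution estimator and a quantile obtained from it can be sketched as follows. The inversion rule used here (the smallest observed value whose estimated distribution function reaches p) is one common convention and is an assumption of this sketch; the function names are hypothetical.

```python
import numpy as np

def weighted_cdf(y, w, t):
    """sum_i w_i I(y_i <= t) / N_hat, with N_hat = sum_i w_i."""
    y = np.asarray(y, dtype=float)
    w = np.asarray(w, dtype=float)
    return float(np.sum(w * (y <= t)) / np.sum(w))

def weighted_quantile(y, w, p):
    """Smallest observed y-value t with weighted_cdf(y, w, t) >= p (assumed convention)."""
    y = np.asarray(y, dtype=float)
    w = np.asarray(w, dtype=float)
    order = np.argsort(y)
    cum = np.cumsum(w[order]) / np.sum(w)      # estimated CDF at the sorted y-values
    idx = int(np.searchsorted(cum, p, side="left"))
    return float(y[order][min(idx, len(y) - 1)])

# Toy data (made-up numbers)
y = [1.0, 2.0, 3.0, 4.0]
w = [1.0, 1.0, 2.0, 4.0]
cdf_at_2 = weighted_cdf(y, w, 2.0)
median_hat = weighted_quantile(y, w, 0.5)
```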

When $y_{i}$ has nonresponse, however, the previously discussed estimators cannot be used. Imputation is a popular technique to handle nonresponse. It fills in a value for every nonrespondent $y_{j}$ , such that an unbiased or approximately unbiased estimator can be obtained using the formula for the situation of no nonresponse with imputed values treated as observed values. That is, if ${\hat{y}}_{j}$ is an imputed value for nonrespondent $y_{j}$ , then our estimator of the population total Y is

${\hat{Y}}_{I} = \sum_{i \in R} w_{i} y_{i} + \sum_{j \in \bar{R}} w_{j} {\hat{y}}_{j}, \qquad (2)$

where $R$ and $\bar{R}$ are the sets of respondents and nonrespondents, respectively, in the sample $S = R \cup \bar{R}$ .

Under (A1), we consider NNI within each imputation class and independently across imputation classes. For a multivariate $x_{i}$ , if $β_{k}$ in (A1) is known, we can apply a single-index NNI by defining the distance between $x_{i}$ and $x_{j}$ as $| β_{k}^{'} x_{i} - β_{k}^{'} x_{j} |$ , to avoid the curse of dimensionality issue in multivariate NNI. As $β_{k}$ is generally unknown, we can first estimate $β_{k}$ by ${\hat{β}}_{k}$ using a nonparametric method such as the sliced inverse regression (SIR) proposed by Li and Duan (1991) or the sliced average variance estimation (SAVE) proposed by Cook and Weisberg (1991), and then apply single-index NNI using $| {\hat{β}}_{k}^{'} x_{i} - {\hat{β}}_{k}^{'} x_{j} |$ as the distance between $x_{i}$ and $x_{j}$ , i.e., a nonrespondent $y_{j}$ in imputation class k is imputed by ${\hat{y}}_{j} = y_{l}$ with l satisfying

$| {\hat{β}}_{k}^{'} x_{l} - {\hat{β}}_{k}^{'} x_{j} | = \min_{i \in R \cap P_{k}} | {\hat{β}}_{k}^{'} x_{i} - {\hat{β}}_{k}^{'} x_{j} | . \qquad (3)$

After imputation, the population total Y is estimated by ${\hat{Y}}_{I}$ in (2) with ${\hat{y}}_{j}$ defined by (3). The population cumulative distribution of $y_{i}$ at any t is estimated by

${\hat{F}}_{I} (t) = \frac{1}{\hat{N}} \{\sum_{i \in R} w_{i} I (y_{i} \leq t) + \sum_{j \in \bar{R}} w_{j} I ({\hat{y}}_{j} \leq t)\},$

regardless of whether N is known or unknown (to ensure that the estimate $\to 1$ when $t \to \infty$ ).
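The imputation rule (3) and the resulting estimators can be sketched in a few lines for a single imputation class (K = 1). This is a minimal illustration under stated assumptions, not the authors' implementation: `beta_hat` is taken as given, ties are broken by taking the first minimiser, and all function names and toy numbers are hypothetical. With several imputation classes, the same function would be applied separately within each class.

```python
import numpy as np

def single_index_nni(x, y, a, beta_hat):
    """Impute each missing y_j by y_l, where respondent l minimises |beta_hat'x_l - beta_hat'x_j|.

    x: (n, p) covariates; y: (n,) responses (entries with a == 0 are ignored);
    a: (n,) response indicators (1 = observed); beta_hat: (p,) index direction.
    """
    x = np.asarray(x, dtype=float)
    a = np.asarray(a).astype(bool)
    y_imp = np.array(y, dtype=float)
    index = x @ np.asarray(beta_hat, dtype=float)
    resp = np.flatnonzero(a)
    miss = np.flatnonzero(~a)
    if resp.size == 0:
        raise ValueError("at least one respondent is required")
    # |index_j - index_i| for every nonrespondent j (rows) and respondent i (columns)
    dist = np.abs(index[miss, None] - index[None, resp])
    donors = resp[np.argmin(dist, axis=1)]     # first minimiser on ties (assumed rule)
    y_imp[miss] = y_imp[donors]
    return y_imp

def imputed_total_and_cdf(y_imp, w, t):
    """Y_hat_I as in (2) and F_hat_I(t), treating imputed values as observed."""
    y_imp = np.asarray(y_imp, dtype=float)
    w = np.asarray(w, dtype=float)
    total = float(np.sum(w * y_imp))
    cdf = float(np.sum(w * (y_imp <= t)) / np.sum(w))
    return total, cdf

# Toy data (made-up numbers): the last unit is a nonrespondent with index 2.1
x = np.array([[0.0, 0.0], [1.0, 1.0], [2.0, 2.0], [0.9, 1.2]])
y = np.array([10.0, 20.0, 30.0, np.nan])
a = np.array([1, 1, 1, 0])
y_imp = single_index_nni(x, y, a, beta_hat=[1.0, 1.0])
total_I, cdf_I = imputed_total_and_cdf(y_imp, w=[2.0, 2.0, 2.0, 2.0], t=20.0)
```

In the toy data the nonrespondent has index 2.1, so its donor is the respondent with index 2. Only the ordering of the index differences matters, so rescaling `beta_hat` or flipping its sign selects the same donors.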

To consider asymptotic properties of estimators based on single-index NNI, we assume that the finite population $P$ is a member of a sequence of finite populations indexed by ν. All limiting processes in this paper are understood to be as $ν \to \infty$ . We need the following assumptions in addition to (A1).

(A2) The size of $P_{k}$ and the sample size of $S \cap P_{k}$ increase to infinity as $ν \to \infty$ , while the number of sub-populations, K, is fixed.
(A3) There is a fixed constant c>0 (not depending on ν) such that $\max_{i \in P} \frac{n w_{i}}{N} \leq c \quad \text{and} \quad \frac{n}{N^{2}} E_{s} {(\sum_{i \in S} w_{i})}^{2} \leq c .$

Recall that N is the size of $P$ and n is the sample size. The first condition in (A3) ensures that none of the weights $w_{i}$ 's is disproportionately large (see Krewski & Rao, 1981). The second condition in (A3) means that the sampling variance of $\sum_{i \in S} w_{i} / N$ is at most of the order $n^{- 1}$ . These conditions are typically satisfied, e.g., they are satisfied under stratified simple random sampling designs.

Let $a_{i}$ be the response indicator, i.e., $a_{i} = 1$ if $y_{i}$ is observed and $a_{i} = 0$ if $y_{i}$ is a nonrespondent.

(A4) Within each $P_{k}$ , $(x_{i}, y_{i}, a_{i})$ 's are i.i.d. from a superpopulation with $E (y_{i}^{8}) < \infty$ , $(x_{i}, y_{i}, a_{i})$ 's from different imputation classes are independent, and sampling is independent of the superpopulation.
(A5) Within each $P_{k}$ , under the superpopulation, $P (a_{i} = 1 | x_{i}, y_{i}, k) = P (a_{i} = 1 | x_{i}, k) > 0$ , which is continuous in $x_{i}$ .
(A6) Within each $P_{k}$ , the conditional distribution of $x_{i}$ given $a_{i}$ has a bounded and continuous Lebesgue density and $μ_{k} (\cdot)$ in (A1) is a differentiable function.
(A7) Within each $P_{k}$ , $q_{k, i} (γ) = P (| γ^{'} x - γ^{'} x_{i} | = \min_{j \in R_{k}} | γ^{'} x - γ^{'} x_{j} | \mid X_{k}, R_{k}, S_{k})$ is differentiable with respect to γ, where P is with respect to x under the superpopulation, $S_{k} = S \cap P_{k}$ , $R_{k} = R \cap P_{k}$ , and $X_{k} = \{x_{i} : i \in R_{k}\}$ .
(A8) For each k, $n^{1 / 2} ({\hat{β}}_{k} - β_{k}) = n^{- 1 / 2} \sum_{i \in S \cap P_{k}} φ (x_{i}, y_{i}, a_{i}) + o_{p} (1)$ , where φ is a function satisfying $E \{φ (x_{i}, y_{i}, a_{i})\} = 0$ and $E \{φ (x_{i}, y_{i}, a_{i})\}^{2} < \infty$ , and $o_{p} (1)$ denotes a term converging to 0 in probability.

Because of (A4), NNI is carried out within each $S \cap P_{k}$ . (A5) assumes that, within an imputation class, the nonresponse mechanism is covariate-dependent (Little, 1995) or unconfounded (Lee, Rancourt, & Särndal, 1994), an assumption made for the validity of many other popular imputation methods. This actually is the main reason to construct imputation classes, in addition to the exchangeability of $(x_{i}, y_{i})$ 's. Although $(x_{i}, y_{i}, a_{i})$ 's within an imputation class are i.i.d., the nonresponse mechanism is still not completely at random, since $P (a_{i} = 1 | x_{i}, k)$ depends on the covariate $x_{i}$ . Finally, (A8) is satisfied if ${\hat{β}}_{k}$ is obtained using SIR (Li & Duan, 1991) or SAVE (Cook & Weisberg, 1991).
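To make the estimation step for ${\hat{β}}_{k}$ concrete, here is a minimal sketch of a basic sliced inverse regression estimate of a single direction, written as the method is commonly described in textbooks. It is not claimed to be the variant used by the authors, and the simulation in Section 3 uses SAVE rather than SIR. The function name `sir_direction`, the default of 10 slices, and the use of equal-size slices are assumptions of this sketch; in the missing-data setting it would be applied to the respondents in one imputation class.

```python
import numpy as np

def sir_direction(x, y, n_slices=10):
    """Estimate one index direction by basic sliced inverse regression (unit-norm output)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n, p = x.shape
    xc = x - x.mean(axis=0)
    cov = np.cov(xc, rowvar=False)
    # Inverse symmetric square root of the sample covariance, used to standardise x
    evals, evecs = np.linalg.eigh(cov)
    inv_sqrt = evecs @ np.diag(evals ** -0.5) @ evecs.T
    z = xc @ inv_sqrt
    # Slice the observations by the order of y into roughly equal-size groups
    m = np.zeros((p, p))
    for idx in np.array_split(np.argsort(y), n_slices):
        if len(idx) == 0:
            continue
        zbar = z[idx].mean(axis=0)
        m += (len(idx) / n) * np.outer(zbar, zbar)
    # Leading eigenvector of the weighted covariance of slice means
    _, vecs = np.linalg.eigh(m)
    eta = vecs[:, -1]
    beta = inv_sqrt @ eta                      # back to the original x scale
    return beta / np.linalg.norm(beta)
```

The direction is identified only up to sign and scale, which is harmless for single-index NNI because the donor chosen by $| {\hat{β}}_{k}^{'} x_{i} - {\hat{β}}_{k}^{'} x_{j} |$ does not change when ${\hat{β}}_{k}$ is rescaled or its sign is flipped.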

The following is our main theoretical result.

Theorem

Assume (A1)–(A8). Let ${\hat{Y}}_{I}$ be defined by (2) with imputed ${\hat{y}}_{j}$ based on single-index NNI. Then $\sqrt{n} (\frac{{\hat{Y}}_{I}}{N} - \frac{Y}{N}) / σ \to_{d} N (0, 1)$ for some $σ > 0,$ where $\to_{d}$ is convergence in distribution unconditionally with respect to the superpopulation and sampling.

Similar results can be obtained for ${\hat{F}}_{I} (t)$ at any t and for the quantiles derived from ${\hat{F}}_{I}$ .

Proof of Theorem

The proof follows the same argument as in Shao and Wang (2008). Since variables are independent across imputation classes and imputation is carried out within each imputation class, it suffices to show the result within each imputation class or, equivalently, the result when K = 1. We now drop the subscript k in this proof. Let $S$ , $R$ and $X$ be defined as before with subscript k dropped. Then $E ({\hat{y}}_{i} | X, R, S) = \sum_{i \in R} q_{i} (\hat{β}) y_{i},$ where $q_{i} (\hat{β})$ is the probability that $i \in R$ is selected as the nearest neighbour of a nonrespondent and $q_{i} (β)$ is defined in (A7) with subscript k dropped. Define ${\hat{μ}}_{I} = {\hat{Y}}_{I} / N$ , $μ = Y / N$ , $μ_{1} = E (y_{i} | a_{i} = 1)$ , $μ_{0} = E (y_{i} | a_{i} = 0)$ , $p = P (a_{i} = 1)$ , ${\bar{w}}_{i} = w_{i} / N$ , ${\hat{e}}_{i} = {\hat{y}}_{i} - \sum_{i \in R} q_{i} (\hat{β}) y_{i}$ , $Q_{1} = \sum_{i \in \bar{R}} {\bar{w}}_{i} {\hat{e}}_{i}$ , ${\hat{Q}}_{2} = \sum_{i \in R} {\bar{w}}_{i} [y_{i} - μ ({\hat{β}}^{'} x_{i})] + (1 - p) \sum_{i \in R} q_{i} (\hat{β}) [y_{i} - μ ({\hat{β}}^{'} x_{i})]$ , ${\hat{Q}}_{3} = \sum_{i \in R} {\bar{w}}_{i} [μ ({\hat{β}}^{'} x_{i}) - μ_{1}]$ , ${\hat{Q}}_{4} = \sum_{i \in \bar{R}} [{\bar{w}}_{i} - (1 - p)] \sum_{i \in R} q_{i} (\hat{β}) [y_{i} - μ ({\hat{β}}^{'} x_{i})] + \sum_{i \in \bar{R}} {\bar{w}}_{i} [\sum_{i \in R} q_{i} (\hat{β}) μ ({\hat{β}}^{'} x_{i}) - μ_{0}]$ , $Q_{5} = (μ_{1} - μ_{0}) \sum_{i \in S} {\bar{w}}_{i} (a_{i} - p)$ and $Q_{6} = μ (\sum_{i \in S} {\bar{w}}_{i} - 1)$ . Also, for l = 2, 3, 4, define $Q_{l}$ to be ${\hat{Q}}_{l}$ with $\hat{β}$ replaced by β. Then

$\begin{aligned} {\hat{μ}}_{I} - μ & = Q_{1} + {\hat{Q}}_{2} + {\hat{Q}}_{3} + {\hat{Q}}_{4} + Q_{5} + Q_{6} \\ & = Q_{1} + Q_{2} + Q_{3} + Q_{4} + Q_{5} + Q_{6} \\ & \quad + ({\hat{Q}}_{2} - Q_{2}) + ({\hat{Q}}_{3} - Q_{3}) + ({\hat{Q}}_{4} - Q_{4}) . \end{aligned}$

It is shown in Shao and Wang (2008) that each $n^{1 / 2} Q_{l}$ is an approximately linear function of random variables converging in distribution to a normal distribution with mean 0. Under (A6)–(A8) and Taylor expansions, we can show that each ${\hat{Q}}_{l} - Q_{l}$ , l = 2, 3, 4, can be approximated by a linear function of random variables converging in distribution to a normal distribution with mean 0. Hence, the result follows by repeatedly applying Lemma 1 in Schenker and Welsh (1988).

3. Simulation results

A simulation study is performed to examine the finite sample performance of ${\hat{μ}}_{I} = {\hat{Y}}_{I} / N$ with ${\hat{Y}}_{I}$ defined in (2) and $w_{i} = N / n$ . With a sample of size n = 200 or 500, data $(x_{1}, y_{1}, a_{1}), \dots, (x_{n}, y_{n}, a_{n})$ are i.i.d. generated as follows. First, a three-dimensional covariate vector $x_{i}$ is generated from the multivariate normal distribution with mean vector $(1, 1, 1)$ and covariance matrix $\begin{pmatrix} 1 & 0.5 & 0.25 \\ 0.5 & 1 & 0.5 \\ 0.25 & 0.5 & 1 \end{pmatrix}.$ Conditioned on $x_{i}$ , $y_{i}$ is generated according to a linear model: $y_{i} = β^{'} x_{i} + ε_{i}$ , or a nonlinear model: $y_{i} = 0.5 (β^{'} x_{i})^{2} + ε_{i}$ , where $β^{'} = (1, 1, 1)$ and $ε_{i}$ is generated from one of the following three distributions:

normal distribution $N (0, 4)$ ,
mixture normal distribution $0.4 N (0, 1) + 0.6 N (0, 9)$ ,
heteroscedastic normal distribution $N (0, x_{i 1}^{2} + 1)$ , where $x_{i 1}$ is the first component of $x_{i}$ .

Conditioned on $x_{i}$ , the response indicator $a_{i}$ is generated from the Bernoulli distribution with probability $π (x_{i}) = 1 / [1 + \exp (- 0.4 - 0.1 β^{'} x_{i})],$ where the coefficients in $π (x_{i})$ are chosen so that the unconditional rates of missing data are between $20\%$ and $40\%$ . For each i, $x_{i}$ is observed and $y_{i}$ is observed if and only if $a_{i} = 1$ .

For simplicity, we consider K = 1 in (A1) and N = n. Then, ${\hat{μ}}_{I} = {\hat{Y}}_{I} / n$ is considered as an estimator of the super-population mean $μ = E (y_{i})$ , which is $μ = 3$ under the linear model and $μ = 7.25$ under the nonlinear model. To apply single-index NNI in (2), SAVE (Cook & Weisberg, 1991) is used to obtain the estimator $\hat{β}$ .
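The two values of μ can be checked directly from the simulation design. In the short derivation below, $\Sigma$ denotes the covariance matrix of $x_{i}$ given in this section (the symbol is introduced here for convenience), and the error $ε_{i}$ has conditional mean zero under all three error distributions.

```latex
\begin{aligned}
E(\beta' x_i) &= 1 + 1 + 1 = 3, \\
\mathrm{Var}(\beta' x_i) &= \beta' \Sigma \beta = (1 + 1 + 1) + 2\,(0.5 + 0.25 + 0.5) = 5.5, \\
\text{linear model:}\quad \mu &= E(\beta' x_i) + E(\varepsilon_i) = 3, \\
\text{nonlinear model:}\quad \mu &= 0.5\, E\{(\beta' x_i)^2\} + E(\varepsilon_i)
  = 0.5\,\{\mathrm{Var}(\beta' x_i) + [E(\beta' x_i)]^2\} = 0.5\,(5.5 + 9) = 7.25 .
\end{aligned}
```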

To evaluate the performance, we add two oracle estimators, in addition to ${\hat{μ}}_{I}$ . The first oracle estimator is $\hat{μ} = \sum_{i = 1}^{n} y_{i} / n$ , the sample mean without nonresponse, assuming we observe all $y_{i}$ 's. The second oracle estimator is ${\tilde{μ}}_{I}$ , which is the same as ${\hat{μ}}_{I}$ except that the true β, instead of $\hat{β}$ , is used in finding the nearest neighbour.
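The data-generating design and the two oracle estimators can be sketched as below. This is an illustrative re-creation under stated assumptions, not the authors' simulation code: it covers only the first error distribution, reads $N (0, 4)$ as a normal distribution with variance 4, uses the true β (so it computes ${\tilde{μ}}_{I}$ , not ${\hat{μ}}_{I}$ ), and runs fewer replications than the 1000 used in the paper. All function names, seeds and defaults are hypothetical.

```python
import numpy as np

MEAN = np.array([1.0, 1.0, 1.0])
COV = np.array([[1.0, 0.5, 0.25],
                [0.5, 1.0, 0.5],
                [0.25, 0.5, 1.0]])
BETA = np.array([1.0, 1.0, 1.0])

def generate(n, rng, nonlinear=False):
    """Draw (x_i, y_i, a_i), i = 1..n, with N(0, 4) errors (variance 4 assumed)."""
    x = rng.multivariate_normal(MEAN, COV, size=n)
    index = x @ BETA
    eps = rng.normal(0.0, 2.0, size=n)                  # standard deviation 2
    y = 0.5 * index ** 2 + eps if nonlinear else index + eps
    pi = 1.0 / (1.0 + np.exp(-0.4 - 0.1 * index))       # response probability
    a = rng.random(n) < pi
    return x, y, a

def oracle_nni_mean(x, y, a, beta=BETA):
    """Mean after nearest neighbour imputation on the true index beta'x, with w_i = 1 (N = n)."""
    index = x @ beta
    resp = np.flatnonzero(a)
    miss = np.flatnonzero(~a)
    dist = np.abs(index[miss, None] - index[None, resp])
    donors = resp[np.argmin(dist, axis=1)]
    y_imp = y.copy()
    y_imp[miss] = y[donors]
    return float(y_imp.mean())

def run(n=200, runs=200, seed=0, nonlinear=False):
    """Monte Carlo mean and SD of the full-sample mean and of the oracle NNI mean."""
    rng = np.random.default_rng(seed)
    full, oracle = [], []
    for _ in range(runs):
        x, y, a = generate(n, rng, nonlinear)
        full.append(y.mean())                           # sample mean without nonresponse
        oracle.append(oracle_nni_mean(x, y, a))
    return (float(np.mean(full)), float(np.std(full)),
            float(np.mean(oracle)), float(np.std(oracle)))
```

Calling `run(nonlinear=True)` switches to the nonlinear model; replacing `BETA` in `oracle_nni_mean` by an estimated direction would give a version of ${\hat{μ}}_{I}$ .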

Table 1 provides the simulation bias and standard deviation (SD) of $\hat{μ}$ , ${\tilde{μ}}_{I}$ and ${\hat{μ}}_{I}$ based on 1000 runs. It can be seen from Table 1 that all biases are negligible. In terms of the SD, ${\hat{μ}}_{I}$ based on single-index NNI is just slightly worse than the oracle estimator ${\tilde{μ}}_{I}$ using the true β instead of $\hat{β}$ .

Table 1. Simulation bias and standard deviation (SD) in estimating μ (1000 runs).


The empirical results are consistent with our theoretical findings.

Acknowledgements

We would like to thank two referees for their comments and suggestions.

Disclosure statement

No potential conflict of interest was reported by the authors.

Additional information

Funding

This work was partially supported by the National Natural Science Foundation of China grants 11831008 and 11871287, the U.S. National Science Foundation grants DMS-1612873 and DMS-1914411, the Natural Science Foundation of Tianjin (18JCYBJC41100) and the Fundamental Research Funds for the Central Universities.

Notes on contributors

Jun Shao

Dr Jun Shao holds a PhD in statistics from the University of Wisconsin-Madison. He is a professor of statistics at the University of Wisconsin-Madison and East China Normal University. His research interests include variable selection and inference with high dimensional data, sample surveys, and missing data problems.

Lei Wang

Dr Lei Wang holds a PhD in statistics from East China Normal University. He is an assistant professor of statistics at Nankai University. His research interests include empirical likelihood and missing data problems.

References

Chen, J., & Shao, J. (2000). Nearest neighbor imputation for survey data. Journal of Official Statistics, 16, 113–132.
Chen, J., & Shao, J. (2001). Jackknife variance estimation for nearest-neighbor imputation. Journal of the American Statistical Association, 96, 260–269. doi: 10.1198/016214501750332839
Cook, R. D., & Weisberg, S. (1991). Discussion of ‘Sliced inverse regression for dimension reduction’. Journal of the American Statistical Association, 86, 28–33.
Farber, J. E., & Griffin, R. (1998). A comparison of alternative estimation methodologies for Census 2000. Proceedings of the section on survey research methods (pp. 629–634). American Statistical Association.
Fay, R. E. (1999). Theory and application of nearest neighbor imputation in Census 2000. Proceedings of the section on survey research methods (pp. 112–121). American Statistical Association.
Kalton, G., & Kasprzyk, D. (1986). The treatment of missing data. Survey Methodology, 12, 1–16.
Kosorok, M. R. (1999). Two-sample quantile tests under general conditions. Biometrika, 86, 909–921. doi: 10.1093/biomet/86.4.909
Krewski, D., & Rao, J. N. K. (1981). Inference from stratified samples: Properties of the linearization, jackknife and balanced repeated replication methods. The Annals of Statistics, 9, 1010–1019. doi: 10.1214/aos/1176345580
Lee, H., Rancourt, E., & Särndal, C. E. (1994). Experiments with variance estimation from survey data with imputed values. Journal of Official Statistics, 10, 231–243.
Li, K. C., & Duan, N. (1991). Regression analysis under link violation. The Annals of Statistics, 17, 1009–1052. doi: 10.1214/aos/1176347254
Little, R. J. (1995). Modeling the dropout mechanism in repeated-measures studies. Journal of the American Statistical Association, 90, 1112–1121. doi: 10.1080/01621459.1995.10476615
Montaquila, J. M., & Ponikowski, C. H. (1993). Comparison of methods for imputing missing responses in an establishment survey. Proceedings of the section on survey research methods (pp. 446–451). American Statistical Association.
Rancourt, E. (1999). Estimation with nearest neighbor imputation at Statistics Canada. Proceedings of the section on survey research methods (pp. 131–138). American Statistical Association.
Rubin, D. B. (1987). Multiple imputation for nonresponse in surveys. New York: Wiley.
Schenker, N., & Welsh, A. H. (1988). Asymptotic results for multiple imputation. The Annals of Statistics, 16, 1550–1566. doi: 10.1214/aos/1176351053
Sedransk, J. (1985). The objective and practice of imputation. Proceedings of the first annual research conference (pp. 445–452). Washington, DC: Bureau of the Census.
Shao, J., & Wang, H. (2008). Confidence intervals based on survey data with nearest neighbor imputation. Statistica Sinica, 18, 281–297.
Valliant, R. (1993). Poststratification and conditional variance estimation. Journal of the American Statistical Association, 88, 89–96.