353
Views
2
CrossRef citations to date
0
Altmetric
Articles

Model-assisted estimation in high-dimensional settings for survey data

, &
Pages 761-785 | Received 30 Sep 2020, Accepted 20 Feb 2022, Published online: 17 Mar 2022

References

  • P. Bardsley and R. Chambers, Multipurpose estimation from unbalanced samples, Appl. Stat. 33 (1984), pp. 290–299.
  • J.-F. Beaumont and C. Bocci, Another look at ridge calibration, Metron-Int. J. Statist. 66 (2008), pp. 260–262.
  • F. Breidt, G. Claeskens, and J. Opsomer, Model-assisted estimation for complex surveys using penalized splines, Biometrika 92 (2005), pp. 831–846.
  • F.-J. Breidt and J.-D. Opsomer, Local polynomial regression estimators in survey sampling, Ann. Statist. 28 (2000), pp. 1023–1053.
  • L. Breiman, J.H. Friedman, R.A. Olshen, and C.J. Stone, Classification and Regression Trees, Routledge, New York, 1984.
  • L. Breiman, Random forests, Mach. Learn. 45 (2001), pp. 5–32.
  • H. Cardot, A. Dessertaine, C. Goga, E. Josserand, and P. Lardin, Comparison of different sample designs and construction of confidence bands to estimate the mean of functional data : an illustration on electricity consumption, Surv. Methodol. 39 (2013), pp. 283–301.
  • H. Cardot, C. Goga, and M.-A. Shehzad, Calibration and partial calibration on principal components when the number of auxiliary variables is large, Statist. Sin. 27 (2017), pp. 243–260.
  • R. Chambers, Robust case-weighting for multipurpose establishment surveys, J. Off. Stat. 12 (1996), pp. 3–32.
  • G. Chauvet and C. Goga, Asymptotic efficiency of the calibration estimator in a high-dimensional data setting, J. Statist. Plann. Inference 217 (2021), pp. 177–187.
  • T. Chen and C. Guestrin, XGBoost, in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining – KDD 16, 2016, ACM Press, New York.
  • M. Dagdoug, C. Goga, and D. Haziza, Model-assisted estimation through random forests in finite population sampling, preprint (2022), to appear in J. Am. Stat. Assoc.
  • M. Dagdoug, C. Goga, and D. Haziza, Imputation procedures in surveys using nonparametric and machine learning methods: an empirical comparison, to appear in J. Survey Statist. Methodol. (2021).
  • J.-C. Deville and C.-E. Särndal, Calibration estimators in survey sampling, J. Am. Stat. Assoc. 87 (1992), pp. 376–382.
  • D. Firth and K. Bennett, Robust models in probability sampling, J. R. Statist. Soc. Ser. B 60 (1998), pp. 3–21.
  • C. Goga, Réduction de la variance dans les sondages en présence d'information auxiliaire: une approche non paramétrique par splines de régression, Canad. J. Statist. 33 (2005), pp. 163–180.
  • C. Goga and A. Ruiz-Gazen, Efficient estimation of non-linear finite population parameters by using non-parametrics, J. R. Statist. Soc. Ser. B 76 (2014), pp. 113–140.
  • C. Goga and M.A. Shehzad, Overview of ridge regression estimators in survey sampling, Université de Bourgogne: Dijon, France, 2010.
  • F. Guggemos and Y. Tillé, Penalized calibration in survey sampling: design-based estimation assisted by mixed models, J. Statist. Plann. Inference 140 (2010), pp. 3199–3212.
  • T. Hastie, R. Tibshirani, and J. Friedman, The Elements of Statistical Learning: Data Mining, Inference and Prediction. Springer; New York: 2011.
  • A.E. Hoerl and R.W. Kennard, Ridge regression: biased estimation for nonorthogonal problems, Technometrics 12 (1970), pp. 55–67.
  • A.E. Hoerl and R.W. Kennard, Ridge regression: biased estimation for nonorthogonal problems, J. Am. Stat. Assoc. 42 (2000), pp. 80–86.
  • D. Horvitz and D. Thompson, A generalization of sampling without replacement from a finite universe, J. Am. Stat. Assoc. 47 (1952), pp. 663–685.
  • C.-T. Isaki and W.-A. Fuller, Survey design under the regression superpopulation model, J. Am. Stat. Assoc. 77 (1982), pp. 49–61.
  • M. Kuhn and K. Johnson, Applied Predictive Modelling, Springer, New York, 2013.
  • R. Lehtonen and A. Veijanen, Logistic generalized regression estimators, Surv. Methodol. 24 (1998), pp. 51–56.
  • K. McConville and F.J. Breidt, Survey design asymptotics for the model-assisted penalised spline regression estimator, J. Nonparametric Regress. 25 (2013), pp. 745–763.
  • K.S. McConville, F.J. Breidt, T.C. Lee, and G.G. Moisen, Model-assisted survey regression estimation with the lasso, J. Surv. Stat. Methodol. 5 (2017), pp. 131–158.
  • K. McConville and D. Toth, Automated selection of post-strata using a model-assisted regression tree estimator, Scand. J. Statist. 46 (2019), pp. 389–413.
  • G.E. Montanari and M.G. Ranalli, Nonparametric model calibration in survey sampling, J. Am. Stat. Assoc. 100 (2005), pp. 1429–1442.
  • J.D. Opsomer, F.J. Breidt, G. Moisen, and G. Kauermann, Model-assisted estimation of forest resources with generalized additive models, J. Am. Stat. Assoc. 102 (2007), pp. 400–409.
  • J. Quinlan, Learning with continuous classes, in 5th Australian Joint Conference on Artificial Intelligence, Vol. 92, World Scientific, Singapore, 1992, pp. 343–348.
  • J. Rao and A.C. Singh, A ridge-shrinkage method for range-restricted weight calibration in survey sampling, in Proceedings of the Section on Survey Research Methods, American Statistical Association, Alexandria, 1997.
  • P.M. Robinson and C. -E. Särndal, Asymptotic properties of the generalized regression estimator in probability sampling, Sankhyā Series B 45 (1983), pp. 240–248.
  • C.-E. Särndal, On the π-inverse weighting best linear unbiased weighting in probability sampling, Biometrika 67 (1980), pp. 639–650.
  • C.-E. Särndal, B. Swensson, and J Wretman, Model Assisted Survey Sampling, Springer Series in Statistics, Springer-Verlag, New York, 1992.
  • C.-E. Särndal and R. Wright, Cosmetic form of estimators in survey sampling, Scand. J. Statist. 11 (1984), pp. 146–156.
  • T. Ta, J. Shao, Q. Li, and L. Wang, Generalized regression estimators with high-dimensional covariates, Stat. Sin. 30 (2020), pp. 1135–1154.
  • R. Tibshirani, Regression shrinkage and selection via the lasso, J. R. Statist. Soc. Ser. B 58 (1996), pp. 267–288.
  • D. Toth, rpms: Recursive Partitioning for Modeling Survey Data, 2021. R package version 0.5.1.
  • D. Toth and J.L. Eltinge, Building consistent regression trees from complex sample data, J. Am. Stat. Assoc. 106 (2011), pp. 1626–1636.
  • L. Wang and S. Wang, Nonparametric additive model assisted estimation for survey data, J. Multivar. Anal. 102 (2011), pp. 1126–1140.
  • H. Zou and T. Hastie, Regularization and variable selection via the elastic net, J. R. Statist. Soc. Ser. B 67 (2005), pp. 301–320.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.