104
Views
0
CrossRef citations to date
0
Altmetric
Research Article

A Projection Approach to Local Regression with Variable-Dimension Covariates

ORCID Icon, ORCID Icon &
Received 13 Feb 2023, Accepted 16 May 2024, Published online: 17 Jun 2024
 

Abstract

Incomplete covariate vectors are known to be problematic for estimation and inferences on model parameters, but their impact on prediction performance is less understood. We develop an imputation-free method that builds on a random partition model admitting variable-dimension covariates. Cluster-specific response models further incorporate covariates via linear predictors, facilitating estimation of smooth prediction surfaces with relatively few clusters. We exploit marginalization techniques of Gaussian kernels to analytically project response distributions according to any pattern of missing covariates, yielding a local regression with internally consistent uncertainty propagation that uses only one set of coefficients per cluster. Aggressive shrinkage of these coefficients regulates uncertainty due to missing covariates. The method allows in- and out-of-sample prediction for any missingness pattern, even if the pattern in a new subject’s incomplete covariate vector was not seen in the training data. We develop an MCMC algorithm for posterior sampling that improves a computationally expensive update for latent cluster allocation. Finally, we demonstrate the model’s effectiveness for nonlinear point and density prediction under various circumstances by comparing with other recent methods for regression of variable dimensions on synthetic and real data. Supplemental materials for this article are available online.

Supplementary Materials

Appendix:(.pdf file) Illustration of marginalization behavior in VDLReg; fast screening tool for local linearity indicator, with simulations and data illustration; performance under different missingness mechanisms; cluster efficiency with locally linear prediction; role of the global shrinkage hyperparameter; full posterior; algorithm outline and computational complexity for allocation update; MCMC diagnostics; additional simulation results; an additional application; and additional details from the Old Faithful application.

VDLocalReg_examples:(zipped folder) R scripts that call the ProductPartitionModels Julia package to fit VDLReg models and recreate examples in the article.

Descriptions are contained in the README file.

Acknowledgments

The authors gratefully acknowledge helpful conversations with Peter Müller and Jyotishka Datta, as well as suggestions from an associate editor and two anonymous reviewers that significantly strengthened this work and its presentation. Figures were generated using ggplot2 (Wickham Citation2016).

Disclosure Statement

The authors report there are no competing interests to declare.

Additional information

Funding

The authors gratefully acknowledge partial funding from grant FONDECYT 1220017.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 180.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.