169
Views
0
CrossRef citations to date
0
Altmetric
Application Note

A two-stage latent factor regression method to model the common and unique effects of multiple highly correlated exposure variables

&
Pages 168-192 | Received 10 Jul 2021, Accepted 15 Oct 2022, Published online: 25 Oct 2022

References

  • A. Agresti, Foundations of Linear and Generalized Linear Models, Wiley, New Jersey, 2015.
  • G. Ambler and A. Benner, mfp: Multivariable Fractional Polynomials (2022). Available at https://CRAN.R-project.org/package=mfp, R package version 1.5.2.2
  • Y. Amemiya and I. Yalcin, Nonlinear factor analysis as a statistical method, Stat. Sci. 16 (2001), pp. 275–294.
  • H. Artigue and G. Smith, The principal problem with principal components regression, Cogent Math. Stat. 6 (2019), p. 1622190.
  • K. Bollen, Structural Equations with Latent Variables, John Wiley & Sons, New York, 1989.
  • W.F. Christensen and Y. Amemiya, Latent variable analysis of multivariate spatial data, J. Am. Stat. Assoc. 97 (2002), pp. 302–317.
  • J.P. De Marco and C.C. Nóbrega, Evaluating collinearity effects on species distribution models: an approach based on virtual species simulation, PLoS ONE 13 (2018), pp. 1–25.
  • R. Engle and M. Watson, A one-factor multivariate time series model of metropolitan wage rates, J. Am. Stat. Assoc. 76 (1981), pp. 774–781.
  • C. Feng, J. Li, W. Sun, Y. Zhang, and Q. Wang, Impact of ambient fine particulate matter (PM2.5) exposure on the risk of influenza-like-illness: a time-series analysis in Beijing, China, Environ. Health Perspect. 15 (2016), pp. 1–12.
  • J. Fox, Applied Regression Analysis and Generalized Linear Models, 2nd ed., Sage, Los Angeles, CA, 2008.
  • M.H. Graham, Factors determining the upper limit of giant kelp, Macrocystis pyrifera Agardh, along the Monterey Peninsula, central California, USA, J. Exp. Mar. Biol. Ecol. 218 (1997), pp. 127–149.
  • M.H. Graham, Confronting multicollinearity in ecological multiple regression, Ecology 84 (2003), pp. 2809–2815.
  • R.F. Gunst, Regresion analysis with multicollinear predictor variables: definition, derection, and effects, Commun. Stat. Theory Methods 12 (1983), pp. 2217–2260.
  • R.F. Gunst, Comment: toward a balanced assessment of collinearity diagnostics, Am. Stat. 38 (1984), pp. 79–82.
  • A.S. Hadi and R.F. Ling, Some cautionary notes on the use of principal components regression, Am. Stat. 52 (1998), pp. 15–19.
  • J. Hanspach, I. Kühn, O. Schweiger, S. Pompe, and S. Klotz, Geographical patterns in prediction errors of species distribution models, Glob. Ecol. Biogeogr. 20 (2011), pp. 779–788.
  • J. Hoffmann and K. Shafer, Linear Regression Analysis: Applications and Assumptions, NASW Press, North Carolina, 2015.
  • J.W. Hogan and R. Tchernis, Bayesian factor analysis for spatially correlated data, with application to summarizing area-level material deprivation from census data, J. Am. Stat. Assoc. 99 (2004), pp. 314–324.
  • H. Hotelling, Analysis of a complex of statistical variables into principal components, J. Educ. Psychol. 24 (1933), pp. 417–441.
  • G. Huang, D. Lee, and E.M. Scott, Multivariate space-time modelling of multiple air pollutants and their health effects accounting for exposure uncertainty, Stat. Med. 37 (2018), pp. 1134–1148.
  • R.A. Johnson and D.W. Wichern, Applied Multivariate Statistical Analysis, Pearson Prentice Hall, New Jersey, 2007.
  • R.W. Johnson, Fitting percentage of body fat to simple body measurements, J. Stat. Educ. 4 (1996), p. null.
  • I.T. Jolliffe, Principal Component Analysis, 2nd ed., Springer Series in Statistics, Springer, New York, 2002.
  • A. Kalnins, Multicollinearity: how common factors cause type 1 errors in multivariate regression, Strat. Manag. J. 39 (2018), pp. 2362–2385.
  • M. Kutner, C. Nachtsheim, and J. Neter, Applied Linear Statistical Models, McGraw-Hill/Irwin, New York, 2004.
  • F. Laden, L.M. Neas, D.W. Dockery, and J. Schwartz, Association of fine particulate matter from different sources with daily mortality in six U.S. cities, Environ. Health Perspect. 108 (2000), pp. 941–947.
  • S. Lafi and J. Kaneene, An explanation of the use of principal-components analysis to detect and correct for multicollinearity, Prev. Vet. Med. 13 (1992), pp. 261–275.
  • M.R. Lavery, P. Acharya, S.A. Sivo, and L. Xu, Number of predictors and multicollinearity: what are their effects on error and bias in regression?, Commun. Stat. Simul. Comput. 48 (2019), pp. 27–38.
  • D. Lee, A. Rushworth, and S.K. Sahu, A Bayesian localized conditional autoregressive model for estimating the health effects of air pollution, Biometrics 70 (2014), pp. 419–429.
  • X. Liu, M.M. Wall, and J.S. Hodges, Generalized spatial structural equation models, Biostatistics 6 (2005), pp. 539–557.
  • E.R. Mansfield and B.P. Helms, Detecting multicollinearity, Am. Stat. 36 (1982), pp. 158–160.
  • W.F. Massy, Principal components regression in exploratory statistical research, J. Am. Stat. Assoc. 60 (1965), pp. 234–256.
  • P. Molenaar, A dynamic factor model for the analysis of multivariate time series, Psychometrika 50 (1985), pp. 181–202.
  • P. Molenaar, J.G. De Gooijer, and B. Schmitz, Dynamic factor analysis of nonstationary multivariate time series, Psychometrika 57 (1982), pp. 333–349.
  • R.H. Myers, Classical and Modern Regression with Applications, 2nd ed., Duxbury Advanced Series in Statistics and Decision Sciences, PWS-KENT PubCo, Boston, 1990.
  • K. Pituch and J. Stevens, Applied Multivariate Statistics for the Social Sciences Analyses with SAS and IBM's SPSS, Wiley, New Jersey, 2015.
  • H. Powell and D. Lee, Modelling spatial variability in concentrations of single pollutants and composite air quality indicators in health effects studies, J. R. Stat. Soc. Ser. A (Stat. Soc.) 177 (2014), pp. 607–623.
  • Z. Song, K.M. Chomicki, K. Drouillard, and R.P. Weidman, A statistical framework for testing impacts of multiple drivers of surface water quality in nearshore regions of large lakes, Sci. Total Environ. 811 (2022), p. 152362.
  • K.P. Vatcheva, M. Lee, J.B. McCormick, and M.H. Rahbar, Multicollinearity in regression analyses conducted in epidemiologic studies, Epidemiology 6 (2016), pp. 1–20.
  • F. Wang and M.M. Wall, Generalized common spatial factor model, Biostatistics 4 (2003), pp. 569–582.
  • S.N. Wood, Inference and computation with generalized additive models and their extensions, Test 29 (2020), pp. 307–339.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.