426
Views
0
CrossRef citations to date
0
Altmetric
Articles

A Scalable Partitioned Approach to Model Massive Nonstationary Non-Gaussian Spatial Datasets

&
Pages 105-116 | Received 25 Nov 2020, Accepted 20 Jul 2022, Published online: 07 Oct 2022

References

  • Anderes, E. B., and Stein, M. L. (2011), “Local Likelihood Estimation for Nonstationary Random Fields,” Journal of Multivariate Analysis, 102, 506–520. DOI: 10.1016/j.jmva.2010.10.010.
  • Bakka, H., Rue, H., Fuglstad, G.-A., Riebler, A., Bolin, D., Illian, J., Krainski, E., Simpson, D., and Lindgren, F. (2018), “Spatial Modeling with R-INLA: A Review,” Wiley Interdisciplinary Reviews: Computational Statistics, 10, e1443.
  • Banerjee, A., Dunson, D. B., and Tokdar, S. T. (2013), “Efficient Gaussian Process Regression for Large Datasets,” Biometrika, 100, 75–89. DOI: 10.1093/biomet/ass068.
  • Banerjee, S., and Gelfand, A. E. (2006), “Bayesian Wombling: Curvilinear Gradient Assessment under Spatial Process Models,” Journal of the American Statistical Association, 101, 1487–1501. DOI: 10.1198/016214506000000041.
  • Banerjee, S., Gelfand, A. E., Finley, A. O., and Sang, H. (2008), “Gaussian Predictive Process Models for Large Spatial Data Sets,” Journal of the Royal Statistical Society, Series B, 70, 825–848. DOI: 10.1111/j.1467-9868.2008.00663.x.
  • Bradley, J. R., Cressie, N., and Shi, T. (2016), “A Comparison of Spatial Predictors When Datasets Could be Very Large,” Statistics Surveys, 10, 100–131. DOI: 10.1214/16-SS115.
  • Bradley, J. R., Holan, S. H., and Wikle, C. K. (2020), “Bayesian Hierarchical Models with Conjugate Full-Conditional Distributions for Dependent Data from the Natural Exponential Family,” Journal of the American Statistical Association, 115, 2037–2052. DOI: 10.1080/01621459.2019.1677471.
  • Bradley, J. R., Holan, S. H., Wikle, C. K.(2018), “Computationally Efficient Multivariate Spatio-Temporal Models for High-Dimensional Count-Valued Data” (with Discussion), Bayesian Analysis, 13, 253–310. DOI: 10.1214/17-BA1069.
  • Campbell, T., and Broderick, T. (2019), “Automated Scalable Bayesian Inference via Hilbert Coresets,” The Journal of Machine Learning Research, 20, 551–588.
  • Cressie, N. (2015), Statistics for Spatial Data, New York: Wiley.
  • Cressie, N., and Johannesson, G. (2008), “Fixed Rank Kriging for Very Large Spatial Data Sets,” Journal of the Royal Statistical Society, Series B, 70, 209–226. DOI: 10.1111/j.1467-9868.2007.00633.x.
  • de Valpine, P., Turek, D., Paciorek, C., Anderson-Bergman, C., Temple Lang, D., and Bodik, R. (2017), “Programming with Models: Writing Statistical Algorithms for General Model Structures with NIMBLE,” Journal of Computational and Graphical Statistics, 26, 403–413. DOI: 10.1080/10618600.2016.1172487.
  • Diggle, P. J., Tawn, J., and Moyeed, R. (1998), “Model-based Geostatistics,” Journal of the Royal Statistical Society, Series C, 47, 299–350. DOI: 10.1111/1467-9876.00113.
  • Fouedjio, F., Desassis, N., and Rivoirard, J. (2016), “A Generalized Convolution Model and Estimation for Non-stationary Random Functions,” Spatial Statistics, 16, 35–52. DOI: 10.1016/j.spasta.2016.01.002.
  • Friedman, J., Hastie, T., and Tibshirani, R. (2010), “Regularization Paths for Generalized Linear Models via Coordinate Descent,” Journal of Statistical Software, 33, 1–22. DOI: 10.18637/jss.v033.i01.
  • Fuentes, M. (2001), “A High Frequency Kriging Approach for Non-stationary Environmental Processes,” Environmetrics: The Official Journal of the International Environmetrics Society, 12, 469–483. DOI: 10.1002/env.473.
  • Fuentes, M. (2002), “Interpolation of Nonstationary Air Pollution Processes: A Spatial Spectral Approach,” Statistical Modelling, 2, 281–298.
  • Gelfand, A. E., Kim, H.-J., Sirmans, C., and Banerjee, S. (2003), “Spatial Modeling with Spatially Varying Coefficient Processes,” Journal of the American Statistical Association, 98, 387–396. DOI: 10.1198/016214503000170.
  • Griffith, D. A. (2003), “Spatial Filtering,” in Spatial Autocorrelation and Spatial Filtering, pp. 91–130, Berlin: Springer.
  • Guan, Y., and Haran, M. (2018), “A Computationally Efficient Projection-based Approach for Spatial Generalized Linear Mixed Models,” Journal of Computational and Graphical Statistics, 27, 701–714. DOI: 10.1080/10618600.2018.1425625.
  • Guhaniyogi, R., and Banerjee, S. (2018), “Meta-kriging: Scalable Bayesian Modeling and Inference for Massive Spatial Datasets,” Technometrics, 60, 430–444. DOI: 10.1080/00401706.2018.1437474.
  • Haran, M., Hodges, J. S., and Carlin, B. P. (2003), “Accelerating Computation in Markov Random Field Models for Spatial Data via Structured MCMC,” Journal of Computational and Graphical Statistics, 12, 249–264. DOI: 10.1198/1061860031833.
  • Hastie, T., Tibshirani, R., and Friedman, J. (2009), The elements of Statistical Learning; Data Mining, Inference and Prediction, New york: Springer.
  • Heaton, M. J., Christensen, W. F., and Terres, M. A. (2017), “Nonstationary Gaussian Process Models using Spatial Hierarchical Clustering from Finite Differences,” Technometrics, 59, 93–101. DOI: 10.1080/00401706.2015.1102763.
  • Hefley, T. J., Broms, K. M., Brost, B. M., Buderman, F. E., Kay, S. L., Scharf, H. R., Tipton, J. R., Williams, P. J., and Hooten, M. B. (2017), “The Basis Function Approach for Modeling Autocorrelation in Ecological Data,” Ecology, 98, 632–646. DOI: 10.1002/ecy.1674.
  • Higdon, D. (1998), “A Process-Convolution Approach to Modelling Temperatures in the North Atlantic Ocean,” Environmental and Ecological Statistics, 5, 173–190.
  • Higdon, D., Gattiker, J., Williams, B., and Rightley, M. (2008), “Computer Model Calibration using High-Dimensional Output,” Journal of the American Statistical Association, 103, 570–583. DOI: 10.1198/016214507000000888.
  • Holland, D. M., Saltzman, N., Cox, L. H., and Nychka, D. (1999), “Spatial Prediction of Sulfur Dioxide in the Eastern United States,” in geoENV II–Geostatistics for Environmental Applications, eds. J. Gómez-Hernández, A. Soares, and R. Froidevaux, pp. 65–76, Dordrecht: Springer.
  • Hooten, M. B., and Wikle, C. K. (2010), “Statistical Agent-based Models for Discrete Spatio-Temporal Systems,” Journal of the American Statistical Association, 105, 236–248. DOI: 10.1198/jasa.2009.tm09036.
  • Huffman, G., Stocker, E., Bolvin, D., Nelkin, E., and Tan, J. (2019), GPM IMERG Final Precipitation L3 1 month 0.1 degree x 0.1 degree v06, Greenbelt, MD, Goddard Earth Sciences Data and Information Services Center (GES DISC).
  • Hughes, J., and Haran, M. (2013), “Dimension Reduction and Alleviation of Confounding for Spatial Generalized Linear Mixed Models,” Journal of the Royal Statistical Society, Series B, 75, 139–159. DOI: 10.1111/j.1467-9868.2012.01041.x.
  • Ingebrigtsen, R., Lindgren, F., and Steinsland, I. (2014), “Spatial Models with Explanatory Variables in the Dependence Structure,” Spatial Statistics, 8, 20–38. DOI: 10.1016/j.spasta.2013.06.002.
  • Joseph, V. R., and Kang, L. (2011), “Regression-based Inverse Distance Weighting with Applications to Computer Experiments,” Technometrics, 53, 254–265. DOI: 10.1198/TECH.2011.09154.
  • Katzfuss, M. (2013), “Bayesian Nonstationary Spatial Modeling for Very Large Datasets,” Environmetrics, 24, 189–200. DOI: 10.1002/env.2200.
  • Katzfuss, M. (2017), A Multi-Resolution Approximation for Massive Spatial Datasets,” Journal of the American Statistical Association, 112, 201–214.
  • Kim, H.-M., Mallick, B. K., and Holmes, C. (2005), “Analyzing Nonstationary Spatial Data using Piecewise Gaussian Processes,” Journal of the American Statistical Association, 100, 653–668. DOI: 10.1198/016214504000002014.
  • Kleiber, W., and Nychka, D. (2012), “Nonstationary Modeling for Multivariate Spatial Processes,” Journal of Multivariate Analysis, 112, 76–91. DOI: 10.1016/j.jmva.2012.05.011.
  • Lee, B. S., and Haran, M. (2021), “PICAR: An Efficient Extendable Approach for Fitting Hierarchical Spatial Models,” Technometrics, 64, 187–198. DOI: 10.1080/00401706.2021.1933596.
  • Li, Y., and Sun, Y. (2019), “Efficient Estimation of Nonstationary Spatial Covariance Functions with Application to High-Resolution Climate Model Emulation,” Statistica Sinica, 29, 1209–1231.
  • Lindgren, F., Rue, H., and Lindström, J. (2011), “An Explicit Link between Gaussian Fields and Gaussian Markov Random Fields: The Stochastic Partial Differential Equation Approach,” Journal of the Royal Statistical Society, Series B, 73, 423–498. DOI: 10.1111/j.1467-9868.2011.00777.x.
  • McDermott, P. L., Wikle, C. K., and Millspaugh, J. (2017), “Hierarchical Nonlinear Spatio-Temporal Agent-based Models for Collective Animal Movement,” Journal of Agricultural, Biological and Environmental Statistics, 22, 294–312. DOI: 10.1007/s13253-017-0289-2.
  • Minsker, S., Srivastava, S., Lin, L., and Dunson, D. B. (2017), “Robust and Scalable Bayes via a Median of Subset Posterior Measures,” The Journal of Machine Learning Research, 18, 4488–4527.
  • Mu, J., Wang, G., and Wang, L. (2018), “Estimation and Inference in Spatially Varying Coefficient Models,” Environmetrics, 29, e2485. DOI: 10.1002/env.2485.
  • Musgrove, D. R., Hughes, J., and Eberly, L. E. (2016), “Fast, Fully Bayesian Spatiotemporal Inference for fMRI Data,” Biostatistics, 17, 291–303. DOI: 10.1093/biostatistics/kxv044.
  • Nychka, D., Bandyopadhyay, S., Hammerling, D., Lindgren, F., and Sain, S. (2015), “A Multiresolution Gaussian Process Model for the Analysis of Large Spatial Datasets,” Journal of Computational and Graphical Statistics, 24, 579–599. DOI: 10.1080/10618600.2014.914946.
  • Nychka, D., Wikle, C., and Royle, J. A. (2002), “Multiresolution Models for Nonstationary Spatial Covariance Functions,” Statistical Modelling, 2, 315–331. DOI: 10.1191/1471082x02st037oa.
  • O’Hara, R. B., and Sillanpää, M. J. (2009), “A Review of Bayesian Variable Selection Methods: What, How and Which,” Bayesian Analysis, 4, 85–117. DOI: 10.1214/09-BA403.
  • Paciorek, C. J., and Schervish, M. J. (2003), “Nonstationary Covariance Functions for Gaussian Process Regression,” in NIPS, pp. 273–280. Citeseer.
  • Paciorek, C. J., and Schervish, M. J. (2006), “Spatial Modelling using a New Class of Nonstationary Covariance Functions,” Environmetrics: The Official Journal of the International Environmetrics Society, 17, 483–506.
  • Platnick, S., King, M. D., Ackerman, S. A., Menzel, W. P., Baum, B. A., Riédi, J. C., and Frey, R. A. (2003), “The Modis Cloud Products: Algorithms and Examples from Terra,” IEEE Transactions on Geoscience and Remote Sensing, 41, 459–473. DOI: 10.1109/TGRS.2002.808301.
  • Risser, M. D., and Calder, C. A. (2015), “Local Likelihood Estimation for Covariance Functions with Spatially-Varying Parameters: The Convospat Package for r,” arXiv preprint arXiv:1507.08613.
  • Royle, J. A., and Wikle, C. K. (2005), “Efficient Statistical Mapping of Avian Count Data,” Environmental and Ecological Statistics, 12, 225–243. DOI: 10.1007/s10651-005-1043-4.
  • Rue, H., Martino, S., and Chopin, N. (2009), “Approximate Bayesian Inference for Latent Gaussian Models by using Integrated Nested Laplace Approximations,” Journal of the Royal Statistical Society, Series B, 71, 319–392. DOI: 10.1111/j.1467-9868.2008.00700.x.
  • Sampson, P. (2010), “Constructions for Nonstationary Spatial Processes,” in The Handbook of Spatial Statistics, eds. A. E. Gelfand, P. J. Diggle, M. Fuentes, and P. Guttorp, pp. 119–130, Boca Raton, FL: CRC Press.
  • Sengupta, A., and Cressie, N. (2013), “Hierarchical Statistical Modeling of Big Spatial Datasets using the Exponential Family of Distributions,” Spatial Statistics, 4, 14–44. DOI: 10.1016/j.spasta.2013.02.002.
  • Sengupta, A., Cressie, N., Kahn, B. H., and Frey, R. (2016), “Predictive Inference for Big, Spatial, Non-Gaussian Data: MODIS Cloud Data and its Change-of-Support,” Australian & New Zealand Journal of Statistics, 58, 15–45.
  • Shaby, B., and Wells, M. T. (2010), “Exploring an Adaptive Metropolis Algorithm,” Currently Under Review, 1, 17.
  • Shepard, D. (1968), “A Two-Dimensional Interpolation Function for Irregularly-Spaced Data,” in Proceedings of the 1968 23rd ACM National Conference, pp. 517–524. DOI: 10.1145/800186.810616.
  • Shi, H., and Kang, E. L. (2017), “Spatial Data Fusion for Large Non-Gaussian Remote Sensing Datasets,” Stat, 6, 390–404. DOI: 10.1002/sta4.165.
  • Shi, T., and Cressie, N. (2007), “Global Statistical Analysis of MISR Aerosol Data: A Massive Data Product from NASAs Terra Satellite,” Environmetrics: The official journal of the International Environmetrics Society, 18, 665–680. DOI: 10.1002/env.864.
  • Srivastava, S., Cevher, V., Tran-Dinh, Q., and Dunson, D. B. (2015), “Wasp: Scalable Bayes via Barycenters of Subset Posteriors,” in AISTATS.
  • Stein, M. L. (2005), “Nonstationary Spatial Covariance Functions,” Technical report.
  • Stewart, G. W. (1993), “On the Early History of the Singular Value Decomposition,” SIAM Review, 35, 551–566. DOI: 10.1137/1035134.
  • Tibshirani, R. (1996), “Regression Shrinkage and Selection via the Lasso,” Journal of the Royal Statistical Society, Series B, 58, 267–288. DOI: 10.1111/j.2517-6161.1996.tb02080.x.
  • Tresp, V. (2000), “A Bayesian Committee Machine,” Neural Computation, 12, 2719–2741. DOI: 10.1162/089976600300014908.
  • Wahba, G. (1990), Spline Models for Observational Data, Philadelphia, PA: SIAM.
  • Zilber, D., and Katzfuss, M. (2021), “Vecchia–Laplace Approximations of Generalized Gaussian Processes for Big Non-Gaussian Spatial Data,” Computational Statistics & Data Analysis, 153, 107081.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.