218
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Fast Calculation of Gaussian Process Multiple-Fold Cross-Validation Residuals and their Covariances

&
Received 25 May 2022, Accepted 03 May 2024, Published online: 17 Jun 2024

References

  • An, S., Liu, W., and Venkatesh, S. (2007), “Fast Cross-validation Algorithms for Least Squares Support Vector Machine and Kernel Ridge Regression,” Pattern Recognition, 40, 2154–2162. DOI: 10.1016/j.patcog.2006.12.015.
  • Arlot, S., and Celisse, A. (2010), “A Survey of Cross-Validation Procedures for Model Selection,” Statistical Surveys, 4, 40–79.
  • Bachoc, F. (2013), “Cross Validation and Maximum Likelihood Estimation of Hyper-Parameters of Gaussian Processes with Model Misspecification,” Computational Statistics and Data Analysis, 66, 55–69. DOI: 10.1016/j.csda.2013.03.016.
  • Bates, S., Hastie, T., and Tibshirani, R. (2023), “Cross-validation: What Does It Estimate and How Well Does It Do It?” Journal of the American Statistical Association. DOI: 10.1080/01621459.2023.2197686.
  • Cressie, N. A. C. (1993), Statistics for Spatial Data, London: Wiley.
  • Currin, C., Mitchell, T., Morris, M., and Ylvisaker, D. (1988), “A Bayesian Approach to the Design and Analysis of Computer Experiments,” Technical Report 6498, ORNL.
  • Dubrule, O. (1983), “Cross Validation of Kriging in a Unique Neighborhood,” Journal of the International Association for Mathematical Geology, 15, 687–699. DOI: 10.1007/BF01033232.
  • Fischer, B. (2016), “Model Selection for Gaussian Process Regression by Approximation Set Coding,” Master’s thesis, ETH Zürich.
  • Golub, G. H., Heath, M., and Wahba, G. (1979), “Generalized Cross-Validation as a Method for Choosing a Good Ridge Parameter,” Technometrics, 21, 215–223. DOI: 10.1080/00401706.1979.10489751.
  • Handcock, M. S., and Stein, M. L. (1993), “A Bayesian Analysis of Kriging,” Technometrics, 35, 403–410. DOI: 10.1080/00401706.1993.10485354.
  • Helbert, C., Dupuy, D., and Carraro, L. (2009), “Assessment of Uncertainty in Computer Experiments from Universal to Bayesian Kriging,” Applied Stochastic Models in Business and Industry, 25, 99–113. DOI: 10.1002/asmb.743.
  • Kaminsky, A. L., Wang, Y., Pant, K., Hashii, W. N., and Atachbarian, A. (2021), “An Efficient Batch k-fold Cross-Validation Voronoi Adaptive Sampling Technique for Global Surrogate Modeling,” Journal of Mechanical Design, 143, 011706. DOI: 10.1115/1.4047155.
  • Le Gratiet, L., Cannamela, C., and Iooss, B. (2015), “Cokriging-based Sequential Design Strategies Using Fast Cross-Validation for Multi-Fidelity Computer Codes,” Technometrics, 57, 418–427. DOI: 10.1080/00401706.2014.928233.
  • Magnusson, M., Andersen, M. R., Jonasson, J., and Vehtari, A. (2020), “Leave-One-Out Cross-Validation for Bayesian Model Comparison in Large Data,” in Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS).
  • Martino, L., Laparra, V., and Camps-Valls, G. (2017), “Probabilistic Cross-Validation Estimators for Gaussian Process Regression,” in 2017 25th European Signal Processing Conference (EUSIPCO), pp. 823–827. DOI: 10.23919/EUSIPCO.2017.8081322.
  • Omre, H. (1987), “Bayesian Kriging–Merging Observations and Qualified Guesses in Kriging,” Mathematical Geology, 19, 25–39. DOI: 10.1007/BF01275432.
  • Petit, S. (2022), “Improved Gaussian Process Modeling: Application to Bayesian Optimization,” PhD thesis, Université Paris-Saclay.
  • Petit, S. J., Bect, J., Da Veiga, S., Feliot, P., and Vazquez, E. (2020), “Towards New Cross-Validation-based Estimators for Gaussian Process Regression: Efficient Adjoint Computation of Gradients,” in 52èmes Journées de Statistique (JdS 2020), pp. 633–638. https://arxiv.org/pdf/2002.11543
  • Pirot, G., Krityakierne, T., Ginsbourger, D., and Renard, P. (2019), “Contaminant Source Localization via Bayesian Global Optimization,” Hydrology and Earth System Sciences, 23, 351–369. DOI: 10.5194/hess-23-351-2019.
  • Rabinowicz, A., and Rosset, S. (2020a), “Assessing Prediction Error Atinterpolation and Extrapolation Points,” Electronic Journal of Statistics, 14, 272–301. DOI: 10.1214/19-EJS1666.
  • ——— (2020b), “Cross-Validation for Correlated Data,” Journal of the American Statistical Association, 117, 718–731. DOI: 10.1080/01621459.2020.1801451.
  • Rasmussen, C. E., and Williams, C. K. I. (2006), Gaussian Processes for Machine Learning, Cambridge, MA: The MIT Press.
  • Roberts, D. R., Bahn, V., Ciuti, S., Boyce, M. S., Elith, J., Guillera-Arroita, G., Hauenstein, S., Lahoz-Monfort, J. J., Schröder, B., Thuiller, W., Warton, D. I., Wintle, B. A., Hartig, F., and Dormann, C. F. (2017), “Cross-Validation Strategies for Data with Temporal, Spatial, Hierarchical, or Phylogenetic Structure,” Ecography, 40, 913–929. DOI: 10.1111/ecog.02881.
  • Roustant, O., Ginsbourger, D., and Deville, Y. (2012), “DiceKriging, DiceOptim: Two R Packages for the Analysis of Computer Experiments by Kriging-based Metamodelling and Optimization,” Journal of Statistical Software, 51, 1–55. DOI: 10.18637/jss.v051.i01.
  • Santner, T. J., Williams, B. J., and Notz, W. (2003), The Design and Analysis of Computer Experiments, New York: Springer.
  • Wei, T., and Simko, V. (2021), R package ’corrplot’: Visualization of a Correlation Matrix, available at https://github.com/taiyun/corrplot.
  • Zhang, P. (1993), “Model Selection via Multifold Cross Validation,” Annals of Statistics, 21, 299–313.
  • Zhang, Y., and Yang, Y. (2015), “Cross-Validation for Selecting a Model Selection Procedure,” Journal of Econometrics, 187, 95–112. DOI: 10.1016/j.jeconom.2015.02.006.