Generalised correlated cross-validation

Patrick S. Carmack (corresponding author), Department of Mathematics, University of Central Arkansas, 201 Donaghey Avenue, Conway, AR, 72035-5001, USA

Jeffrey S. Spence, Division of Epidemiology, Department of Internal Medicine, University of Texas Southwestern Medical Center at Dallas, 5323 Harry Hines Boulevard, Dallas, TX, 75390-8874, USA

William R. Schucany, Department of Statistical Science, Southern Methodist University, PO Box 750332, Dallas, TX, 75275-0332, USA

Abstract

Since its introduction by [Stone, M. (1974), ‘Cross-validatory Choice and the Assessment of Statistical Predictions (with discussion)’, Journal of the Royal Statistical Society, B36, 111–133] and [Geisser, S. (1975), ‘The Predictive Sample Reuse Method with Applications’, Journal of the American Statistical Association, 70, 320–328], cross-validation has been studied and improved by several authors including [Burman, P., Chow, E., and Nolan, D. (1994), ‘A Cross-validatory Method for Dependent Data’, Biometrika, 81(2), 351–358], [Hart, J. and Yi, S. (1998), ‘One-sided Cross-validation’, Journal of the American Statistical Association, 93(442), 620–630], [Racine, J. (2000), ‘Consistent Cross-validatory Model-selection for Dependent Data: hv-block Cross-validation’, Journal of Econometrics, 99, 39–61], [Hart, J. and Lee, C. (2005), ‘Robustness of One-sided Cross-validation to Autocorrelation’, Journal of Multivariate Analysis, 92(1), 77–96], and [Carmack, P., Spence, J., Schucany, W., Gunst, R., Lin, Q., and Haley, R. (2009), ‘Far Casting Cross Validation’, Journal of Computational and Graphical Statistics, 18(4), 879–893]. Perhaps the most widely used and best known variant is generalised cross-validation (GCV) [Craven, P. and Wahba, G. (1979), ‘Smoothing Noisy Data with Spline Functions’, Numerische Mathematik, 31, 377–403], a single-pass method that, assuming independent errors, penalises the fit by the trace of the smoother matrix. We propose an extension of GCV to correlated errors, motivated by a natural definition of residual degrees of freedom. The efficacy of the new method is investigated with a simulation experiment on bandwidth selection for a kernel smoother in local linear regression. Next, the winning methodology is illustrated by application to spatial modelling of fMRI data using a nonparametric semivariogram.
We conclude with remarks about the heteroscedastic case and a potential maximum likelihood framework for Gaussian random processes.
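
As a point of reference for the independent-error baseline mentioned above, the following is a minimal sketch of classical GCV bandwidth selection for a local linear smoother. It is not the authors' correlated extension, which the abstract does not specify. The Gaussian kernel, the function names (`local_linear_smoother`, `gcv_score`, `select_bandwidth`), the simulated sine data, and the bandwidth grid are all assumptions made for illustration. The criterion used is the standard form GCV(h) = n^{-1} ||(I - S_h) y||^2 / (1 - tr(S_h)/n)^2, where S_h is the smoother matrix.

```python
import numpy as np

def local_linear_smoother(x, h):
    """Smoother matrix S (n x n) of a local linear fit, so that y_hat = S @ y.

    Assumption for illustration: Gaussian kernel with bandwidth h.
    """
    n = len(x)
    S = np.empty((n, n))
    for i in range(n):
        d = x - x[i]                          # design centred at x[i]
        w = np.exp(-0.5 * (d / h) ** 2)       # kernel weights
        X = np.column_stack([np.ones(n), d])  # local intercept and slope
        XtW = X.T * w                         # X' W, shape (2, n)
        # The first row of (X'WX)^{-1} X'W gives the weights on y
        # that produce the fitted value at x[i].
        S[i] = np.linalg.solve(XtW @ X, XtW)[0]
    return S

def gcv_score(y, S):
    """Classical GCV: mean squared residual over (1 - tr(S)/n)^2.

    This form assumes independent errors.
    """
    n = len(y)
    resid = y - S @ y
    return np.mean(resid ** 2) / (1.0 - np.trace(S) / n) ** 2

def select_bandwidth(x, y, grid):
    """Return the bandwidth in `grid` minimising GCV, plus all scores."""
    scores = np.array([gcv_score(y, local_linear_smoother(x, h)) for h in grid])
    return grid[int(np.argmin(scores))], scores

# Hypothetical simulated data: a sine curve with independent Gaussian noise.
rng = np.random.default_rng(0)
x = np.linspace(0.0, 1.0, 100)
y = np.sin(2 * np.pi * x) + rng.normal(scale=0.3, size=x.size)

grid = np.linspace(0.02, 0.5, 25)
h_best, scores = select_bandwidth(x, y, grid)
print("GCV-selected bandwidth:", h_best)
```

With positively correlated errors this independent-error criterion tends to select bandwidths that are too small, which is the situation the proposed extension is meant to address.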
