149
Views
1
CrossRef citations to date
0
Altmetric
Original Articles

Extending Gaussian process emulation using cluster analysis and artificial neural networks to fit big training sets

, , ORCID Icon, & ORCID Icon
Pages 195-208 | Received 08 Aug 2016, Accepted 13 Jun 2018, Published online: 17 Jul 2018
 

ABSTRACT

Gaussian process (GP) emulation is a relatively recent statistical technique that provides a fast-running approximation to a complex computer model, given training data generated by the considered model. Despite its sound theoretical foundation, GP emulation falls short in practical applications where the training dataset is very large, due to numerical instabilities in inverting the correlation matrix. We show how GP emulation can be extended to handle large training sets by first dividing the training set into smaller subsets using cluster analysis, then training an emulator for each subset, and finally combining the emulators using an artificial neural network (ANN). Our work has also conceptual relevance, as it shows how to solve a big data problem by introducing a local level in input space, where each emulator specialises in a certain subregion, and a global level, where the identified local features of the computer model are combined into a global view.

Disclosure statement

No potential conflict of interest was reported by the authors.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.