1,879
Views
2
CrossRef citations to date
0
Altmetric
Applied & Interdisciplinary Mathematics

Determining the optimal number of folds to use in a K-fold cross-validation: A neural network classification experiment

ORCID Icon |
Article: 2201015 | Received 15 Jun 2022, Accepted 05 Apr 2023, Published online: 01 May 2023

Figures & data

Figure 1. A schematic of a 5-fold cross-validation.

Figure 1. A schematic of a 5-fold cross-validation.

Figure 2. A schematic of a single neuron in neural network.

Figure 2. A schematic of a single neuron in neural network.

Figure 3. A schematic of neuron k at the output layer of a multi-layer network.

Figure 3. A schematic of neuron k at the output layer of a multi-layer network.

Figure 4. A schematic of neuron j at the hidden layer of a multi-layer network

Figure 4. A schematic of neuron j at the hidden layer of a multi-layer network

Table 1. Parameters for the two-class normal mixture data

Figure 5. A plot of the number of hidden neurons and their respective error rate

Figure 5. A plot of the number of hidden neurons and their respective error rate

Figure 6. A plot of the network training error rates without early stopping for 1000 folds.

Figure 6. A plot of the network training error rates without early stopping for 1000 folds.

Table 2. Closest error rates without early stopping

Figure 7. A plot of the network training error rates with early stopping for 1000 folds

Figure 7. A plot of the network training error rates with early stopping for 1000 folds

Table 3. Closest error rates with early stopping

Data availability statement

The dataset used in this study was an artificial two-class normal mixture dataset generated using the MATLAB programming language version 9.2 (R2017a), with the MATLAB codes provided in the Appendix section of this article.