1,541
Views
1
CrossRef citations to date
0
Altmetric
Articles

The balance property in neural network modelling

Pages 1-9 | Received 09 Oct 2019, Accepted 15 Jan 2021, Published online: 21 Feb 2021

Figures & data

Table 1. Comparison of the homogeneous model (null model), the logistic regression model, the early stopping FNN and the GLM improved/regularised FNN.

Figure 1. Preliminary analysis exploring the early stopping rule: model fitting on the training data D0 (in upper graph) and tracking over-fitting on the validation data V (in lower graph); note that this is a standard output in Keras which (unfortunately) drops the factor 2 from the loss function (Equation7), thus, the y-axis needs to be scaled with 2.

Figure 1. Preliminary analysis exploring the early stopping rule: model fitting on the training data D0 (in upper graph) and tracking over-fitting on the validation data V (in lower graph); note that this is a standard output in Keras which (unfortunately) drops the factor 2 from the loss function (Equation7(7) (w,θ)↦L(w,θ;D)=−2nℓ(w,θ;D).(7) ), thus, the y-axis needs to be scaled with 2.

Figure 2. (lhs) Balance property (Equation4) of the early stopping FNN over 50 different seeds (starting points), the orange horizontal line shows the balance property of 5.007276%; (middle) in-sample learning losses on D and (rhs) out-of-sample test losses on T of the early stopping FNN (left box plots in graphs) and the GLM improved/regularised FNN (right box plots in graphs) over the 50 different seeds (starting values of the SGD algorithm).

Figure 2. (lhs) Balance property (Equation4(4) 1n∑i=1nYi=1n∑i=1npˆMLE(xi)=1n∑i=1nσwˆ0MLE+∑j=1qwˆjMLExi,j,(4) ) of the early stopping FNN over 50 different seeds (starting points), the orange horizontal line shows the balance property of 5.007276%; (middle) in-sample learning losses on D and (rhs) out-of-sample test losses on T of the early stopping FNN (left box plots in graphs) and the GLM improved/regularised FNN (right box plots in graphs) over the 50 different seeds (starting values of the SGD algorithm).

Table 2. Comparison of the homogeneous model (null model), the logistic regression model, the early stopping FNN and the GLM improved/regularised FNN, shrinkage regularised versions for different tuning parameters η{10,100,250,1000}.