1,405
Views
0
CrossRef citations to date
0
Altmetric
Statistical Practice

Missing Data Imputation with High-Dimensional Data

ORCID Icon &
Pages 240-252 | Received 03 Feb 2022, Accepted 19 Aug 2023, Published online: 17 Nov 2023

Figures & data

Table 1 Average processing time (hr) required by the MI algorithms to generate 10 imputations for 100 datasets.

Fig. 1 Bias (a), mean squared error (b) and coverage (c) of the 6 MI algorithms for the random intercept variance component σ12(u) (simulated value range [0.09;0.30]) from (4). The black dashed line in panel (c) indicates the nominal coverage level, fixed at 0.95.

Fig. 1 Bias (a), mean squared error (b) and coverage (c) of the 6 MI algorithms for the random intercept variance component σ12(u) (simulated value range [0.09;0.30]) from (4). The black dashed line in panel (c) indicates the nominal coverage level, fixed at 0.95.

Fig. 2 Bias (a), mean squared error (b) and coverage (c) of the 6 MI algorithms for the residual variance component σe2(u) (simulated value range [0.58;2.07]) from (4) per biomarker. The black dashed line in panel (c) indicates the nominal coverage level, fixed at 0.95.

Fig. 2 Bias (a), mean squared error (b) and coverage (c) of the 6 MI algorithms for the residual variance component σe2(u) (simulated value range [0.58;2.07]) from (4) per biomarker. The black dashed line in panel (c) indicates the nominal coverage level, fixed at 0.95.

Fig. 3 Bias (a), mean squared error (b) and coverage (c) of the 6 MI algorithms for the residual correlation ρ(u) (simulated value range [–0.35;0.52]) from (4) per biomarker. The black dashed line in panel (c) indicates the nominal coverage level, fixed at 0.95.

Fig. 3 Bias (a), mean squared error (b) and coverage (c) of the 6 MI algorithms for the residual correlation ρ(u) (simulated value range [–0.35;0.52]) from (4) per biomarker. The black dashed line in panel (c) indicates the nominal coverage level, fixed at 0.95.

Fig. 4 Average point (a) and standard error (b) estimates and t-test p-val’s (c) (over 10 imputations) of the intercept fixed effect μ(u) for FIML and the imputation algorithms MICEPMM, MIPCA, and MIForest applied to the case study. The black dashed line in panel (c) indicates the significance level, fixed at 0.05.

Fig. 4 Average point (a) and standard error (b) estimates and t-test p-val’s (c) (over 10 imputations) of the intercept fixed effect μ(u) for FIML and the imputation algorithms MICEPMM, MIPCA, and MIForest applied to the case study. The black dashed line in panel (c) indicates the significance level, fixed at 0.05.

Fig. 5 Average point (a) and standard error (b) estimates and t-test p-val’s (c) (over 10 imputations) of the fixed effect for BMI βBMI(u) for FIML and the imputation algorithms MICEPMM, MIPCA, and MIForest applied to the case study. The black dashed line in panel (c) indicates the significance level, fixed at 0.05.

Fig. 5 Average point (a) and standard error (b) estimates and t-test p-val’s (c) (over 10 imputations) of the fixed effect for BMI βBMI(u) for FIML and the imputation algorithms MICEPMM, MIPCA, and MIForest applied to the case study. The black dashed line in panel (c) indicates the significance level, fixed at 0.05.

Fig. 6 Average point (a) and standard error (b) estimates (over 10 imputations) of the random intercept σ2(u) for FIML and the imputation algorithms MICEPMM, MIPCA, and MIForest applied to the case study.

Fig. 6 Average point (a) and standard error (b) estimates (over 10 imputations) of the random intercept σ2(u) for FIML and the imputation algorithms MICEPMM, MIPCA, and MIForest applied to the case study.

Fig. 7 Average point (a) and standard error (b) estimates (over 10 imputations) of the residual correlation ρ(u) for FIML and the imputation algorithms MICEPMM, MIPCA, and MIForest applied to the case study.

Fig. 7 Average point (a) and standard error (b) estimates (over 10 imputations) of the residual correlation ρ(u) for FIML and the imputation algorithms MICEPMM, MIPCA, and MIForest applied to the case study.
Supplemental material

Supplementary Material.zip

Download Zip (5.2 MB)

Data availability statement

Data for the case study is available upon request to Edwin R. van den Heuvel ([email protected]).