289
Views
0
CrossRef citations to date
0
Altmetric
ORIGINAL RESEARCH

Predicting Diabetes in Patients with Metabolic Syndrome Using Machine-Learning Model Based on Multiple Years’ Data

ORCID Icon, ORCID Icon, ORCID Icon &
Pages 2951-2961 | Received 25 Jul 2022, Accepted 16 Sep 2022, Published online: 26 Sep 2022

Figures & data

Table 1 The Criteria of the International Diabetes Federation (IDF) for the Definition of Metabolic Syndrome (MetS)

Table 2 The Missing Percentages of Each Variable

Figure 1 The definition of each longitudinal dataset in the timeline.

Figure 1 The definition of each longitudinal dataset in the timeline.

Table 3 Baseline Characteristics of Sub-Groups from Patient Cohorts

Table 4 Performance Metrics of Machine-Learning Models Using Longitudinal Data

Figure 2 ROC curves of the three models for all the datasets.

Notes: (A) the ROC curve of logistic regression for single-year models and multiple-year models; (B) the ROC curve of random forest for single-year models and multiple-year models; (C) the ROC curve of Xgboost for single-year models and multiple-year models.
Abbreviation: ROC, receiver operating characteristic.
Figure 2 ROC curves of the three models for all the datasets.

Figure 3 Feature importance of each dataset using LASSO.

Notes: Parameter “Del_xx” was abbreviated from “delta_xx”. “Var_xx” was abbreviated from “Variance_xx”.
Abbreviations: BMI, body mass index; WC, waist circumference; SBP, systolic blood pressure. DBP, diastolic blood pressure; FPG, fasting plasma glucose; HDLC, high-density lipoprotein cholesterol; LDLC, low-density lipoprotein cholesterol; TG, triglyceride; FPG, fasting plasma glucose; HbA1c, glycated hemoglobin; TSH, thyroid-stimulating hormone; UA, uric acid.
Figure 3 Feature importance of each dataset using LASSO.