1,301
Views
0
CrossRef citations to date
0
Altmetric
Cardio-renal Physiology and Disease Processes

Machine learning model and nomogram to predict the risk of heart failure hospitalization in peritoneal dialysis patients

, , , , , , & show all
Article: 2324071 | Received 22 Oct 2023, Accepted 22 Feb 2024, Published online: 17 Mar 2024

Abstract

Introduction

The study presented here aimed to establish a predictive model for heart failure (HF) and all-cause mortality in peritoneal dialysis (PD) patients with machine learning (ML) algorithm.

Methods

We retrospectively included 1006 patients who initiated PD from 2010 to 2016. XGBoost, random forest (RF), and AdaBoost were used to train models for assessing risk for 1-year and 5-year HF hospitalization and mortality. The performance was validated using fivefold cross-validation. The optimal ML algorithm was used to construct the models to predictive the risk of the HF and all-cause mortality. The prediction performance of ML methods and Cox regression was compared.

Results

Over a median follow-up of 49 months. Two hundred and ninety-eight patients developed HF required hospitalization; 199 patients died during the follow-up. The RF model (AUC = 0.853) was the best performing model for predicting HF, and the XGBoost model (AUC = 0.871) was the best model for predicting mortality. Baseline moderate or severe renal disease, systolic blood pressure (SBP), body mass index (BMI), age, Charlson Comorbidity Index (CCI) score were strongly associated with HF hospitalization, whereas age, CCI score, creatinine, age, high-density lipoprotein cholesterol (HDL-C), total cholesterol, baseline estimated glomerular filtration rate (eGFR) were the most significant predictors of mortality. For all the above endpoints, the ML models demonstrated better discrimination than Cox regression.

Conclusions

We developed and validated a novel method to predict the risk factors of HF and all-cause mortality that integrates readily available clinical, laboratory, and electrocardiographic variables to predict the risk of HF among PD patients.

Introduction

Peritoneal dialysis (PD) is one of the well-established renal replacement therapies that uses the peritoneal membrane for the liquid interchange; a catheter was placed in the peritoneal fundus through minor surgery or percutaneous technique. Peritoneal dialysis patients have many complications, among which cardiovascular disease is the most common cause of death in dialysis patients [Citation1,Citation2], and manifests as acute myocardial infarction, heart failure (HF), or sudden cardiac death, and EF preserved HF is one of the most frequent.

Cardiovascular disease is the main risk factor of death in maintenance dialysis patients, due to arteriosclerosis and chronic inflammation. Compared with hemodialysis patients, the use of glucose solution can lead to metabolic syndrome and obesity in PD patients. Because of this, the risk of dyslipidemia and hyperinsulinemia is higher. However, in terms of long-term survival, hemodialysis and PD are similar.

Despite current medical advances in the diagnosis and evidence-based management of HF, the outcome of HF remains unsatisfactory. This may be related to the poor accuracy of traditional medicine in predicting diseases. A number of clinical risk models based on linear models have been developed to predict early mortality in the dialysis patients, such as logistic or Cox model [Citation3–7]. The Framingham Risk Score (FRS) is one of these commonly used clinical models [Citation8]. In the research of HF, the Seattle Heart Failure Model is the most widely used risk prediction models, which achieved a C-statistic of 0.73 for mortality [Citation9]. However, there are few studies in PD patients.

In recent years, machine learning (ML) has been proven to be a very powerful method in medical fields [Citation10–13]. ML is a field of artificial intelligence, which mainly challenges big data and high dimensional data. The most prominent area of research which has seen rapid growth is in the field of diagnostics and prognosis [Citation14–18]. ML, as a kind of classical prediction model, has good predictive performance [Citation19]. By self-learning from big data, ML can produce a stable model that can predict the outcome of another dataset [Citation20]. ML has been increasingly used also in the field of HF research [Citation21,Citation22]. HF is a significant health problem in PD patients worldwide. In the United States, the annual mortality rate of such patients is as high as 20.7%, and the median morbidity hospitalization is two times per year [Citation23]. In this study, ML algorithm was used to build a model to predict the risk factors of HF and all-cause death in PD patients. Early diagnosis, early treatment, improve the quality of life.

Materials and methods

Study population

This study included patients who underwent PD catheterization and continuous ambulatory peritoneal dialysis (CAPD) in the Department of Nephrology, Fujian Provincial Hospital from January 2010 to December 2016. All adults aged ≧18 years. Patients were considered eligible for study inclusion when they had been on CAPD treatment for 3 months or more. Exclusion criteria included patients with underlying malignancy, pregnancy, severe mental illness, severe lung disease, severe heart disease (NYHA III–IV) or congenital heart disease, active stage of autoimmune disease requires large dose of glucocorticoid and immunosuppressive therapy, missing data >10%. In total, 1200 PD patients were enrolled in the study. Six patients with underlying malignancy, four patients with severe lung disease, 27 patients with severe heart disease (NYHA III–IV), and one patient with active stage of autoimmune disease requires large dose of glucocorticoid and immunosuppressive therapy and 18 patients had >10% missing data were excluded. One thousand one hundred and forty-four patients were included in this study. The participants were followed until February 2021. Forty patients transferred to hemodialysis, 23 patients received kidney transplantation, and 75 patients lost follow-up. The final analysis included 1006 participants. The participants were randomly assigned to development set and external validation cohort. The development set were 606 participants. The external validation cohort included 400 participants. The work flow of the patients’ inclusion procedure is presented in Suppl. Figure 1. This study was approved by the Fujian Provincial Hospital ethics committee as an exempt study with a waiver of informed consent, allowing a retrospective review of medical records.

Data set

The data were obtained from the electronic medical records system and the laboratory information management system of the Fujian Provincial Hospital. All the data were collected for routine patient management with no additional data input required for the modeling. Clinical data of eligible patients were extracted and aggregated into a data tables for use by the ML models.

The endpoints

The endpoints were defined as HF hospitalization and all-cause death. HF is a clinical syndrome characterized by a range of symptoms (dyspnea, limited physical activity, and edema) and signs (elevated jugular vein pressure, pulmonary congestion), usually caused by a structural or functional disease of the heart that results in ventricular filling and/or impaired ejection capacity. HF is classified according to New York Heart Association (NYHA) functional class. The primary end point was HF hospitalization, NYHA functional class III–IV. The secondary end point was all-cause death.

Diagnostic criteria for HF [Citation24]: (1) dyspnea, fatigue, or decreased activity endurance; (2) signs of fluid retention (pulmonary congestion and lower limb swelling): such as elevated jugular venous pressure, wet rales at the bottom of lung; (3) abnormal cardiac structure and/or function in echocardiography and pulmonary edema indicated by imaging. (4) The symptoms, signs, and imaging manifestations of the above patients were relieved by intensive dialysis, which included increasing the dialysis dose or using high concentration peritoneal dialysate. Physicians choose intensive dialysis according to the patient’s peritoneal function and the degree of HF.

Dyspnea and fatigue were measured using four-point and five-point exertion scales recorded respectively by the investigator [Citation25]. The investigator quantified the severity of lung congestion by Congestion Score Index (CSI) [Citation26,Citation27] and assessed the degree of peripheral edema at baseline [Citation28].The above symptoms, signs, and imaging scoring systems were also used to evaluate the remission of HF. The details of scales of dyspnea, fatigue, and lung congestion, peripheral edema are shown in the Suppl. Data.

Candidate variables

For each of the variable, selection was clinical routine data. In total, 113 patient characteristics (including demographics, comorbidities, vital signs, laboratory characteristics, electrocardiogram, echocardiogram, digital radiography, and therapy) were collected as variable candidates by medical record. Variables with >10% missing data were excluded, and removed variables with a correlation coefficient >0.5. Finally, the study analyzed 96 variables. Missing data <10% were imputed using a random forest (RF) imputation.

The Charlson Comorbidity Index (CCI) [Citation29] includes 19 variables related to comorbidities, with scores ranging from 1 to 6, and the sum of ownership weights is the single complication score for each patient.

In accordance with the 2016 European Society of Cardiology Guidelines for HF, an echocardiogram has at least one of the following abnormalities: LA enlargement, LV hypertrophy, or mitral valve inflow/tissue Doppler abnormalities. Echocardiography was performed by experienced sonographers after 3 months PD. The two examiners of echocardiography have undergone uniform training.

Peritoneal equilibration test (PET): proposed by Twardowski in 1987. Corrected serum calcium (mmol/L) = Ca(mmol/L) + 0.02 × [40 – ALB (g/L)]. Estimated glomerular filtration rate (eGFR) was calculated by the CKD-EPI formula. Left ventricular mass (LVM) was calculated via Devereux formula [Citation30]: LVM (g) = 0.80 [1.04 × (LVEDD + IVS + PWT)3 – LVED D3] + 0.6 g. The LVM index (LVMI) was calculated as LVM divided by the body surface area. Left ventricular hypertrophy (LVH) was defined as LVMI > 134 g/m2 in male and >110 g/m2 in female [Citation31,Citation32]. Ejection fraction (EF) was grouped by 50% (where ALB: albumin; Ca: calcium; LVEDD: left ventricular end diastolic dimension; IVS: interventricular septum; PWT: posterior wall thickness; LV: left ventricular; LA: left atrial).

Statistical analysis

Baseline data were analysis by SPSS 22.0 software (SPSS Inc., Chicago, IL). Baseline data were displayed as mean ± standard deviation or median (interquartile range) for continuous variables and as percentages for categorical variables. The normal distribution variables and non-normal distribution variables were tested using the t-test or Mann–Whitney’s U-test. The counting data were expressed by percentage and χ2 test. p < .05 was considered statistically significant. Homogeneity of variance analysis and Pearson’s correlation test were performed on statistically significant variables. Variables of VIF > 10 and correlations > 0.5 were eliminated.

Model development (RF-based and XGB-based selection)

Three different ML algorithms were considered: XGBoost, RF, and AdaBoost. The performance of the models was assessed using fivefold cross-validation. Eighty percent of the data were used as training set and 20% as validation set. All data were randomly divided into five subsets of similar size and mutually exclusive. Each round of training selects four subsets to form the training set and the rest subsets to form the validation set. Each model required training and validation five times, with different training and validation sets being used each time. The average of the five test results was accepted as the final result. The ML algorithm with the highest AUC value was selected for modeling. The important variables affecting the outcome were output and rank. The optimal model is output using the following measures: area under the ROC curve (AUC), accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and F1-score. Compared with the prediction performance of optimal ML algorithm and Cox regression, all statistical analysis was done using R language 3.6.3 and python 3.7 environment (R Foundation for Statistical Computing, Vienna, Austria).

Results

Baseline characteristics

Among the development set, the mean (±SD) age was 52.6 ± 16.1 years, 63.67% of patients were male. The median follow-up time was 49 months (IQR 16–69). 60.07% of patients had chronic glomerulonephritis, 25.41% of patients had diabetes, and 1.82% of patients had hypertension. The remaining kidney diseases included polycystic kidney disease and obstructive kidney disease, accounting for 12.71% (shown in , Suppl. Tables 1 and 2).

Table 1. Baseline data table for heart failure endpoints.

Results for predicting the risks of HF hospitalization

In this study, 298 of 606 patients developed HF required hospitalization during follow-up. As many as 65% of patients with HF were left ventricular EF preserved. Twenty-four significant variables were choosen from baseline characteristics (shown in and Suppl. Table 1). The performance of XGBoost, RF, and AdaBoost were assessed using fivefold cross-validation, the AUC values of training set were 0.844, 0.931, and 0.821, respectively (shown in Suppl. Figure 2(A)), the AUC values of validation set were 0.793, 0.794, and 0.794, respectively (shown in Suppl. Figure 2(B)). Finally, the optimal model of RF with the highest AUC was selected for modeling. Each index of the optimal model in the test set and the test set ROC curve are presented in and . presents the risk variables for predicting HF from the RF model. CCI2 (the history of congestive HF) was the most important risk factor. Systolic blood pressure (SBP), body mass index (BMI), and other risk factors followed.

Figure 1. The ROC curve for the random forest model for predicting heart failure in the test set. AUC: area under the curve.

Figure 1. The ROC curve for the random forest model for predicting heart failure in the test set. AUC: area under the curve.

Figure 2. Variable importance analysis. Results indicate the decrease in accuracy of the final model on exclusion of each specific variable, quantified on a scale of 0 to 0.14, 250, respectively. While 0 represents the minimum importance (lowest decrease in the accuracy when excluded), 0.14 and 250 represent the maximum importance (highest decrease in the accuracy when excluded). (A) Variable importance analysis of heart failure. (B) Variable importance analysis of 1-year heart failure. (C) Variable importance analysis of 5-year heart failure; CCI1: myocardial infarction; CCI2: congestive heart failure; CCI5: dementia; CCI7: connective tissue disease/rheumatic disease; CCI9: mild liver disease; CCI13: renal disease. ESRD1: primary glomerulonephritis; ESRD2: diabetes; SBP: systolic blood pressure; BMI: body mass index.

Figure 2. Variable importance analysis. Results indicate the decrease in accuracy of the final model on exclusion of each specific variable, quantified on a scale of 0 to 0.14, 250, respectively. While 0 represents the minimum importance (lowest decrease in the accuracy when excluded), 0.14 and 250 represent the maximum importance (highest decrease in the accuracy when excluded). (A) Variable importance analysis of heart failure. (B) Variable importance analysis of 1-year heart failure. (C) Variable importance analysis of 5-year heart failure; CCI1: myocardial infarction; CCI2: congestive heart failure; CCI5: dementia; CCI7: connective tissue disease/rheumatic disease; CCI9: mild liver disease; CCI13: renal disease. ESRD1: primary glomerulonephritis; ESRD2: diabetes; SBP: systolic blood pressure; BMI: body mass index.

Table 2. Performance each index of the optimal model in the test set of AUC, accuracy, sensitivity, specificity, PPV, NPV, and F1-score.

Results for predicting the risks of the first year follow-up HF hospitalization

In this sub-endpoint, during the follow-up, 35 patients died within 1 year without HF. Seventy-nine of the 571 patients developed HF required hospitalization during the first year follow-up. Twenty-four significant variables were chosen from baseline characteristics (shown in and Suppl. Table 1). The performance of XGBoost, RF and AdaBoost was assessed using fivefold cross-validation, the AUC values of training set were 0.874, 0.943, and 0.840, respectively (shown in Suppl. Figure 3(A)), the AUC values of validation set were 0.691, 0.728, and 0.704, respectively (shown in Suppl. Figure 3(B)). Finally, the optimal model of RF with the highest AUC was selected for modeling. Each index of the optimal model in the test set and the test ROC curve are presented in and . presents the risk variables for predicting the HF in first year follow-up from the RF model. BMI was the most important risk factor. Age, SBP, and other risk factors followed.

Figure 3. The ROC curve for the random forest model for predicting heart failure at year 1 in the test set. AUC: area under the curve.

Figure 3. The ROC curve for the random forest model for predicting heart failure at year 1 in the test set. AUC: area under the curve.

Results for predicting the risks of the 5-year follow-up HF hospitalization

In this sub-endpoint, during the follow-up, 150 patients died within 5 year without HF. Two hundred and forty-six of the 456 patients developed HF during 5-year follow-up. Twenty-four significant variables were chosen from baseline characteristics (shown in and Suppl. Table 1). The performance of XGBoost, RF, and AdaBoost was assessed using fivefold cross-validation, the AUC values of training set were 0.944, 0.806, and 0.857, respectively (shown in Suppl. Figure 4(A)), the AUC values of validation set were 0.706, 0.703, and 0.685, respectively (shown in Suppl. Figure 4(B)). Finally, the optimal model of XGBoost with the highest AUC was selected for modeling. Each index of the optimal model in the test set and the test ROC curve are presented in and . presents the risk variables for predicting the HF in 5-year follow-up from the XGBoost model. CCI score was the most important risk factor. BMI, SBP, and other risk factors followed.

Figure 4. The ROC curve for the XGBoost model for predicting heart failure at year 5 in the test set. AUC: area under the curve.

Figure 4. The ROC curve for the XGBoost model for predicting heart failure at year 5 in the test set. AUC: area under the curve.

Comparison of predictive performance between Cox and ML model in HF hospitalization endpoint

In this study, Cox model was used to evaluate the influence of risk variables in the endpoint of HF. The C-index of the Cox model in training set was 0.668 and test set was 0.724. The AUC of ML optimal model in test set was 0.853. In the 1-year and 5-year follow-up HF endpoints, the AUC of ML optimal model in test set was 0.729 and 0.698, respectively, while the AUC of Cox were 0.723 and 0.759 (shown in Suppl. Figure 5(A,B)).

Nomogram for predicting HF

The ML model analysis contained risk variables for HF in PD patients, including categorical variables generated from continuous variables. The nomogram was quantified according to the weights of the variables selected by the model. In the nomogram analysis, age > 90, BMI > 30, SBP <100 mmHg, daily urine volume < 100, coronary artery disease, peripheral vascular disease, diabetes, ECG ST-T change, congestive HF, and smoking history were independent risk factors associated with HF hospitalization. Based on the ML model, a nomogram was plotted to predict the probability of HF hospitalization in PD patients ().

Figure 5. Nomograms based on heart failure endpoints. The nomogram was quantified according to the weights of the variables selected by the model. BMI: body mass index; ECG: electrocardiograph; SBP: systolic blood pressure.

Figure 5. Nomograms based on heart failure endpoints. The nomogram was quantified according to the weights of the variables selected by the model. BMI: body mass index; ECG: electrocardiograph; SBP: systolic blood pressure.

Risk score system for HF incidence

From the 10 identified top-performing HF predictors, a risk score system for HF incidence was created (shown in Suppl. Figure 6), the risk variables were age, BMI, SBP, smoking history, congestive HF, daily urine volume, coronary artery disease, peripheral vascular disease, diabetes, and electrocardiogram ST-T change. The risk score ranges from 0 to 56. The observed scores range from 0 to > 38. The risk of HF increased with the increase of the score at 1 and 5 years of follow-up. A one-classification increment in the risk score was associated with a 20% higher risk of HF at 1-year or 5-year. It is divided into three risk levels according to risk scores (risk = 0, risk = 1, and risk = 2).

External validation of the risk score system for HF incidence

The risk score system for HF incidence was externally validated in the subgroup of PD patients at baseline. The external validation cohort included 400 PD patients with 208 incident HF events (52%). After plotting the externally verified survival curve, we found that the survival prognosis of PD patients in the high-risk group was significantly worse than that in the low-risk group (shown in ). The risk score system also had good risk prediction performance.

Figure 6. External validation of the risk score system for HF incidence. Kaplan–Meier’s survival curves for PD patients with higher and lower risks of HF. The risk score is divided into three levels. The survival prognosis of PD patients in the high-risk group was significantly worse than that in the low-risk group follow-up time (m): the time from study enrollment to the end of HF.

Figure 6. External validation of the risk score system for HF incidence. Kaplan–Meier’s survival curves for PD patients with higher and lower risks of HF. The risk score is divided into three levels. The survival prognosis of PD patients in the high-risk group was significantly worse than that in the low-risk group follow-up time (m): the time from study enrollment to the end of HF.

Results for predicting the risks of all cause death

Regarding all-cause mortality, 199 of 606 patients died during the follow-up. Forty-four significant variables were choosen from baseline characteristics (shown in Suppl. Table 2). The performance of XGBoost, RF, and AdaBoost was assessed using fivefold cross-validation, the AUC values of training set were 0.897, 0.935, and 0.816, respectively (shown in Suppl. Figure 7(A)), the AUC values of validation set were 0.828, 0.829, and 0.746, respectively (shown in Suppl. Figure 7(B)). Finally, the optimal model of RF with the highest AUC was selected for modeling. Each index of the optimal model in the test set and the test ROC curve are presented in and Suppl. Figures 7(C) and 8(A) presents the risk variables for predicting the all-cause death endpoint from the RF model. Age was the most important risk factor. CCI score, CR, and other risk factors followed.

Results for predicting the risks of the 1-year all cause death

In this study, 64 of 606 patients died during the first year follow-up. Forty-four significant variables were chosen from baseline characteristics (shown in Suppl. Table 2). The performance of XGBoost, RF, and AdaBoost was assessed using fivefold cross-validation, the AUC values of training set were 0.876, 0.807, and 0.753, respectively (shown in Suppl. Figure 9(A)), the AUC values of validation set were 0.709, 0.667, and 0.667, respectively (shown in Suppl. Figure 9(B)). Finally, the optimal model of XGBoost with the highest AUC was selected for modeling. Each index of the optimal model in the test set and the test ROC curve are presented in and Suppl. Figure 9(C) and 8(B) presents the risk variables for predicting the 1-year all cause death from the XGBoost model. Age was the most important risk factor. High-density lipoprotein cholesterol (HDL-C), total cholesterol, and other risk factors followed.

Results for predicting the risks of the 5-year all cause death

In this study, 161 of 606 patients died during 5-year follow-up. Forty-four significant variables were chosen from baseline characteristics (shown in Suppl. Table 2). The performance of XGBoost, RF, and AdaBoost was assessed using fivefold cross-validation, the AUC values of training set were 0.899, 0.938, and 0.832, respectively (shown in Suppl. Figure 10(A)), the AUC values of validation set were 0.795, 0.809, and 0.774, respectively (shown in Suppl. Figure 10(B)). Finally, the optimal model of RF with the highest AUC was selected for modeling. Each index of the optimal model in the test set and the test ROC curve are presented in and Suppl. Figure 10(C) and 8(C) presents the risk variables for predicting the 5-year all cause death from the RF model. Age was the most important risk factor. CCI score, eGFR, and other risk factors followed.

Comparison of predictive performance between Cox and ML model in all cause death endpoint

In this study, Cox model was used to evaluate the influence of risk variables in the endpoint of all cause death. The C-index of the Cox model in training set was 0.758 and test set was 0.769. The AUC of ML optimal model in test set was 0.871. In the 1-year and 5-year follow-up all cause death endpoints, the AUC of ML optimal model in test set were 0.669 and 0.829, respectively, while the AUC of Cox were 0.789 and 0.804 (shown in Suppl. Figure 11(A,B)).

Discussion

The purpose of our study was accurately predicting the risks of HF hospitalization and all cause death in PD patients using ML tools. To our knowledge, this is the latest study to predict the risks of HF hospitalization and all cause death in PD patients using various ML algorithms. Our ML predicting models have several unique characteristics. First, ML has shown its advantages in the processing of big medical data, especially suitable for the study of PD patients with complex complications. Second, data noise and missing data are inevitable in data collecting from the real world, especially the retrospective studies, and ML algorithm can easy handle the complex problem. Third, compared to the traditional models, ML models allow data to be constantly updated and can accurately capture feature connections between data.

In this study, age was an independent risk variable for all-cause death and HF in PD patients. Aging is a major risk factor for cardiovascular disease and a prognostic determinant of chronic HF [Citation33,Citation34]. According to the risk factors which derived from the ML model, we suggest reducing the age-related comorbidities index, which may help select patients in PD. The Framingham study showed that for every additional 10 years of age, the rate of HF increases twofold. PD patients have many complications, even absence of other comorbidities, weakness associated with aging increased mortality. We also propose thresholds for age-related comorbidities index, which may help evaluate the risk of HF and death in PD patients.

Another outstanding advantage of our study is that ML algorithm has increased usefulness of the CCI. Among the chronic disease indices validated in the dialysis patients, CCI has been widely used in statistical analyses due to its simplicity and ability to predict mortality, the original CCI has been validated in PD patients [Citation35,Citation36]. PD patients have complicated comorbidities, so CCI is quantifiable to evaluate the comorbidities. In this study, CCI score had a high weight in predicting the risk of all cause death and HF in PD patients, which was consistent with relevant studies.

Pulliam et al. [Citation37] conducted a study on PD patients in North America and found that the mortality of PD patients was 10% within one year. In our study, during the first year follow-up, the mortality was 11.5%, the result is similar with relation studies. According to a 5-year follow-up report released by the American Kidney Disease Data System in 2015 since 2008 [Citation38], the 3-year survival rate of PD patients was 66.4%, and the 5-year survival rate was 50.3%, with the mortality decreasing year by year. In our ML algorithm, the risks of 5-year all-cause mortality were age, CCI score, eGFR, creatinine, right ventricular diameter, DBP, total cholesterol, HDL-C, ALB, etc. It shows that, as longer as the duration of dialysis, complications, dialysis adequacy, blood lipid, and nutritional affect the survival.

One of the most widely used risk prediction models of HF is the Seattle HF model, which obtained an AUC value of 0.73 [Citation9]. However, few studies have used ML methods to predict the risk factors of HF in PD patients; this is another highlight of our study. Recent data show that ML algorithms outperform logistic regression models in the prediction of HF outcomes [Citation39–41].The traditional indicators of assessing HF such as edema, pulmonary rales, BNP were not sensitive and lagging. ML model has high sensitivity and specificity for clinical risk predicting. In our study, the AUC value of the risk predict model is above 0.8. In comparison with the Cox model, ML is mostly better than the Cox model. However, in the partial time endpoints, the ML model does not show superiority, which is related to the sample size. In this study, NT-proBNP and EF in predicted HF showed no significant difference, it may be because in patients with renal failure and insufficiency, NT-proBNP metabolism is slow. As for EF in echocardiography, although the average EF of the HF group was slightly lower than that of the non-HF group, due to insufficient sample size, the difference was not statistically significant.

This ML-based approach can deal with large multidimensional time-to-data sets, it does not need to consider whether the data are normally distributed or not, and the over-fitting of the model. Based on the ML algorithm, we have also developed an integer-based risk score, which can effectively predict the risk of HF in PD patients. From the risk scoring system, the PD patients with higher scores show an upward trend with the increase of scores during 1-year and 5-year follow-up. We can develop a risk scoring software for clinical use to evaluate the risk and survival of HF in PD patients.

To conclude, the purpose of our study was using ML algorithmic to accurately predict the risks of HF and all-cause mortality in PD patients by focusing on comorbidities and clinical information. Our models could help informing patients and caregivers early detection of the HF risk and all-cause mortality risk and may aid decision making in PD patients who are faced with the complication. Our risk scoring system is also applicable to the clinical evaluation of the HF risk of PD patients.

Study strengths and limitations

Our study has several strengths. The advantage of this study is that although ML has been applied in the clinical use and even in the field of dialysis, the research on PD is still lacking. This is the first ML study of all-cause mortality and cardiovascular complications in PD patients.

However, the limitations of our study are also worth mentioning. First, our study is a retrospective, single-center study with a limited sample size. Second, ML algorithms fail to explain the relationship between variables and endpoints. Third, the gold standard for PD capacity evaluation is isotope dilution, and dual-energy X-ray and bioelectrical impedance, but these methods cannot be used as a routine evaluation method for large-scale clinical studies due to operational limitations. It is impossible to distinguish between water and sodium retention or cardiac dysfunction by the traditional indicators in our study, especially in cases of normal EF. Fourth, this study only recorded HF patients requiring hospitalization, but did not include patients with subclinical symptoms of HF. Fifth, the scoring system needs to be verified by a larger sample size. In addition, some variables were excluded due to the high degree of deletion in our cohort, especially the echocardiography data, which can be continued in the future study.

Conclusions

We implemented ML algorithms to accurately predict the risks of HF and all-cause mortality in PD patients. Finally, we provide comprehensive results of our study using different ML algorithms. Our study suggested potential of ML to facilitate risk stratification and to discover new predictors in complex clinical situations, such as PD patients with many complications. This ML approach proved to have comparable forecasting value with Cox regression and was better than Cox model.

Author contributions

Liping Xu: data curation, data analysis, and drafting of the article; Fang Cao: searched for literature and collaborative data entry; Lian Wang: data entry; Weihua Liu: data analysis; Meizhu Gao: data analysis; Li Zhang: searched for literature; Fuyuan Hong: data curation; Miao Lin: study supervision, coordination, funding support, and design of this study. All authors agreed to the publication of this work.

Ethics statement

Our study was approved by Clinical Ethics Review Committee in Fujian Provincial Hospital as an exempt study with a waiver of informed consent, allowing a retrospective review of medical records. This study protocol was reviewed and approved by Clinical Ethics Review Committee in Fujian Provincial Hospital, approval number K2021-03-026.

Supplemental material

Supplemental Material

Download PDF (1.4 MB)

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

Data are not publicly available due to ethical reasons. Further enquiries can be directed to the corresponding author.

Additional information

Funding

This study was supported by grants from Provincial Natural Science Foundation of Fujian Province (No. 2019J01172), Young and Middle-Aged Scholars Program of Fujian Health Commission (No. 2019-ZQN-7) and a Special Grant for Education and Research from Fujian Department of Finance (No. (2022)840). This study was supported by grants from Xiamen Medical and Health Guiding Project (No. 3502Z20224ZD1257).

References

  • Daratha KB, Short RA, Corbett CF, et al. Risks of subsequent hospitalization and death in patients with kidney disease. Clin J Am Soc Nephrol. 2012;7(3):1–11. doi:10.2215/CJN.05070511.
  • Ortiz A, Covic A, Fliser D, et al. Epidemiology, contributors to, and clinical trials of mortality risk in chronic kidney failure. Lancet. 2014;383(9931):1831–1843. doi:10.1016/S0140-6736(14)60384-6.
  • Mauri JM, Cleries M, Vela E, et al. Design and validation of a model to predict early mortality in haemodialysis patients. Nephrol Dial Transplant. 2008;23(5):1690–1696. doi:10.1093/ndt/gfm728.
  • Chua HR, Lau T, Luo N, et al. Predicting first-year mortality in incident dialysis patients with end-stage renal disease – the UREA5 study. Blood Purif. 2014;37(2):85–92. doi:10.1159/000357640.
  • Doi T, Yamamoto S, Morinaga T, et al. Risk score to predict 1-year mortality after haemodialysis initiation in patients with stage 5 chronic kidney disease under predialysis nephrology care. PLOS One. 2015;10(6):e0129180. doi:10.1371/journal.pone.0129180.
  • Floege J, Gillespie IA, Kronenberg F, et al. Development and validation of a predictive mortality risk score from a European Hemodialysis Cohort. Kidney Int. 2015;87(5):996–1008. doi:10.1038/ki.2014.419.
  • Quinn RR, Laupacis A, Hux JE, et al. Predicting the risk of 1-year mortality in incident dialysis patients: accounting for case-mix severity in studies using administrative data. Med Care. 2011;49(3):257–266. doi:10.1097/MLR.0b013e318202aa0b.
  • D’Agostino RBSr., Vasan RS, Pencina MJ, et al. General cardiovascular risk profile for use in primary care: the Framingham Heart Study. Circulation. 2008;117(6):743–753. doi:10.1161/CIRCULATIONAHA.107.699579.
  • Levy WC, Mozaffarian D, Linker DT, et al. The Seattle Heart Failure Model: prediction of survival in heart failure. Circulation. 2006;113(11):1424–1433. doi:10.1161/CIRCULATIONAHA.105.584102.
  • Attia ZI, Kapa S, Lopez-Jimenez F, et al. Screening for cardiac contractile dysfunction using an artificial intelligence-enabled electrocardiogram. Nat Med. 2019;25(1):70–74. doi:10.1038/s41591-018-0240-2.
  • Chen T, Li X, Li Y, et al. Prediction and risk stratification of kidney outcomes in IgA nephropathy. Am J Kidney Dis. 2019;74(3):300–309. doi:10.1053/j.ajkd.2019.02.016.
  • He J, Baxter SL, Xu J, et al. The practical implementation of artificial intelligence technologies in medicine. Nat Med. 2019;25(1):30–36. doi:10.1038/s41591-018-0307-0.
  • Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med. 2019;25(1):44–56. doi:10.1038/s41591-018-0300-7.
  • Esteva A, Kuprel B, Novoa RA, et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature. 2017;542(7639):115–118. doi:10.1038/nature21056.
  • Gulshan V, Peng L, Coram M, et al. Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. JAMA. 2016;316(22):2402–2410. doi:10.1001/jama.2016.17216.
  • Weng SF, Reps J, Kai J, et al. Can machine-learning improve cardiovascular risk prediction using routine clinical data? PLOS One. 2017;12(4):e0174944. doi:10.1371/journal.pone.0174944.
  • Young J, Kempton MJ, McGuire P. Using machine learning to predict outcomes in psychosis. Lancet Psychiatry. 2016;3(10):908–909. doi:10.1016/S2215-0366(16)30218-8.
  • Yu KH, Zhang C, Berry GJ, et al. Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features. Nat Commun. 2016;7(1):12474. doi:10.1038/ncomms12474.
  • Jordan MI, Mitchell TM. Machine learning: trends, perspectives, and prospects. Science. 2015;349(6245):255–260. doi:10.1126/science.aaa8415.
  • Song Y, Gao S, Tan W, et al. Multiple machine learnings revealed similar predictive accuracy for prognosis of PNETs from the surveillance, epidemiology, and end result database. J Cancer. 2018;9(21):3971–3978. doi:10.7150/jca.26649.
  • Awan SE, Sohel F, Sanfilippo FM, et al. Machine learning in heart failure: ready for prime time. Curr Opin Cardiol. 2018;33(2):190–195. doi:10.1097/HCO.0000000000000491.
  • Tripoliti EE, Papadopoulos TG, Karanasiou GS, et al. Heart failure: diagnosis, severity estimation and prediction of adverse events through machine learning techniques. Comput Struct Biotechnol J. 2017;15:26–47. doi:10.1016/j.csbj.2016.11.001.
  • Collins AJ, Foley RN, Chavers B, et al. United States Renal Data System 2011 Annual Data Report: atlas of chronic kidney disease & end-stage renal disease in the United States. Am J Kidney Dis. 2012;59(1 Suppl. 1):A7, e1–420.
  • Heart Failure Group of Chinese Society of Cardiology of Chinese Medical A, Chinese Heart Failure Association of Chinese medical doctor A, editorial board of Chinese journal of C. Chinese guidelines for the diagnosis and treatment of heart failure 2018. Zhonghua Xin Xue Guan Bing Za Zhi. 2018;46(10):760–789.
  • Perez-Moreno AC, Jhund PS, MacDonald MR, et al. Fatigue as a predictor of outcome in patients with heart failure: analysis of CORONA (Controlled Rosuvastatin Multinational Trial in Heart Failure). JACC Heart Fail. 2014;2(2):187–197. doi:10.1016/j.jchf.2014.01.001.
  • Kobayashi M, Watanabe M, Coiro S, et al. Mid-term prognostic impact of residual pulmonary congestion assessed by radiographic scoring in patients admitted for worsening heart failure. Int J Cardiol. 2019;289:91–98. doi:10.1016/j.ijcard.2019.01.091.
  • Melenovsky V, Andersen MJ, Andress K, et al. Lung congestion in chronic heart failure: haemodynamic, clinical, and prognostic implications. Eur J Heart Fail. 2015;17(11):1161–1171. doi:10.1002/ejhf.417.
  • Caldentey G, Khairy P, Roy D, et al. Prognostic value of the physical examination in patients with heart failure and atrial fibrillation: insights from the AF-CHF trial (atrial fibrillation and chronic heart failure). JACC Heart Fail. 2014;2(1):15–23. doi:10.1016/j.jchf.2013.10.004.
  • Charlson ME, Pompei P, Ales KL, et al. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis. 1987;40(5):373–383. doi:10.1016/0021-9681(87)90171-8.
  • Woythaler JN, Singer SL, Kwan OL, et al. Accuracy of echocardiography versus electrocardiography in detecting left ventricular hypertrophy: comparison with postmortem mass measurements. J Am Coll Cardiol. 1983;2(2):305–311. doi:10.1016/s0735-1097(83)80167-3.
  • Ellouali F, Berkchi F, Bayahia R, et al. Comparison of the effects of dialysis methods (haemodialysis vs peritoneal dialysis) on diastolic left ventricular function dialysis methods and diastolic function. Open Cardiovasc Med J. 2016;10(1):171–178. doi:10.2174/1874192401610010171.
  • Essig M, Escoubet B, de Zuttere D, et al. Cardiovascular remodelling and extracellular fluid excess in early stages of chronic kidney disease. Nephrol Dial Transplant. 2008;23(1):239–248. doi:10.1093/ndt/gfm542.
  • Cunha FM, Pereira J, Ribeiro A, et al. Age affects the prognostic impact of diabetes in chronic heart failure. Acta Diabetol. 2018;55(3):271–278. doi:10.1007/s00592-017-1092-9.
  • Moskalev AA, Shaposhnikov MV, Plyusnina EN, et al. The role of DNA damage and repair in aging through the prism of Koch-like criteria. Ageing Res Rev. 2013;12(2):661–684. doi:10.1016/j.arr.2012.02.001.
  • Fried L, Bernardini J, Piraino B. Charlson Comorbidity Index as a predictor of outcomes in incident peritoneal dialysis patients. Am J Kidney Dis. 2001;37(2):337–342. doi:10.1053/ajkd.2001.21300.
  • van Manen JG, Korevaar JC, Dekker FW, et al. How to adjust for comorbidity in survival studies in ESRD patients: a comparison of different indices. Am J Kidney Dis. 2002;40(1):82–89. doi:10.1053/ajkd.2002.33916.
  • Pulliam J, Li NC, Maddux F, et al. First-year outcomes of incident peritoneal dialysis patients in the United States. Am J Kidney Dis. 2014;64(5):761–769. doi:10.1053/j.ajkd.2014.04.025.
  • Saran R, Li Y, Robinson B, et al. US Renal Data System 2015 Annual Data Report: epidemiology of kidney disease in the United States. Am J Kidney Dis. 2016;67(3 Suppl. 1): Svii, S1–305. doi:10.1053/j.ajkd.2015.12.014.
  • Desai RJ, Wang SV, Vaduganathan M, et al. Comparison of machine learning methods with traditional models for use of administrative claims with electronic medical records to predict heart failure outcomes. JAMA Netw Open. 2020;3(1):e1918962. doi:10.1001/jamanetworkopen.2019.18962.
  • Angraal S, Mortazavi BJ, Gupta A, et al. Machine learning prediction of mortality and hospitalization in heart failure with preserved ejection fraction. JACC Heart Fail. 2020;8(1):12–21. doi:10.1016/j.jchf.2019.06.013.
  • Kwon JM, Kim KH, Jeon KH, et al. Artificial intelligence algorithm for predicting mortality of patients with acute heart failure. PLOS One. 2019;14(7):e0219302. doi:10.1371/journal.pone.0219302.