1,199
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Utilization of interpretable machine learning model to forecast the risk of major adverse kidney events in elderly patients in critical care

, , , &
Article: 2215329 | Received 16 Feb 2023, Accepted 12 May 2023, Published online: 23 May 2023

Abstract

Major adverse kidney events within 30 d (MAKE30) implicates poor outcomes for elderly patients in the intensive care unit (ICU). This study aimed to predict the occurrence of MAKE30 in elderly ICU patients using machine learning. The study cohort comprised 2366 elderly ICU patients admitted to the Second Xiangya Hospital of Central South University between January 2020 and December 2021. Variables including demographic information, laboratory values, physiological parameters, and medical interventions were used to construct an extreme gradient boosting (XGBoost) -based prediction model. Out of the 2366 patients, 1656 were used for model derivation and 710 for testing. The incidence of MAKE30 was 13.8% in the derivation cohort and 13.2% in the test cohort. The average area under the receiver operating characteristic curve of the XGBoost model was 0.930 (95% CI: 0.912–0.946) in the training set and 0.851 (95% CI: 0.810–0.890) in the test set. The top 8 predictors of MAKE30 tentatively identified by the Shapley additive explanations method were Acute Physiology and Chronic Health Evaluation II score, serum creatinine, blood urea nitrogen, Simplified Acute Physiology Score II score, Sequential Organ Failure Assessment score, aspartate aminotransferase, arterial blood bicarbonate, and albumin. The XGBoost model accurately predicted the occurrence of MAKE30 in elderly ICU patients, and the findings of this study provide valuable information to clinicians for making informed clinical decisions.

Introduction

Medical advancements have significantly prolonged human life expectancy [Citation1], leading to a growth of critically ill patients over the age of 65 [Citation2]. Such patients often experience a high rate of renal adverse events, and to better reflect overall renal outcomes, the use of a composite and objective clinical outcome indicator called major adverse kidney events (MAKE30) is recommended [Citation3]. Studies have shown that the incidence of MAKE30 in intensive care unit (ICU) patients ranges from 13.6% to 38.9% [Citation4–10]. The incidence might be higher in the elderly population due to their increased comorbidities and higher severity of illness [Citation11,Citation12].

Early identification of patients at high risk of adverse kidney outcomes is essential for clinicians to adjust clinical decisions and reduce the risk of MAKE30 [Citation5]. Several factors have been found to correlate with the occurrence of MAKE30 in critically ill patients, including serum chloride [Citation8], serum renin [Citation13], plasma Galectin-3 [Citation10], positive liquid balance [Citation14], and abnormal hepatic or portal venous Doppler [Citation7]. Logistic regression models have been developed for the prediction of MAKE30 in critically ill patients with acute kidney injury (AKI) [Citation15], acute pancreatitis [Citation16], and those receiving veno-arterial extracorporeal membrane oxygenation [Citation17]. However, most of these models were based on a logistic regression approach and were not been established in critically ill elderly patients.

Machine learning has emerged as a promising approach for clinical prognosis evaluation due to its ability to handle complex nonlinear relationships [Citation18–20]. One such algorithm, extreme gradient boosting (XGBoost), has recently dominated applied machine learning and structured data competitions. Its interpretability has made it a popular choice in the medical field. Previous studies have shown that XGBoost performed well in predicting prognosis of elderly ICU patients [Citation20–22]. Thus, the aim of this study is to establish a prediction model for MAKE30 in elderly patients in the ICU using the XGBoost algorithm.

Methods

Ethics approval

This study followed the Declaration of Helsinki and the transparent reporting statement for individual prognostic or diagnostic multivariate prediction models [Citation23]. Central South University’s Medical Ethics Committee (2022-K031) approved the study protocol. Due to the retrospective nature of the study, informed consent was waived.

Data source

This cohort study included all elderly patients admitted to the ICUs of the Second Xiangya Hospital of Central South University from January 1, 2020, to December 31, 2021. Medical information of the ICU patients was extracted from electronic medical record system.

Patient population

Patients with age ≥ 65 years who were admitted to the ICU were included in the study. The following patients were excluded: (1) length of ICU stay < 48 h; (2) patients with chronic kidney diseases (CKD) stage 5 or uremia; (3) patients on maintenance hemodialysis; (4) patients with creatinine values less than once during hospitalization; (5) kidney transplant recipients. For patients who were admitted to the ICU multiple times within one year, only the first admission information was recorded.

Feature selection

We collected patients’ demographic information, comorbidities, physiological parameters, invasive and noninvasive laboratory indicators, medical interventions, and clinical scores as potential predictive variables. The first values of physiological parameters and laboratory indicators within 24 h since admission to the ICU were used, and medical interventions were identified within 48 h since admission to the ICU. Sequential Organ Failure Assessment (SOFA) score, Simplified Acute Physiology Score II (SAPS II), and Acute Physiology and Chronic Health Evaluation II (APACHE II) score were calculated based on the first measurement of the patient within 24 h after admission to the ICU. Baseline creatinine was defined as the lowest creatinine value in the 12 months before admission and was estimated with the use of the following formula if no value was available. [creatinine = 0.74 − 0.2 (if female) + 0.08 (if African American) + 0.003 × age (in years)]. If a patient had documented CKD, it was defined as the lowest creatinine value during hospitalization [Citation6,Citation9,Citation24,Citation25]. Baseline glomerular filtration rates were calculated using the Modification of Diet in takes diseases equation estimation [Citation26]. AKI was determined following the 2012 Kidney Disease: Improving Global Outcomes Clinical Practice Guidelines [Citation27].

Outcome assessment

The main outcome is the occurrence of MAKE30, which is a compound outcome, including (1) emergent dialysis; (2) the last creatinine at hospitalization was 200% of baseline creatinine (persistent kidney impairment); (3) death [Citation7,Citation14]. Outcomes were obtained at discharge (if discharged within 30 d) or on the 30th day since admission to the ICU. The length of hospital stay and ICU stay were secondary outcomes.

Data set partitioning and statistical analysis

The patients were randomly divided into a training group and a test group in a 7:3 ratio by caret package in R. Patients’ characteristics are described as numbers (percentages) for categorical variables and medians (interquartile ranges) for continuous variables. We compared the distributions of continuous variables using the Mann–Whitney U test and categorical variables using the chi-square test. Categorical variables were transformed in advance to factor variables. The missForest package in R was used to impute missing data using the random forest method [Citation28]. Each variable’s data type and proportion of missing values are listed in Table S1.

Feature selection

Variables with missing proportions greater than 30% are eliminated. The nearZeroVar function was applied to identify variables with variance close to zero and exclude them from further calculations. For the final model, the variables that were chosen by at least three methods, including the Least Absolute Shrinkage and Selection Operator, Boruta algorithm, random forest-recursive feature elimination, and random forest-filtering, were selected. The variance inflation factor (VIF) was calculated to identify multicollinearity.

Modeling and testing

We trained the XGBoost model using 5 random shuffles of 5-fold cross-validation and evaluated the model performance based on the average area under the receiver operating characteristic curve (AUROC). The performance of the final prediction model was further evaluated in the test set using AUROC, the area under the precision-recall curve (AUPRC), and calibration. The modeling framework is shown in Figure S1. Table S2 describes the functions, packages, and tuning parameters used.

Interpretability of the model

We use the Shapley additive explanations (SHAP) method to try to open the black box of machine learning [Citation29]. It can preliminarily provide interpretability for the model, and quantify the influence of each feature on the prediction made by the model [Citation30,Citation31].

Sensitivity analysis

Model performance was assessed in different subgroups. We focused on patients with different age groups (≤ 79 years old and > 79 years old), different types of hospitalization (medical hospitalization and surgical hospitalization), different severity of illness (APACHE II score ≤ 16 and APACHE II score > 16), and different time to endpoint (time to endpoint ≤ 13 d and > 13 d). For clinical translation, low and high-risk thresholds were delineated and the corresponding statistical indicators were calculated. To evaluate the impact of dataset imbalance on model construction, upsampling and downsampling models were established using traincontral function in R.

Results

Characteristics of the study population

A total of 2366 patients were identified in this study, with 1656 patients divided into the derivation cohort and the remaining 710 patients into the test cohort (). The characteristics of the derivation and test cohorts are shown in . In the derivation cohort, 229 patients developed MAKE30, of which 53 patients died, 128 patients had persistent kidney impairment, and 141 patients received new dialysis. In the test cohort, 94 patients occurred MAKE30, of whom 22 patients died, 59 patients had persistent kidney impairment, and 57 patients received new dialysis. Comparisons of baseline characteristics in patients with and without MAKE30 in the derivation and test cohorts were shown in Tables S3 and S4 respectively.

Figure 1. Flow diagram of patient selection. CKD: chronic kidney diseases; ICU: intensive care unit; SCr: serum creatinine.

Figure 1. Flow diagram of patient selection. CKD: chronic kidney diseases; ICU: intensive care unit; SCr: serum creatinine.

Table 1. Baseline characteristics and outcomes of the patients in the derivation and test cohorts.

Predictive variable and model performance

Variables exhibiting multicollinearity (VIF >10) were removed [Citation32]. The final prediction model included 38 variables (Table S5). The XGBoost model established by the 38 variables has good discrimination with the AUROC in the derivation cohort and test cohort being 0.930 (95% CI: 0.912–0.946) and 0.851 (95% CI: 0.810–0.890), respectively (). The AUPRC of the XGBoost model in the derivation cohort and test cohort were 0.76 and 0.51, respectively (Figure S2). The Brier scores of the XGBoost model in the derivation cohort and test cohort were 0.066 and 0.087, respectively. Figures S2 and S3 provide details about the performance metrics, including AUPRC and calibration plots. The inference time of the XGBoost classifier for one case is 6.1489 (lower quartile - upper quartile:5.9230–6.5812) milliseconds. The hardware of our computer for modeling is listed in Table S5.

Figure 2. Receiver operating characteristic curves of the extreme gradient boosting model.

Figure 2. Receiver operating characteristic curves of the extreme gradient boosting model.

Interpretability of the model

By using the SHAP algorithm, each factor’s importance for the MAKE30 was initially explored. shows the feature importance of the 38 variables most relevant to MAKE30 (in descending order). Among them, APACHE II score, serum creatinine, blood urea nitrogen (BUN), SAPS II, SOFA score, aspartate aminotransferase (AST), arterial blood bicarbonate and albumin (ALB) were the top eight predictor variables associated with MAKE30 in the XGBoost model. How individual variable values affect SHAP values is shown in . To facilitate the clinical utility of the XGBoost model, low and high-risk thresholds were determined. Based on low-risk and high-risk cutoffs, presents diagnostic test characteristics. When 0.154 was used as the low-risk critical value, the sensitivity, specificity, positive likelihood ratio, and negative likelihood ratio of the model on the training set were 85.6%, 85.1%, 5.73 and 0.17, respectively; the sensitivity, specificity, positive likelihood ratio and negative likelihood ratio of the model in the test set were 77.7%, 79.5%, 3.80 and 0.28, respectively. When 0.186 was used as the high-risk critical value, the sensitivity, specificity, positive likelihood ratio, and negative likelihood ratio of the model on the training set were 81.7%, 87.5%, 3.80 and 0.28, respectively; the sensitivity, specificity, positive likelihood ratio and negative likelihood ratio of the model in the test set were 74.5%, 84.1%, 4.68, 0.30, respectively.

Figure 3. Shapley additive explanations summary plots of the extreme gradient boosting model for MAKE30. AKI: acute kidney injury; ALB: albumin; ALT: alanine aminotransferase; APACHE II: Acute Physiology and Chronic Health Evaluation II; AST: aspartate aminotransferase; BUN: blood urea nitrogen; CKD: Chronic kidney disease; eGFR: Estimated Glomerular Filtration Rate; GCS: Glasgow Coma Scale; HB: hemoglobin; RBC: Red blood cells; HCT: hematocrit; CHF: chronic heart failure; NYHA: New York Heart Association; HR: heart rate at ICU admission; PaO2: arterial partial pressure of oxygen; PLT: platelet; PT: prothrombin time; RDW-CV: Red blood cell distribution width coefficient of variation; SAPS II: Simplified Acute Physiology Score II; SCr: serum creatinine; SOFA: sequential organ failure assessment score; T: body temperature at ICU admission; WBC: white blood cell.

Figure 3. Shapley additive explanations summary plots of the extreme gradient boosting model for MAKE30. AKI: acute kidney injury; ALB: albumin; ALT: alanine aminotransferase; APACHE II: Acute Physiology and Chronic Health Evaluation II; AST: aspartate aminotransferase; BUN: blood urea nitrogen; CKD: Chronic kidney disease; eGFR: Estimated Glomerular Filtration Rate; GCS: Glasgow Coma Scale; HB: hemoglobin; RBC: Red blood cells; HCT: hematocrit; CHF: chronic heart failure; NYHA: New York Heart Association; HR: heart rate at ICU admission; PaO2: arterial partial pressure of oxygen; PLT: platelet; PT: prothrombin time; RDW-CV: Red blood cell distribution width coefficient of variation; SAPS II: Simplified Acute Physiology Score II; SCr: serum creatinine; SOFA: sequential organ failure assessment score; T: body temperature at ICU admission; WBC: white blood cell.

Figure 4. Shapley additive explanations dependence plots for the association between the predictors and MAKE30 in the extreme gradient boosting model. ALB: albumin; APACHE II: Acute Physiology and Chronic Health Evaluation II; AST: aspartate aminotransferase; BUN: blood urea nitrogen; SAPS II: Simplified Acute Physiology Score II; SCr: serum creatinine; SOFA: Sequential Organ Failure Assessment.

Figure 4. Shapley additive explanations dependence plots for the association between the predictors and MAKE30 in the extreme gradient boosting model. ALB: albumin; APACHE II: Acute Physiology and Chronic Health Evaluation II; AST: aspartate aminotransferase; BUN: blood urea nitrogen; SAPS II: Simplified Acute Physiology Score II; SCr: serum creatinine; SOFA: Sequential Organ Failure Assessment.

Table 2. Diagnostic test characteristics of the extreme gradient boosting model at the low- and high-risk cutoff points.

Sensitivity analysis

The XGBoost model achieved good prediction performance in different age groups, different types of hospitalization, and different severity of illness, different time to endpoint (Figure S4–S11). The AUROC in subgroup with time to endpoint ≤ 13 d was 0.947 (95% CI: 0.931–0.961) in the derivation cohort and 0.882 (95% CI: 0.842–0.916) in the test cohort, and the AUROC in subgroup with time to endpoint >13 d was 0.871 (95% CI:0.820–0.910) in the derivation cohort and 0.748 (95% CI:0.637–0.849). The Delong test showed p = .016, suggesting the XGBoost model performs better in early-on cohort patients than in late-on cohort patients. In addition, when we used upsampling or downsampling to balance the derived cohorts to train the model, the model still provided good predictive performance (Figure S12 and S13).

Discussion

This cohort study aimed to establish a prediction model that accurately predicts the risk of MAKE30 in critically ill elderly patients using XGBoost. The AUROC for the model was 0.930 (95% CI: 0.912–0.946) in the derivation cohort and 0.851 (95% CI: 0.810–0.890) in the test cohort, demonstrating its effectiveness. We attempted to analyze the interpretability of XGBoost model using SHAP, and the results were consistent with clinical practice and other studies. This novel machine-learning model has potential applications in clinical decision-making and risk stratification.

Combining artificial intelligence and the medical field has shown great promise, with numerous studies demonstrating its efficacy [Citation33–36]. For example, Huang et al. trained the XGBoost model to predict in-hospital mortality for elderly patients with ischemic stroke, achieving an AUROC of 0.994 in the derivation cohort and 0.733 in the test cohort [Citation37]. Similarly, the XGBoost model developed by Li et al. to predict in-hospital mortality for patients with heart failure reached an AUROC of 0.824 in the test cohort [Citation38]. Given the good performance of XGBoost in predicting clinical prognosis and the lack of machine learning models for the prediction of MAKE30, this study presents a novel and valuable XGBoost model for the occurrence of MAKE30 in critically ill elderly patients.

In this study, we utilized the SHAP method to enhance the interpretability of the XGBoost model and identified several key factors that contribute to the risk of MAKE 30 in elderly patients in the ICU. Our analysis identified low platelet count, high baseline estimated glomerular filtration rate, and SOFA score as high-risk factors for MAKE30, which have been previously implicated in the construction of prognostic models for adverse renal outcomes in other populations [Citation3,Citation8,Citation16,Citation17]. Our findings revealed that various demographic and physiological factors, including the APACHE II score, serum creatinine and BUN levels, and SOFA score, are positively associated with the risk of MAKE30. These results suggest that the physiological health status of patients at ICU admission can predict adverse renal outcomes within 30 d. Additionally, our analysis showed that low levels of ‘Arterial blood HCO3’ were linked to a high risk of MAKE30. This may be due to renal compensatory mechanisms for maintaining normal acid-base homeostasis, leading to further renal injury in patients. For example, up-regulating renal endocrine hormones such as angiotensin II, aldosterone, and endothelin-1 by kidney promote renal injury, inflammation, and fibrosis [Citation39,Citation40]. The presence of metabolic acidosis, as indicated by low HCO3- levels, may reflect tubular interstitial disease and poor renal function, leading to an increased risk of AKI [Citation41]. Previous retrospective cohort studies have confirmed this association, and the use of sodium bicarbonate infusion has been explored as a potential intervention to improve kidney and survival outcomes [Citation25,Citation42].

Furthermore, we found that a significant increase in non-hepatic AST was associated with mortality, which may reflect skeletal muscle damage, myocardial damage, or hematological diseases [Citation43]. Our analysis also showed that low levels of albumin related to a high risk of MAKE30. Albumin is a protein that is mainly produced by the liver and circulates in the blood plasma. It has various functions, such as maintaining fluid balance, transporting hormones and drugs, and modulating immune responses. Low albumin levels might indicate liver dysfunction, kidney dysfunction, inflammation, or infections, which are all potential risk factors for adverse kidney events [Citation14,Citation44,Citation45]. Therefore, albumin levels can reflect the overall health status of elderly ICU patients and their susceptibility to MAKE30. Albumin levels might also be used as a biomarker to monitor the response to interventions and the prognosis of elderly ICU patients with AKI. These findings emphasize the importance of considering various physiological factors in predicting adverse renal outcomes in critically ill elderly patients and may have implications for clinical decision-making and risk stratification.

The XGBoost model has achieved exceptional prediction performance and has won multiple machine learning competitions. This is due to its ability to provide a relatively transparent solution to the problem of nonlinear relationships, as well as its avoidance of black-box machine learning models [Citation18,Citation46]. The impact of sample imbalance can be assessed by changing the sampling method. In our study, we confirmed that the XGBoost model performed well in predicting the occurrence of MAKE30 in elderly ICU patients.

This cohort study establishes the efficacy of XGBoost in predicting MAKE30 in critically ill elderly patients, with robustness and applicability in different populations. The model employed variables that are readily available in clinical practice, enhancing its accessibility and affordability. Furthermore, the ranking of predictors can aid healthcare providers in identifying high-risk patients and optimizing medical management.

Nevertheless, this study has several limitations. Firstly, the data were retrospectively collected from a single center. Single-center studies often have the deficiencies of insufficient sample size and data bias. The model may not generalize well when extended to different clinical practice settings and different patient populations. External validation and prospective studies may help to test the accurate prediction ability of the model and accelerate the clinical universality of the model. Therefore, external validation and prospective validation are necessary in the future. Secondly, intravenous fluids were not included as a potential variable due to data collection challenges. However, previous studies suggest that high-risk patients should avoid a higher cumulative fluid balance and receive lactated Ringer solution instead of normal saline. Further studies are required to investigate the impact of intravenous fluids on MAKE30 [Citation6,Citation47]. Thirdly, our study did not exclude patients with antineutrophil cytoplasmic antibody (ANCA)-associated vasculitis or anti-glomerular basement membrane disease who might have different renal events incidence with other patients, and further studies are needed to compare the performance of our model in different subgroups of patients. Fourth, the model was established in patients staying in ICU ≥48 h during the hospital admission, and the feasibility for patients staying in ICU <48 h is unknown. Finally, we preliminarily explored the interpretability of the model using the SHAP algorithm, but this method applied the assumption that the features were independent of each other, and it only provides correlation, not causation, between features and predictions, and it might be affected by spurious correlations or omitted variables. More general methods are to be explored.

Conclusion

Our study successfully developed an XGBoost machine learning model that exhibited good performance in both the derivation and test groups. This model has the potential to provide valuable information for clinical decision-making in the management of critically ill elderly patients in the ICU.

Supplemental material

Supplemental Material

Download PDF (1.6 MB)

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

The author(s) reported there is no funding associated with the work featured in this article.

References

  • Becker S, Müller J, de Heer G, et al. Clinical characteristics and outcome of very elderly patients ≥90 years in intensive care: a retrospective observational study. Ann Intensive Care. 2015;5(1):1.
  • Roedl K, Daniels R, Theile P, et al. The independent impact of peripheral arterial disease on mortality in nonagenarians and centenarians who were treated in an intensive care unit: a consecutive cohort of 1108 patients. Eur J Vasc Endovasc Surg. 2023;65(4):582–9.
  • Xin Q, Xie T, Chen R, et al. Predictive nomogram model for major adverse kidney events within 30 days in sepsis patients with type 2 diabetes mellitus. Front Endocrinol. 2022;13:1024500.
  • Semler MW, Rice TW, Shaw AD, et al. Identification of major adverse kidney events within the electronic health record. J Med Syst. 2016;40(7):167.
  • McKown AC, Wang L, Wanderer JP, et al. Predicting major adverse kidney events among critically ill adults using the electronic health record. J Med Syst. 2017;41(10):156.
  • Semler MW, Wanderer JP, Ehrenfeld JM, et al. Balanced crystalloids versus saline in the intensive care unit. The SALT randomized trial. Am J Respir Crit Care Med. 2017;195(10):1362–1372.
  • Spiegel R, Teeter W, Sullivan S, et al. The use of venous doppler to predict adverse kidney events in a general ICU cohort. Crit Care. 2020;24(1):615.
  • Zhou D, Jiang J, Zhang J, et al. Increase in chloride is associated with major adverse kidney events in critically ill patients. Shock. 2023;59(3):338–343.
  • Semler MW, Self WH, Wanderer JP, et al. Balanced crystalloids versus saline in critically ill adults. N Engl J Med. 2018;378(9):829–839.
  • Boutin L, Legrand M, Sadoune M, et al. Elevated plasma galectin-3 is associated with major adverse kidney events and death after ICU admission. Crit Care. 2022;26(1):13.
  • Fuchs L, Chronaki CE, Park S, et al. ICU admission characteristics and mortality rates among elderly and very elderly patients. Intensive Care Med. 2012;38(10):1654–1661.
  • Ihra GC, Lehberger J, Hochrieser H, et al. Development of demographics and outcome of very old critically ill patients admitted to intensive care units. Intensive Care Med. 2012;38(4):620–626.
  • Flannery AH, Ortiz-Soriano V, Li X, et al. Serum renin and major adverse kidney events in critically ill patients: a multicenter prospective study. Crit Care. 2021;25(1):294.
  • Mele A, Cerminara E, Häbel H, et al. Fluid accumulation and major adverse kidney events in sepsis: a multicenter observational study. Ann Intensive Care. 2022;12(1):62.
  • Neyra JA, Ortiz-Soriano V, Liu LJ, et al. Prediction of mortality and major adverse kidney events in critically ill patients with acute kidney injury. Am J Kidney Dis. 2023;81(1):36–47.
  • Zhang X, Ye B, Mao W, et al. Major adverse kidney events within 30 days in patients with acute pancreatitis: a tertiary-center cohort study. HPB. 2022;24(2):169–175.
  • Vinclair C, De Montmollin E, Sonneville R, et al. Factors associated with major adverse kidney events in patients who underwent veno-arterial extracorporeal membrane oxygenation. Ann Intensive Care. 2020;10(1):44.
  • Venerito V, Emmi G, Cantarini L, et al. Validity of machine learning in predicting giant cell arteritis flare after glucocorticoids tapering. Front Immunol. 2022;13:860877.
  • Tai AMY, Albuquerque A, Carmona NE, et al. Machine learning and big data: implications for disease modeling and therapeutic discovery in psychiatry. Artif Intell Med. 2019;99:101704.
  • Luo XQ, Kang YX, Duan SB, et al. Machine learning-based prediction of acute kidney injury following pediatric cardiac surgery: model development and validation study. J Med Internet Res. 2023;25:e41142.
  • Abe D, Inaji M, Hase T, et al. A prehospital triage system to detect traumatic intracranial hemorrhage using machine learning algorithms. JAMA Netw Open. 2022;5(6):e2216393.
  • Lee SW, Lee HC, Suh J, et al. Multi-center validation of machine learning model for preoperative prediction of postoperative mortality. NPJ Digit Med. 2022;5(1):91.
  • Collins GS, Reitsma JB, Altman DG, et al. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. Ann Intern Med. 2015;162(1):55–63.
  • Brown RM, Wang L, Coston TD, et al. Balanced crystalloids versus saline in sepsis. A secondary analysis of the SMART clinical trial. Am J Respir Crit Care Med. 2019;200(12):1487–1495.
  • Bian Y, Xu T, Le Y, et al. The efficacy and safety of sodium bicarbonate ringer’s solution in critically ill patients: a retrospective cohort study. Front Pharmacol. 2022;13:829394.
  • Zhang H, Jiang L, Fu R, et al. Impact of levosimendan on efficacy and renal function in acute heart failure according to renal function: a perspective, multi-center, real-world registry. Front Cardiovasc Med. 2022;9:986039.
  • Eckardt KU, Kasiske BL. Kidney disease: improving global outcomes. Nat Rev Nephrol. 2009;5(11):650–657.
  • Stekhoven DJ, Bühlmann P. MissForest–non-parametric missing value imputation for mixed-type data. Bioinformatics. 2012;28(1):112–118.
  • Lundberg S, Lee SI. A unified approach to interpreting model predictions. arXiv. 2017.
  • Loh HW, Ooi CP, Seoni S, et al. Application of explainable artificial intelligence for healthcare: a systematic review of the last decade (2011–2022). Comput Methods Programs Biomed. 2022;226:107161.
  • Zou Y, Shi Y, Sun F, et al. Extreme gradient boosting model to assess risk of central cervical lymph node metastasis in patients with papillary thyroid carcinoma: individual prediction using SHapley additive exPlanations. Comput Methods Programs Biomed. 2022;225:107038.
  • Yuan Y, Wang X, Shi M, et al. Performance comparison of RGB and multispectral vegetation indices based on machine learning for estimating hopea hainanensis SPAD values under different shade conditions. Front Plant Sci. 2022;13:928953.
  • Esteva A, Kuprel B, Novoa RA, et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature. 2017;542(7639):115–118.
  • Liang H, Tsui BY, Ni H, et al. Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence. Nat Med. 2019;25(3):433–438.
  • Chilamkurthy S, Ghosh R, Tanamala S, et al. Deep learning algorithms for detection of critical findings in head CT scans: a retrospective study. Lancet. 2018;392(10162):2388–2396.
  • Al’Aref SJ, Maliakal G, Singh G, et al. Machine learning of clinical variables and coronary artery calcium scoring for the prediction of obstructive coronary artery disease on coronary computed tomography angiography: analysis from the CONFIRM registry. Eur Heart J. 2020;41(3):359–367.
  • Huang J, Jin W, Duan X, et al. Twenty-eight-day in-hospital mortality prediction for elderly patients with ischemic stroke in the intensive care unit: interpretable machine learning models. Front Public Health. 2022;10:1086339.
  • Li J, Liu S, Hu Y, et al. Predicting mortality in intensive care unit patients with heart failure using an interpretable machine learning model: retrospective cohort study. J Med Internet Res. 2022;24(8):e38082.
  • Raphael KL. Metabolic acidosis and subclinical metabolic acidosis in CKD. J Am Soc Nephrol. 2018;29(2):376–382.
  • Wesson DE, Buysse JM, Bushinsky DA. Mechanisms of metabolic acidosis-induced kidney injury in chronic kidney disease. J Am Soc Nephrol. 2020;31(3):469–482.
  • Zhu A, Whitlock RH, Ferguson TW, et al. Metabolic acidosis is associated with acute kidney injury in patients with CKD. Kidney Int Rep. 2022;7(10):2219–2229.
  • Huang S, Peng Y, Wang L, et al. Effectiveness of sodium bicarbonate infusion on mortality for elderly septic patients with acute metabolic acidosis. Front Pharmacol. 2022;13:974271.
  • Han JH, Kwak JY, Lee SS, et al. Markedly elevated aspartate aminotransferase from non-hepatic causes. JCM. 2022;12(1):310.
  • Sheng S, Zhang YH, Ma HK, et al. Albumin levels predict mortality in sepsis patients with acute kidney injury undergoing continuous renal replacement therapy: a secondary analysis based on a retrospective cohort study. BMC Nephrol. 2022;23(1):52.
  • Takegawa R, Kabata D, Shimizu K, et al. Serum albumin as a risk factor for death in patients with prolonged sepsis: an observational study. J Crit Care. 2019;51:139–144.
  • Teipel SJ, Grothe MJ, Metzger CD, et al. Robust detection of impaired resting state functional connectivity networks in Alzheimer’s disease using elastic net regularized regression. Front Aging Neurosci. 2016;8:318.
  • Bledsoe J, Peltan ID, Bunnell RJ, et al. Order substitutions and education for balanced crystalloid solution use in an integrated health care system and association with major adverse kidney events. JAMA Netw Open. 2022;5(5):e2210046.