135
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Enhancing mortality prediction after coronary artery bypass graft: a machine learning approach utilizing EuroScore

ORCID Icon
Article: FSO959 | Received 06 Aug 2023, Accepted 05 Jan 2024, Published online: 17 Jun 2024

Abstract

Aim: We developed a machine learning model using EuroScore assumptions and preoperative and intraoperative risk factors to predict mortality after coronary artery bypass graft (CABG). Materials & methods: We retrospectively examined data from 108 CABG patients at King Abdullah University Hospital, classifying them into risk groups via EuroScore and predicting mortality through random forest classification. Results: High-risk patients displayed longer surgical times and significant factors such as age and surgery choice. The median EuroScore was 0.95 (0.5–6.4). The model yielded high AUC scores (0.98, 0.95) indicating strong predictive accuracy. Conclusion: Our findings showed that the machine learning models combined with the EuroScore significantly improve post-CABG mortality prediction. For further validation, larger datasets are needed.

Plain language summary

Coronary artery bypass grafting (CABG) is a well-established procedure for ischemic heart disease, yet with a 2.0% mortality risk within 30 days, its frequency has declined. EuroScore aids in predicting CABG mortality, considering various risk factors. Postoperative complications like myocardial infarction and heart failure can be severe, and aspirin use post-CABG may reduce mortality for up to 4 years, particularly in multivessel heart disease. This research project explores EuroScore's role in CABG mortality analysis, emphasizing its significance in assessing cardiac surgery quality. Prolonged aortic clamp and bypass times correlate with higher morbidity, while low-risk patients benefit from more left internal mammary artery grafts for improved survival.

Summary points
  • Coronary artery bypass grafting (CABG) is an established procedure to improve clinical outcomes in patients with ischemic heart disease.

  • EuroScore was developed to enhance patient selection for cardiac surgeries by predicting mortality risk after CABG, considering patient characteristics and risk factors.

  • CABG is considered a high-risk procedure with mortality rates of up to 2.0% within 30 days, leading to a decline in the number of CABG surgeries in recent years.

  • Postoperative complications following CABG can lead to serious morbidity and mortality, with myocardial infarction, heart failure, and atrial fibrillation being significant complications.

  • Aspirin use after CABG has been shown to lower mortality rates for up to 4 years, and patients with multivessel heart disease may benefit from CABG as the preferred treatment option.

  • The goal of the research project is to use EuroSCORE to analyze mortality risk following CABG and explore potential risk factors.

  • EuroScore is of prognostic significance after primary CABG, and its accuracy is critical for assessing the quality of care in cardiac surgery.

  • Prolonged aortic cross-clamp and cardiopulmonary bypass time are linked to higher morbidity and mortality rates after CABG.

  • The number of left internal mammary artery (LIMA) grafts is significantly higher in low-risk patients, providing a survival advantage.

Serious morbidity and mortality can result from postoperative complications following Coronary artery bypass grafting (CABG) surgery. With a mortality rate of up to 2.0% within 30 days, it is regarded as a high-risk procedure. Although over 200,000 CABG surgeries are still carried out in the USA every year, the number of CABGs has been steadily declining over the past 10 years [Citation1,Citation2].

CABG stands out as the most strongly supported method for enhancing outcomes of myocardial damage and neurocognitive deficit in individuals with ischemic heart disease [Citation3]. It has been carried out, for more than 50 years and has improved considerably throughout that time [Citation4]. The European System for Cardiac Operative Risk Evaluation (EuroScore) was created to enhance the patient selection process for cardiac surgeries as it predicts the mortality risk following the procedure [Citation5]. Using the EuroScore, patient characteristics and risk factors related to cardiac and surgical procedures are gathered and weighted separately [Citation6].

Myocardial infarction, heart failure, and atrial fibrillation were found to be the three distinct complications following CABG in a previous study [Citation3]. According to current UK guidelines, patients who underwent CABG following a myocardial infarction should take aspirin for the rest of their lives as aspirin use after surgery has been proven to lower mortality rates for up to 4 years following CABG. Moreover, patients should also take another antiplatelet medication, such as clopidogrel, for up to 12 months [Citation7]. When a patient has multivessel heart disease, CABG is the preferred course of treatment, and there is a relative extent of graft failure that raises mortality rates [Citation7].

Consequently, intensive research is required to evaluate the patient characteristics and potential risk factors that predispose to mortality following the operation. The goal of our research project is to use the EuroScore model to carry out a descriptive statistical analysis to assess the mortality risk following CABG and to further explore the potential risk factors.

Materials & methods

Study design

This is a unicentric, retrospective study from medical records of the King Abdullah University Hospital (KAUH), the main hospital in north Jordan from January 2018 to December 2020. Exclusion criteria consisted of any patients with previous cardiac surgeries. The Institutional Review Board (IRB) at our hospital institute and university approved the study. The study was performed in accordance with the principles of the Declaration of Helsinki, and informed consent was waived by the IRB committee due to the retrospective nature of the study.

Demographic data were collected for age, sex, and body mass index (BMI), clinical data for the history of hypertension (HTN), diabetes mellitus (DM), smoking, previous myocardial infarction (MI), respiratory illness, history of cerebrovascular accidents (CVA), surgical variables including type of surgery, choice of surgery, time of anesthesia, surgery, aortic cross-clamp (ACC), cardiopulmonary bypass (CPB), and post-operative variables of complications, reopening and length of stay.

Evaluation of mortality risk after CABG surgery

The European system for cardiac operative risk evaluation (EuroScore) is a risk model to calculate the mortality risk after cardiac surgery. The EuroScore database was developed based on the data of >14,000 patients. The risk factors used to develop the EuroScore model have been described elsewhere. Patients were considered with a low-risk of mortality if the EuroScore was <3, and medium-high risk if the EuroScore was >3.

Statistical analysis

Continuous variables were described using mean ± standard deviation (SD) if the data were normally distributed according to the Shapiro-Wilk test, or median (Range) if the data deviated from normality. Categorical variables were described using frequencies (percentages). To analyze the correlation between demographic, clinical, and operative variables with mortality risk groups, Wilcoxon (Mann–Whitney U) test was used for continuous variables, chi-squared χ2, and the fisher-exact test was used for categorical variables if the category count was <5. A significant difference was considered with a p-value <0.05.

A machine learning random forest classification (RFC) ensemble was built to predict the mortality risk group (high-risk vs low-risk). The RFC model, hereafter referred to as RFC29, was trained on demographic, clinical, intra-, and post-operative variables including gender, age, history of DM, history of HTN, history of smoking, BMI, previous MI, respiratory illness, history of CVA, choice of surgery (urgent vs elective), type of surgery (CABG, CABG + valve, valve, or more than one valve), CPB time, aortic cross-clamp (ACC) time, anesthesia time, surgery time, pleural drain, inotrope support, complications, hyperthermia, time of mechanical ventilation (TOMV), site reopens, reintubation, length of intensive care unit stay (LOICUS), length of hospital stay (LOH), intra-aortic balloon pump (IABP), left ventricular (LV) impairment, internal mammary artery (IMA) harvest, and bleeding amount. Then, patients' data were randomly split into 80:20 training-testing sets. Model's performance was evaluated on the testing set using a mean bootstrap estimate with 95% CI, tenfold cross-validation (CV), classification report for precision and recall, and area under the receiver operating characteristics curve (AUC/ROC). The relative contribution of each feature was calculated using the permutation importance approach, by calculating the decrease in the model's score when shuffling the features. All machine learning analysis has been conducted using the scikit-learn package from Python version 3.10.8.

Results

Patients' characteristics

A total of 108 post-CABG surgery were enrolled in this study. Of which, 92 (85.2%) were males and with a median (IQR) age of 58.5 (52–80). Eight (7.4%) patients had also valve surgery of which three patients with the aortic valve involved, five patients had mitral valve replacement. Four (3.7%) had more than one valve involved. Only seven (6.5%) of patients did an urgent CABG surgery. Five patients had both CABG and valve surgery, two of which had aortic valve replacement, and three patients had mitral valve replacement. Most patients had a history of HTN (73.1%) and DM (54.6%) while no patient was presented with a CVA. Based on the EuroScore, 98 (90.7%) patients were grouped as low-risk and ten (9.3%) patients as high-risk group. Only two patients in the high-risk group were females, with a significantly higher median age compared with the low-risk group (p < 0.01) as shown in . None of the high-risk group had a history of respiratory illnesses, while 30 (30.6%) patients of the low-risk group presented with a history of respiratory disease as shown in . Anesthesia, ACC, surgery and CPB time showed a significant trend toward a positive correlation with the EURO score (p < 0.01, R >3) in addition to LOICUS and TOMV as shown in .

Figure 1. A comparison between high-risk and low-risk groups.

(A) A bar plot for the frequency of surgery choice between risk group. (B) A box plot showing a significant difference in age between risk groups.

Figure 1. A comparison between high-risk and low-risk groups.(A) A bar plot for the frequency of surgery choice between risk group. (B) A box plot showing a significant difference in age between risk groups.

Table 1. Demographic and clinical characteristics of patients included in the study.

Figure 2. Scatter plots for the significant correlations between intra- and post-operative variables and the EuroScore.

Figure 2. Scatter plots for the significant correlations between intra- and post-operative variables and the EuroScore.

Intra- & post-operative characteristics

For the intra-operative variables, a total of 62 (57.4%) patients had left IMA harvest, out of which only three patients from the high-risk group underwent LIMA harvest, while 59 patients from the low-risk group underwent LIMA harvest (p < 0.001) as shown in D. Only two patients had intra-aortic balloon pumps. High-risk group patients showed to be associated with an overall longer time for anesthesia, surgery, CPB and ACC, with a significantly higher anesthesia and surgery time compared with the low-risk group (median: 270 vs 240, p = 0.01; 240 vs 210, p < 0.01, respectively) and a partial significance in CPB time (median: 99 vs 81, p = 0.05) as shown in A–C. Detailed intra-operative characteristics are shown in . For the post-operative characteristics, seven patients passed which were all from the high-risk group (70%), one patient from the low-risk group showed psychosis and one patient had a cerebrovascular accident. Most of the high-risk group (n = 9, 90%) had left ventricular (LV) impairment and were on inotrope support of dobutamine, dopamine and adrenaline. The median length of ICU stay (LOICUS) was significantly higher in the high-risk group (3 vs 1, p = 0.04) as shown in .

Figure 3. Boxplots for significant correlated intra-operative variables with risk groups.

(A) High-risk group showed a significantly higher anesthesia time than low-risk group (p = 0.01). (B) High-risk group also showed a significantly higher surgery time (p < 0.01). (C) Partial significance in CPB time in which the high-risk group showed longer CPB time. (D) Bar graph showing the frequency of IMA harvesting between risk group where most of the low-risk group had LIMA harvest.

Figure 3. Boxplots for significant correlated intra-operative variables with risk groups.(A) High-risk group showed a significantly higher anesthesia time than low-risk group (p = 0.01). (B) High-risk group also showed a significantly higher surgery time (p < 0.01). (C) Partial significance in CPB time in which the high-risk group showed longer CPB time. (D) Bar graph showing the frequency of IMA harvesting between risk group where most of the low-risk group had LIMA harvest.

Table 2. Intra-operative variables between risk groups.

Table 3. Post-operative characteristics.

Predictive ML model of mortality risk

Two supervised ML ensembles were trained on 80% (n = 86) of the data and internally validated on 20% (n = 22). The RFC29 model which included 29 demographic, clinical, intra-, and post-operative features, predicted mortality risk with a high mean bootstrap estimate of 0.92 and 95% CI: [0.79–1.00], tenfold CV of 0.91 and an AUC of 0.95 as shown in A. Permutation importance showed that post-operative complications, AAC time, age, CPB time, and hyperthermia were the highest contributing features as shown in B. To test the predictive role of the top contributing features, the RFC8 model was trained on the top eight features with a mean decrease impurity higher than 0.06. The RFC8 model performed on the testing set with an inferior mean bootstrap estimate of 0.83 and 95% CI: [0.62–1.00], a superior AUC of 0.98, and a tenfold cross-validation of 0.96 as shown in .

Figure 4. Machine learning models performance.

(A) Area under the Receiver Operating Characteristic curve (AUC/ROC) for RFC29 and RFC8 models. The RFC8 model showed better performance and higher AUC. (B) Permutation importance based on the decrease in model's score when the feature is shuffled. Higher values indicate higher importance and contribution to model's prediction. Post-operative complications, aortic cross clamp time, age and cardiopulmonary bypass time showed higher contribution.

Figure 4. Machine learning models performance.(A) Area under the Receiver Operating Characteristic curve (AUC/ROC) for RFC29 and RFC8 models. The RFC8 model showed better performance and higher AUC. (B) Permutation importance based on the decrease in model's score when the feature is shuffled. Higher values indicate higher importance and contribution to model's prediction. Post-operative complications, aortic cross clamp time, age and cardiopulmonary bypass time showed higher contribution.

Table 4. Evaluation metrics for machine learning models.

Discussion

EuroScore is of prognostic significance after primary CABG. Its importance is limited to the lowest and the highest risk patients. Between 2011 and 2013, an analysis of the Society of Thoracic Surgeons (STS) Database revealed an operative mortality rate of 1.9% in isolated CABG patients. Compared with that, the operative mortality rate according to the New York Cardiac Surgery Reporting System in 2011 was 1.24% [Citation8].

One of the biggest, most thorough and most accurate databases in European cardiac surgery history serves as the foundation for EuroScore, a prognostic system for evaluating heart surgery [Citation9]. Operative or hospital mortality is widely recognized as an indication of the quality level of care in cardiac surgery [Citation9,Citation10]. It is critical that EuroScore be reliable because operative mortality must be related to the risk levels of the patients undergoing surgery [Citation9]. Since there were no dead patients in the low-risk group, the EuroScore estimate of the high-risk group mortality was matched by the fact that seven out of the ten patients in the high-risk group who underwent CABG in our institution died. In a cohort study that was published in 2009, 30-day mortality was 0.8%, whereas EuroScore predicted that it would be 3.6% after CABG. Additionally, they noted that Norway's 30-day mortality rate following open heart surgery was lower than EuroScore prediction [Citation10].

Prolonged aortic cross-clamp (ACC) and cardiopulmonary bypass time (CPBT) are linked to higher rates of morbidity and mortality, possibly as a result of the myocardial damage and inflammatory response brought on by cardioplegic cardiac arrest and cardiopulmonary bypass [Citation11–13]. In our cohort, significant positive correlations between the EuroScore and anesthesia, ACC, surgery, and CPB time were observed. Patients in the high-risk group illustrated an overall longer time for ACC and CPB as well as longer anesthesia and surgery times when compared with patients in the low-risk group. However, only anesthesia and surgery times were statistically significant. Al-Sarraf et al. showed prolonged cross-clamp time significantly correlates with major postoperative morbidity and mortality in both low-and high-risk patients [Citation14].

Our study showed that the number of LIMA harvests was significantly higher in the low-risk group. Which is a 15-year survival advantage was detected in patients who had received LIMA to the left anterior descending artery (LAD), and arterial grafting of the non-left anterior descending vessels compared with patients who had received venous grafting [Citation15].

Post-operative left ventricular impairment, inotrope requirement, and length of stay in the ICU are higher in the high-risk group than in the low-risk group.

In 2012, the EuroScore I model underwent an update that resulted in the EuroScore II [Citation16], which includes more precise and comprehensive patient, cardiac and operation-related factors, and a wider range of surgeries. It's been demonstrated to have higher predictive discrimination for intraoperative mortality [Citation17]. Thus, a potential limitation of our study is that the data was gathered before the publication of EuroScore II. Other drawbacks of the study include its retrospective nature, which makes it susceptible to selection bias, and its small sample size. The generalizability of our findings may also be impacted by the fact that our study was restricted to a single institution and used a small sample size. The EuroScore II algorithm seems to possess greater complexity compared with its predecessors, despite having almost the same core set of risk factors. Some definitions have become more precise in this version. For instance, the symptomatic status now incorporates the NYHA class and the CCS Class 4, and the outdated category of unstable angina has been eliminated. Renal impairment is now classified considering creatinine clearance, and a more detailed categorization has been applied to define ejection fraction, pulmonary artery systolic pressure, and urgency [Citation18,Citation19].

The operation's mortality rate has a significant effect on society [Citation10]. The benefit of our study is that it assesses the mortality rates for patients who underwent CABG at the leading tertiary hospital in northern Jordan. We implemented a machine learning model that uses algorithms to boost the prediction process on its own, making it simpler and more reliable [Citation20]. Our project is the first machine learning project at our institution. We examined a variety of demographic, clinical, and intraoperative features when making our prediction, including ACC time, CPB time, hyperthermia, surgery type, and anesthesia type. Of these, ACC time, CPB time, and hyperthermia were particularly important as they influenced EuroScore.

Greater benefits for clinical management of CABG may arise from machine learning models used on larger datasets. Furthermore, to ensure a better prognosis following CABG, more research on the intraoperative and preoperative predictors of mortality should be conducted widely.

We validated the machine learning model internally; however, further external validation from larger datasets could help identify other factors associated with EuroScore prediction.

Conclusion

EuroScore is a useful tool for determining the quality of CABG and is a reliable indicator of the mortality rate after the procedure. According to our study findings, the use of machine learning models could effectively result in significant improvements in the ability to predict the rate of mortality following CABG based on the intraoperative conditions that could be improved to obtain a better prognosis after the surgery.

Author contributions

E Hijazi has worked solely throughout all stages of the research, including the conception, design, and implementation of the study. They have jointly collected and analyzed data and interpreted the results.

Financial disclosure

The author has no financial involvement with any organization or entity with a financial interest in or financial conflict with the subject matter or materials discussed in the manuscript. This includes employment, consultancies, honoraria, stock ownership or options, expert testimony, grants or patents received or pending, or royalties.

Writing disclosure

No writing assistance was utilized in the production of this manuscript.

Ethical conduct of research

This study was approved by the Institutional Review Board (IRB) at King Abdullah University Hospital (KAUH) and informed consent was waived by the IRB committee due to the retrospective nature of the study.

Competing interests disclosure

The author has no competing interests or relevant affiliations with any organization or entity with the subject matter or materials discussed in the manuscript. This includes employment, consultancies, honoraria, stock ownership or options, expert testimony, grants or patents received or pending, or royalties.

References

  • Montrief T, Koyfman A, Long B. Coronary artery bypass graft surgery complications: a review for emergency clinicians. Am. J. Emerg. Med. 36(12), 2289–2297 (2018).
  • Duggan JP, Peters AS, Trachiotis GD, Antevil JL. Epidemiology of Coronary Artery Disease. Surg. Clin. North Am. 102(3), 499–516 (2022).
  • Siribaddana S. Cardiac dysfunction in the CABG patient. Curr Opin Pharmacol. 12(2), 166–171 (2012).
  • Dimeling G, Bakaeen L, Khatri J, Bakaeen FG. CABG: when, why, and how? Cleve. Clin. J. Med. 88(5), 295–303 (2021).
  • Silverborn M, Nielsen S, Karlsson M. The performance of EuroSCORE II in CABG patients in relation to sex, age, and surgical risk: a nationwide study in 14,118 patients. J. Cardiothorac. Surg. 18(1), 40 (2023).
  • Bauer A, Korten I, Juchem G, Kiesewetter I, Kilger E, Heyn J. EuroScore and IL-6 predict the course in ICU after cardiac surgery. Eur. J. Med. Res. 26(1), 29 (2021).
  • Sembi N, Cheng T, Ravindran W, Ulucay E, Ahmed A, Harky A. Anticoagulation and antiplatelet therapy post coronary artery bypass surgery. J. Card. Surg. 36(3), 1091–1099 (2021).
  • Kimmaliardjuk DM, Toeg H, GlineurD, SohmerB, Ruel M. Operative mortality with coronary artery bypass graft: where do we stand in 2015? Curr. Opin. Cardiol. 30(6), 611–618 (2015).
  • European system for cardiac operative risk evaluation (EuroSCORE).
  • Survival after CABG--better than predicted by EuroSCORE and equal to the general population.
  • Nissinen J, Biancari F, Wistbacka JO et al. Safe time limits of aortic cross-clamping and cardiopulmonary bypass in adult cardiac surgery. Perfusion. 24(5), 297–305 (2009).
  • Salis S, Mazzanti VV, Merli G et al. Cardiopulmonary bypass duration is an independent predictor of morbidity and mortality after cardiac surgery. J Cardiothorac Vasc Anesth. 22(6), 814–822 (2008).
  • Iino K, Miyata H, Motomura N et al. Prolonged cross-clamping during aortic valve replacement is an independent predictor of postoperative morbidity and mortality: analysis of the japan cardiovascular surgery database. Ann. Thorac. Surg. 103(2), 602–609 (2017).
  • Al-Sarraf N, Thalib L, Hughes A et al. Cross-clamp time is an independent predictor of mortality and morbidity in low- and high-risk cardiac patients. Int J Surg. 9(1), 104–109 (2011).
  • Locker C, Schaff HV, Dearani JA et al. Multiple arterial grafts improve late survival of patients undergoing coronary artery bypass graft surgery: analysis of 8622 patients with multivessel disease. Circulation 126(9), 1023–1030 (2012).
  • Nashef SAM, Roques F, Sharples LD et al. EuroSCORE II. Eur. J. Cardiothorac. Surg. 41(4), 734–745 (2012).
  • Ad N, Holmes SD, Patel J, Pritchard G, Shuman DJ, Halpin L. Comparison of EuroSCORE II, Original EuroSCORE, and The Society of Thoracic Surgeons Risk Score in Cardiac Surgery Patients. Ann. Thorac. Surg. 102(2), 573–579 (2016).
  • Barili F, Pacini D, Capo A et al. Does EuroSCORE II perform better than its original versions? A multicentre validation study. Eur. Heart J. 34(1), 22–29 (2013).
  • Nashef SAM, Roques F, Sharples LD et al. EuroSCORE II. Eur. J. Cardiothorac. Surg. 41(4), 734–745 (2012).
  • Fan Y, Dong J, Wu Y et al. Development of machine learning models for mortality risk prediction after cardiac surgery. Cardiovasc Diagn Ther. 12(1), 12–23 (2022).