Skip to main content

Personalized prediction of mortality in patients with acute ischemic stroke using explainable artificial intelligence

Abstract

Background

Research into the acute kidney disease (AKD) after acute ischemic stroke (AIS) is rare, and how clinical features influence its prognosis remain unknown. We aim to employ interpretable machine learning (ML) models to study AIS and clarify its decision-making process in identifying the risk of mortality.

Methods

We conducted a retrospective cohort study involving AIS patients from January 2020 to June 2021. Patient data were randomly divided into training and test sets. Eight ML algorithms were employed to construct predictive models for mortality. The performance of the best model was evaluated using various metrics. Furthermore, we created an artificial intelligence (AI)-driven web application that leveraged the top ten most crucial features for mortality prediction.

Results

The study cohort consisted of 1633 AIS patients, among whom 257 (15.74%) developed subacute AKD, 173 (10.59%) experienced AKI recovery, and 65 (3.98%) met criteria for both AKI and AKD. The mortality rate stood at 4.84%. The LightGBM model displayed superior performance, boasting an AUROC of 0.96 for mortality prediction. The top five features linked to mortality were ACEI/ARE, renal function trajectories, neutrophil count, diuretics, and serum creatinine. Moreover, we designed a web application using the LightGBM model to estimate mortality risk.

Conclusions

Complete renal function trajectories, including AKI and AKD, are vital for fitting mortality in AIS patients. An interpretable ML model effectively clarified its decision-making process for identifying AIS patients at risk of mortality. The AI-driven web application has the potential to contribute to the development of personalized early mortality prevention.

Keypoints

  • What was known: Acute kidney disease (AKD) defines patients with acute kidney injury (AKI) or subacute loss of kidney function lasting for more than 7 days, which links well AKI to subsequent chronic kidney disease (CKD). Little is known about the risk and prognosis of AKD in acute ischemic stroke (AIS) patients.

  • This study adds: Renal function trajectories, including both AKI and AKD, play a crucial role in predicting mortality in AIS patients. The LightGBM model elucidates decision processes by providing explanations at both global and local levels. The AI web application aids in reducing mortality rates and helps physicians make informed treatment decisions.

  • Potential impact: Adding AKD as a definition for renal failure lasting > 7 days up to 90 days is of clinical importance in addition to the existing definitions for AKI and CKD. Research activities and clinical practice should also focus on AKD, which is far more accurate to predict prognosis especially mortality.

Background

The global impact of stroke is substantial, ranking second in mortality and third in disability, with an estimated annual cost exceeding US$891 billion worldwide [1, 2]. Notably, ischemic strokes constituted over 60% of all stroke events [3]. Renal impairment is a critical adverse complication in AIS patients, often induced by factors such as mechanical thrombectomy, which increases the risk of mortality [4,5,6]. Existing research has primarily focused on AKI and CKD, with a scarcity of reports addressing the renal function trajectory during the 7–90 days following kidney injury [7, 8].

AKI and CKD do not represent distinct clinical syndromes but rather frequently present as a disease continuum [9]. No consensus exists for defining criteria to evaluate kidney recovery after AKI [10]. The 2012 Kidney Disease Improving Global Outcomes (KDIGO) guideline first introduced the term ‘Acute Kidney Diseases and Disorders’, defining it as abnormalities in kidney function and/or structure lasting less than 3 months, which includes AKI [11]. The 2017 Acute Disease Quality Initiative (ADQI) workgroup defines acute kidney disease (AKD) as acute or subacute damage and/or loss of kidney function persisting for 7 to 90 days following an AKI-triggering event [12]. Although the diagnostic criteria for AKD differ between the two guidelines, both stress the importance of considering AKD as a condition of equal significance to AKI.

Artificial intelligence (AI) is at the forefront of digital medicine [13]. Machine learning (ML), a fundamental branch of AI, excels in deciphering complex nonlinear associations among multidimensional features [14]. It has been extensively applied in the realm of healthcare, spanning areas such as medical diagnostics and the prediction of disease risks [15, 16]. Numerous studies employ ML models to predict mortality risk in patients with conditions such as heart failure, surgical interventions, and sepsis [17,18,19]. These studies predominantly utilize decision tree-based algorithms, which handle nonlinear features more effectively and mitigate overfitting compared to traditional regression models. In addition, ML significantly enhances outcome interpretability by elucidating influential variables, complex internal operations, and learned decision-making paths. SHapley Additive exPlanations (SHAP), a prominent interpretive method, quantify the marginal contribution of each feature upon integration into a ‘black-box’ model, providing explanations at both global and local levels [20, 21]. Its strength lies in precisely measuring the impact’s degree and direction that each feature exerts on the model’s output. In assessing mortality risk for AIS patients, research primarily focuses on those in intensive care unit (ICU) [22, 23], which creates a gap in prognostic evaluations for non-ICU AIS patients. Studies involving non-ICU AIS patients face challenges related to imbalanced data distribution, with a mortality rate of less than 5%, and this imbalance remains unaddressed [24]. Importantly, there is a dearth of research dedicated to predicting the impact of AKD on the mortality of AIS patients.

Hence, this study aimed to achieve the following objectives: (1) evaluate the incidence of AKI, AKD, and mortality among AIS patients; (2) assess mortality risk using various ML algorithms and identify the most optimal model; (3) utilize SHAP analysis to elucidate the contributions of individual features to the outcome and unveil the underlying decision-making process; (4) compare the predictive capabilities of using AKD independently or in combination with AKI for predicting mortality; (5) develop a user-friendly online prediction tool for estimating the probability of mortality in AIS patients.

Materials and methods

Study design

This retrospective cohort study involved 1633 patients diagnosed with AIS between January 2020 and June 2021. All patients were randomly assigned to a test set comprising 15% of samples not seen during model development; this set was used to assess the final model’s performance. An 85% sample subset was designated as the training set for model building. During the training phase, we employed a grid search with tenfold cross-validation to fine-tune model hyperparameters and prevent overfitting [25].

Patients diagnosed with AIS were included according to the International Classification of Diseases version 10 (ICD-10). Individuals meeting any of the following criteria were excluded: (1) age < 18 years; (2) hospitalization duration < 24 h; (3) hospital-acquired or traumatic brain injury with concurrent stroke, or comorbid intracranial tumor, transient ischemic attack, or other intracranial disorders; (4) concurrent Stage 5 CKD, undergoing renal replacement therapy, or having undergone kidney transplant; and (5) patients with incomplete data recording.

Data collection

Clinical information was extracted using natural language processing and parsing methods applied to structured data within the electronic health record. Data pertaining to demographic characteristics, medical history, and comorbidities were collected upon admission. Medication records were compiled during hospitalization, with particular attention to instances where these medications were administered before the onset of kidney injury. Comprehensive blood counts, coagulation markers, blood chemistry analyses, and urine tests were conducted within 1 week of admission. Initially, we included 104 readily available features based on expert clinical opinions and literature reviews. Following the removal of features with a missing proportion greater than 15%, we retained 86 features for building the prediction models.

Outcome definitions

The study investigated AKI and AKD as short-term outcomes, and mortality as a long-term outcome. AKI was defined in accordance with the 2012 KDIGO criteria, signifying either a rise in serum creatinine (Scr) greater than 0.3 mg/dL from baseline within 48 h or an increase to 1.5 times the baseline value within 7 days [11]. As stipulated by the 2017 ADQI guidelines, AKD was characterized by the acute or subacute impairment and/or loss of kidney function occurring within 7 to 90 days following an AKI event [12]. Based on the diagnostic criteria for AKI and AKD, patients exhibited three distinct renal function trajectories following kidney injury: (1) AKI recovery, indicating that Scr returned to baseline value within 7 days; (2) subacute AKD, denoting a slow increase in Scr levels lasting more than 7 days (AKD without AKI); and (3) AKD with AKI, representing the persistence of stage 1 or greater AKI for ≥ 7 days after an AKI initiating event (AKI progressing to AKD). The final classification encompassed four categories: (1) no kidney disease (NKD), (2) AKI recovery, (3) subacute AKD, and (4) AKD with AKI. Mortality was defined by the vital status for survival or death at the last follow-up. Clinical features, incorporating renal function trajectories, were incorporated to develop a risk prediction model, with mortality as the binary endpoint, to evaluate mortality risk in AIS patients.

The baseline Scr level was defined as the initial Scr measurement obtained upon hospital admission. The timing of AKI and AKD diagnosis was determined when patients initially met the respective diagnostic criteria. Each patient underwent a minimum of three Scr tests, which included two tests during their hospitalization and one at their first follow-up appointment. If elevated Scr levels did not return to baseline, additional tests were performed weekly during hospitalization or at the subsequent follow-up. The estimated glomerular filtration rate (eGFR) was calculated using the Chronic Kidney Disease Epidemiology Collaboration (CKD-EPI) creatinine formula [26].

Model development and interpretation

Data were trained on the following eight ML models: (1) light gradient boosting machine (LightGBM), (2) GBM, (3) random forest (RF), (4) K-nearest neighbors (KNN), (5) multi-layer perceptron (MLP), (6) naive Bayes (NB), (7) support vector machine (SVM), and (8) logistic regression (LR). LightGBM and GBM are gradient-based learning frameworks that employ decision trees and boosting. LightGBM, in comparison to GBM, shortens training times and reduces memory usage by partitioning data using histograms [27]. RF constructs individual decision trees using random subsets of the training data and combines their results through majority voting for classification [28]. KNN is a frequently used supervised learning algorithm that conducts classification or regression based on feature similarity among neighboring data points [29]. MLP relies on the stacking of multiple layers of neurons, employing layer-wise propagation and nonlinear activation functions to learn and represent intricate data relationships [30]. NB is rooted in Bayes’ theorem and performs classification by calculating the posterior probabilities of different categories under given feature conditions [31]. SVM is a supervised learning algorithm that makes predictions by identifying the optimal separating hyperplane [32]. LR is a linear model that predicts probabilities based on the logistic function [33]. All models using the same dataset and applying consistent imputation and scaling techniques.

SHAP was used to interpret the results of the top-performing model. Features with positive SHAP values enhance the output, with larger numerical values indicating more significant contributions [34]. SHAP summary plots offer visualizations of essential feature rankings and the overarching relationships and directions concerning features and outcomes. SHAP force and decision plots offer an intuitive visualization of how distinct features influence an individual prediction.

Data balancing

In our study, there exists an imbalance, as the mortality rate is approximately 5%. To address this imbalance, we utilized a weight rebalancing technique to adjust the weights of both the majority and minority classes [35]. Solely the training dataset underwent balancing. The test datasets remained unaltered to evaluate model performance using representative data. The scikit-learn Python library includes a built-in parameter called “class weight” or “weights” for LR, RF, LightGBM, SVM, and KNN. The model automatically assigns a weight to each class that is inversely proportional to its frequency. The balanced weight for each class is calculated using the equation: Class weight = total number of samples/(number of classes × class sample size). The class weight for mortality was 10.34, while the class weight for non-mortality was 0.53 when the “balanced” option was used. In the case of the NB classifier, we established a prior probability of 0.5 for each class to achieve group balance. In future work, we plan to adjust class weights in the MLP classifier by modifying the loss function’s weights.

AI-driven web application

A web-based calculator for predicting mortality among AIS patients was developed using the “Streamlit” application (https://share.streamlit.io/) to implement the optimal model [36]. To enhance the user-friendliness of the web calculator, this study introduced two panels: one for inputting model parameters and obtaining mortality probability, and another for providing a model introduction.

Statistical analysis

Features with missing values exceeding 15% were omitted from the dataset. Multiple imputation techniques were then applied to estimate the missing data. Utilizing LR to compute the required sample size with mortality as the outcome, we ascertained that a minimum of 801 patients is essential to achieve a statistical power of 90% for the detection of an effect size of 0.10 at a two-sided significance level (α) of 0.05. Normally distributed continuous features are reported as the median ± standard deviation (SD) and were compared using independent t test. For non-normally distributed features, we present them as the median (interquartile range) and utilized the Mann–Whitney U test for comparisons. Categorical features were characterized in terms of percentages and underwent comparison through the Pearson’s Chi-squared test. We evaluated the models’ predictive performance using a variety of metrics, including the area under the receiver operating characteristic curve (AUROC), precision, recall, accuracy, F1 score, Brier score loss (BSL), Matthew’s correlation coefficient, and decision curve analysis (DCA). The AUROC and F1 score were utilized to identify the optimal model. A significance level of less than 0.05 (two-tailed) was utilized. Our analysis was conducted using the Python programming language (Python Software Foundation, version 3.9.13) within the integrated development environment Visual Studio Code 1.81.1.

Results

Study cohort

A retrospective review of medical records was conducted for 1876 AIS patients from January 2020 to June 2021, with 1633 were eligible for further analysis (Fig. 1). Table 1 presents the baseline characteristics of the study population, and Table S1 stratifies the same cohort based on mortality. The incidence rates of AKI, AKD, and mortality were 14.57% (238/1633), 19.72% (322/1633), and 4.84% (79/1633), respectively. From the perspective of renal function trajectories, a total of 495 patients (30.31%) developed acute/subacute kidney dysfunction (meeting AKI and/or AKD criteria), comprising 257 patients (15.74%) with subacute AKD, 173 patients (10.59%) who experienced recovery from AKI, and 65 patients (3.98%) meeting both AKI and AKD criteria. Increased mortality rates were noted in elderly individuals (mean age: 73 vs. 68 years), those experiencing fever (15.19% vs. 8.04%), and patients with AKD coupled with AKI (31.65% vs. 13.92% in subacute AKD, 25.32% in AKI recovery, and 29.11% in NKD patients).

Fig. 1
figure 1

Architectural diagram of study

Table 1 Baseline characteristics of inpatients [mean ± SD; n (%)]

Model performance

A comprehensive set of 86 features served as predictors for mortality and were integrated into the ML models. Among all ML models, the LightGBM model displayed the best performance, with an AUROC of 0.96 and an F1 score of 0.47 (Fig. S1, Table S1, and Table S2). After data balancing, the model showed no significant difference in AUROC and accuracy, but it achieved a better balance between precision and recall (Table 2 and Table S3). When the model incorporated only the top 10 features, the AUROC remained high at 0.93, while maintaining a balance between precision and recall. Consequently, the LightGBM model was utilized in later stages for result interpretation and the development of an AI-driven web application. DCA revealed that the LightGBM model possessed high clinical utility (Fig. S2). Additional information concerning various performance metrics, such as accuracy, BSL, and Matthews correlation coefficient, is available in Table 2 and Table S2.

Table 2 Performance of LightGBM model for predicting mortality*

SHAP interpreter for the model

Figure 2A, B illustrates the SHAP summary plot of the LightGBM model. The top five features associated with mortality were ACEI/ARE, renal function trajectories (including AKI recovery, subacute AKD, and AKD with AKI), neutrophil count, diuretics use, and Scr. Substituting “AKD grade” for “renal function trajectories” in predicting the risk of mortality resulted in a decrease in the model’s AUROC to 0.92, which was lower than the predictive model constructed by combining AKI and AKD. Furthermore, the importance ranking of “AKD grade” falls outside the top 15 and is not a primary feature for predicting mortality (Fig. S3).

Fig. 2
figure 2

The SHAP summary plots for LightGBM models and force plots for two representative patients. A The ranking of feature importance within the mortality prediction model. Features with higher mean absolute SHAP values signify increased predictive influence. B Each dot represents the SHAP value of a specific feature for an individual, with red and blue indicating high and low feature values, respectively. On the x-axis, a positive or negative SHAP value signifies that the feature positively or negatively influenced the AKD prediction for the individual. C provides a personalized explanation for a case with a mortality probability below 10% and an actual outcome of survival. Features are ranked from the center to both ends based on the extent of their impact. The impact of a feature on the model’s output is directly proportional to the size of the arrow. The positive impact of a feature is depicted in red, elevating the prediction from the base value, while the negative effect is shown in blue, lowering the prediction. Certain features, such as Scr (107 μmol/L) and TBIL (13.6 μmol/L), exhibit a positive influence, while the absence of ACEI/ARB, diuretics, and antibiotics, as well as the absence of kidney disease, contribute negatively to predicting mortality. D provides a personalized explanation for a case with a mortality probability exceeding 90% and an actual outcome of mortality. The base value represents the averaged predicted results

The SHAP interaction plot visually elucidates the interplays among the top 15 features in mortality model (Fig. S4). SHAP dependence plots illustrate the impact of a single feature or the interaction between two features on mortality prediction (Fig. S5). The force plots (Fig. 2C, D) depict the prediction process for two representative patients. The cases shown in Fig. S6 illustrate patients with similar predicted probabilities, yet the constituent feature compositions leading to these predictions differ.

AI-driven web application

Employing LightGBM for mortality prediction, we have created an AI-driven web application within the Streamlit framework. In the test set (Table 2), compared to the LightGBM model built with all features, the model constructed with the top ten features showed no significant decrease in accuracy (0.89 vs. 0.91) and AUROC (0.93 vs. 0.96), with a slight increase in the F1 score (0.50 vs. 0.47). Therefore, this study utilizes the top ten features for constructing an online predictive model. When users visit the website, they input features data, which is then encoded and sent to the server for real-time mortality prediction. No private data are required besides feature information, and all input is promptly deleted after generating the prediction result. The calculator is accessible at https://strokemortalityapppy-gupkbhhnwkoghqnhvtul8b.streamlit.app/.

Discussion

To the best of our knowledge, this study is the first to develop and compare multiple ML models for predicting mortality in AIS patients using AKD data. Among 1633 AIS patients, the mortality rate was 4.84%, and 30.31% of patients developed acute/subacute kidney dysfunction. Of these, 65 (3.98%) met both AKI and AKD criteria, 257 (15.74%) developed subacute AKD, and 173 (10.59%) experienced recovery from AKI. LightGBM demonstrated the strongest predictive performance, achieving an AUROC of 0.96 for mortality prediction. The five most important features for assessing mortality risk are ACEI/ARE, renal function trajectories, neutrophil count, diuretic use, and Scr. Compared to using AKD alone, the combined use of AKI and AKD enhances the model’s predictive performance. We further employ various SHAP plots to interpret the “black box model” at both the global and local levels. Ultimately, an AI-driven web application based on the LightGBM model was created for inputting patient data to facilitate the clinicians’ assessment of mortality in AIS patients.

Huang et al. developed various ML algorithms, including eXtreme Gradient Boosting (XGBoost), to develop a mortality prediction model for severe stroke patients [37]. XGBoost outperforms traditional regression models, especially in handling imbalanced and high-dimensional data. Our study compared different ML models using AUROC and F1 scores, and LightGBM demonstrated superior predictive performance. In contrast to XGBoost, LightGBM effectively mitigates overfitting through gradient-based one-side sampling and exclusive feature bundling. In addition, it enhances computational speed and reduces memory usage by employing histogram techniques and a leaf-wise growth strategy [27].

The prediction of mortality risk in AIS patients primarily focuses on ICU patients [22, 23, 37]. Wang et al. developed a mortality prediction model for non-ICU AIS patients using various ML algorithms [24]. However, this study encountered data imbalance issues that remained unaddressed. Several investigations employing regression models have identified AKI and CKD as significant risk factors for mortality in AIS patients [38,39,40]. The impact of renal function trajectory between 7 and 90 days on mortality remains unclear. This study marks the first attempt to analyze the relationship between AKD and mortality in AIS patients. It underscores that comprehensive renal function trajectories encompassing both AKI and AKD are more vital and precise in predicting mortality risk compared to isolated AKD. This highlights the importance of monitoring the renal function trajectory from 7 to 90 days, even when AIS patients have subacute kidney dysfunction or experience rapid kidney function recovery within 7 days after AKI.

Our study utilized a variety of SHAP plots to address the challenge of the ‘black box’ in mortality risk assessment. Among these, the SHAP summary plot prioritized features based on their importance, identifying ACEI/ARB and renal function trajectories as the two most critical indicators for predicting mortality. SHAP dependence plots demonstrated that patients with acute or subacute kidney injury, particularly those with AKD and AKI, showed an increased risk of mortality associated with ACEI/ARB use. SHAP force plots and decision plots revealed variations in feature contributions for patients with similar predicted probabilities, effectively enhancing the personalization and transparency of the decision-making process.

Our study has some limitations to acknowledge. First, this study lacks specific stroke-related information that could influence mortality, such as the NIHSS score. Second, the follow-up period was too brief to ascertain whether patients developed CKD. Consequently, this study did not assess the influence of AKD on the emergence of new-onset CKD. Third, we have no data specifying the time interval between AIS onset and Scr measurement. However, patients with acute strokes are usually promptly admitted to the hospital, and blood samples are drawn shortly after their arrival. Consequently, the time lapse is unlikely to exceed a few hours. Forth, the AI-driven web application is crafted to assist clinicians in discerning AIS patients with elevated risk of mortality, rather than serving as a replacement for clinical diagnosis. Due to the retrospective nature of data collection, it is crucial to undertake additional validation using an independent population to ensure robust predictive validity across diverse usage scenarios. Fifth, our study is limited to a single center. To enhance the robustness of our findings and ensure their applicability across various scenarios, we will validate our results using an independent population.

Conclusions

In summary, AKD plays a crucial role in evaluating the mortality risk of AIS patients. Comprehensive renal function trajectories, encompassing both AKI and AKD, are of paramount importance for predicting mortality. The LightGBM model exhibited robust performance as a tool for mortality prediction in AIS patients. The utilization of this AI-driven web application has the potential to significantly reduce mortality rates and assist physicians in making informed treatment decisions.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

ACEI/ARB:

Angiotensin-converting enzyme inhibitor/angiotensin receptor blocker

ADQI:

Acute disease quality initiative

AI:

Artificial intelligence

AIS:

Acute ischemic stroke

AKD:

Acute kidney disease

AKI:

Acute kidney injury

AUROC:

Area under the receiver operating characteristic curve

BSL:

Brier score loss

CKD:

Chronic kidney disease

CKD-EPI:

The Chronic Kidney Disease Epidemiology Collaboration

DCA:

Decision curve analysis

eGFR:

Estimated glomerular filtration rate

GBM:

Gradient boosting machine

ICU:

Intensive care unit

KDIGO:

Kidney disease improving global outcomes

KNN:

K-nearest neighbors

LightGBM:

Light gradient boosting machine

LR:

Logistic regression

ML:

Machine learning

MLP:

Multi-layer perceptron

NB:

Naive Bayes

NKD:

No kidney disease

RF:

Random forest

Scr:

Serum creatinine

SD:

Standard deviation

SHAP:

Shapley additive explanations

SVM:

Support vector machine

XGBoost:

Extreme gradient boosting

References

  1. Feigin VL, Owolabi MO. Pragmatic solutions to reduce the global burden of stroke: a World Stroke Organization-Lancet Neurology Commission. Lancet Neurol. 2023. https://doi.org/10.1016/S1474-4422(23)00277-6.

    Article  PubMed  Google Scholar 

  2. Zhang X, Li H, Wang H, Zhang Q, Deng X, Zhang S, Wang L, Guo C, Zhao F, Yin Y, Zhou T, Zhong J, Feng H, Chen W, Zhang J, Feng H, Hu R. Iron/ROS/Itga3 mediated accelerated depletion of hippocampal neural stem cell pool contributes to cognitive impairment after hemorrhagic stroke. Redox Biol. 2024;71: 103086.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Feigin VL, Stark BA, Johnson CO, Roth GA, Bisignano C, Abady GG, Abbasifard M, Abbasi-Kangevari M, Abd-Allah F, Abedi V, Abualhasan A. Global, regional, and national burden of stroke and its risk factors, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet Neurol. 2021;20:795–820.

    Article  CAS  Google Scholar 

  4. Hojs Fabjan T, Penko M, Hojs R. Renal dysfunction predicts mortality in type 2 diabetic patients suffering from an acute ischemic stroke. Eur J Inter Med. 2018;52:e22–4.

    Article  Google Scholar 

  5. Yao QY, Fu ML, Zhao Q, Zheng XM, Tang K, Cao LM. Image-based visualization of stents in mechanical thrombectomy for acute ischemic stroke: preliminary findings from a series of cases. World J Clin Cases. 2023;11:5047–55.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Zhang C, Ge H, Zhang S, Liu D, Jiang Z, Lan C, Li L, Feng H, Hu R. Hematoma evacuation via image-guided para-corticospinal tract approach in patients with spontaneous intracerebral hemorrhage. Neurol Ther. 2021;10:1001–13.

    Article  PubMed  PubMed Central  Google Scholar 

  7. Wu HH, Chang TY, Liu CH, Lin JR, Liou CW, Lee JD, Peng TI, Lee M, Lee TH. Impact of chronic kidney disease severity on causes of death after first-ever stroke: a population-based study using nationwide data linkage. PLoS ONE. 2020;15: e0241891.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Zorrilla-Vaca A, Ziai W, Connolly ES Jr, Geocadin R, Thompson R, Rivera-Lara L. Acute kidney injury following acute ischemic stroke and intracerebral hemorrhage: a meta-analysis of prevalence rate and mortality risk. Cerebrovasc Dis (Basel, Switzerland). 2018;45:1–9.

    Article  Google Scholar 

  9. Chawla LS, Eggers PW, Star RA, Kimmel PL. Acute kidney injury and chronic kidney disease as interconnected syndromes. N Engl J Med. 2014;371:58–66.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Sawhney S, Ball W, Bell S, Black C, Christiansen CF, Heide-Jørgensen U, Jensen SK, Lambourg E, Ronksley PE, Tan Z, Tonelli M, James MT. Recovery of kidney function after acute kidney disease—a multi-cohort analysis. Nephrol Dial Transplant. 2023. https://doi.org/10.1093/ndt/gfad180.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Palevsky PM, Liu KD, Brophy PD, Chawla LS, Parikh CR, Thakar CV, Tolwani AJ, Waikar SS, Weisbord SD. KDOQI US commentary on the 2012 KDIGO clinical practice guideline for acute kidney injury. Am J Kidney Dis. 2012;61(2013):649–72.

    Google Scholar 

  12. Chawla LS, Bellomo R, Bihorac A, Goldstein SL, Siew ED, Bagshaw SM, Bittleman D, Cruz D, Endre Z, Fitzgerald RL, Forni L, Kane-Gill SL, Hoste E, Koyner J, Liu KD, Macedo E, Mehta R, Murray P, Nadim M, Ostermann M, Palevsky PM, Pannu N, Rosner M, Wald R, Zarbock A, Ronco C, Kellum JA. Acute kidney disease and renal recovery: consensus report of the Acute Disease Quality Initiative (ADQI) 16 Workgroup. Nat Rev Nephrol. 2017;13:241–57.

    Article  PubMed  Google Scholar 

  13. Yu KH, Beam AL, Kohane IS. Artificial intelligence in healthcare. Nat Biomed Eng. 2018;2:719–31.

    Article  PubMed  Google Scholar 

  14. Nadkarni GN. Introduction to artificial intelligence and machine learning in nephrology. Clin J Am Soc Nephrol. 2023;18:392–3.

    Article  PubMed  Google Scholar 

  15. Barrera FJ, Brown EDL, Rojo A, Obeso J, Plata H, Lincango EP, Terry N, Rodríguez-Gutiérrez R, Hall JE, Shekhar S. Application of machine learning and artificial intelligence in the diagnosis and classification of polycystic ovarian syndrome: a systematic review. Front Endocrinol. 2023;14:1106625.

    Article  Google Scholar 

  16. Yang T, Martinez-Useros J, Liu J, Alarcón I, Li C, Li W, Xiao Y, Ji X, Zhao Y, Wang L, Morales-Conde S, Yang Z. A retrospective analysis based on multiple machine learning models to predict lymph node metastasis in early gastric cancer. Front Oncol. 2022;12:1023110.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Jawadi Z, He R, Srivastava PK, Fonarow GC, Khalil SO, Krishnan S, Eskin E, Chiang JN, Nsair A. Predicting in-hospital mortality among patients admitted with a diagnosis of heart failure: a machine learning approach. ESC Heart Fail. 2024. https://doi.org/10.1002/ehf2.14796.

    Article  PubMed  Google Scholar 

  18. Mosfeldt M, Jørgensen HL, Lauritzen JB, Jansson K. Development and internal validation of a multivariable prediction model for mortality after hip fracture with machine learning techniques. Calcif Tissue Int. 2024;114:568–82.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Park SW, Yeo NY, Kang S, Ha T, Kim TH, Lee D, Kim D, Choi S, Kim M, Lee D, Kim D, Kim WJ, Lee SJ, Heo YJ, Moon DH, Han SS, Kim Y, Choi HS, Oh DK, Lee SY, Park M, Lim CM, Heo J. Early prediction of mortality for septic patients visiting emergency room based on explainable machine learning: a real-world multicenter study. J Korean Med Sci. 2024;39: e53.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, Katz R, Himmelfarb J, Bansal N, Lee SI. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. 2020;2:56–67.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Ali S, Akhlaq F, Imran AS, Kastrati Z, Daudpota SM, Moosa M. The enlightening role of explainable artificial intelligence in medical and healthcare domains: a systematic literature review. Comput Biol Med. 2023;166: 107555.

    Article  PubMed  Google Scholar 

  22. Liu W, Ma W, Bai N, Li C, Liu K, Yang J, Zhang S, Zhu K, Zhou Q, Liu H, Guo J, Li L. Identification of key predictors of hospital mortality in critically ill patients with embolic stroke using machine learning. Biosci Rep. 2022;42:BSR20220995.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Kurtz P, Peres IT, Soares M, Salluh JIF, Bozza FA. Hospital length of stay and 30-day mortality prediction in stroke: a machine learning analysis of 17,000 ICU admissions in Brazil. Neurocrit Care. 2022;37:313–21.

    Article  PubMed  Google Scholar 

  24. Wang K, Gu L, Liu W, Xu C, Yin C, Liu H, Rong L, Li W, Wei X. The predictors of death within 1 year in acute ischemic stroke patients based on machine learning. Front Neurol. 2023;14:1092534.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Adnan M, Alarood AAS, Uddin MI, Ur Rehman I. Utilizing grid search cross-validation with adaptive boosting for augmenting performance of machine learning models. PeerJ Comput Sci. 2022;8: e803.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Levey AS, Stevens LA, Schmid CH, Zhang YL, Castro AF 3rd, Feldman HI, Kusek JW, Eggers P, Van Lente F, Greene T, Coresh J. A new equation to estimate glomerular filtration rate. Ann Intern Med. 2009;150:604–12.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Rufo DD, Debelee TG, Ibenthal A, Negera WG. Diagnosis of diabetes mellitus using gradient boosting machine (LightGBM). Diagnostics (Basel). 2021;11:1714.

    Article  CAS  PubMed  Google Scholar 

  28. Hu J, Szymczak S. A review on longitudinal data analysis with random forest. Brief Bioinform. 2023;24:bbad002.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Uddin S, Haque I, Lu H, Moni MA, Gide E. Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction. Sci Rep. 2022;12:6256.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Liu R, Li Y, Tao L, Liang D, Zheng HT. Are we ready for a new paradigm shift? A survey on visual deep MLP. Patterns (New York, NY). 2022;3:100520.

    Google Scholar 

  31. Harada D, Asanoi H, Noto T, Takagawa J. Naive Bayes prediction of the development of cardiac events in heart failure with preserved ejection fraction in an outpatient clinic - beyond B-type natriuretic peptide. Circ J. 2021;86:37–46.

    Article  PubMed  Google Scholar 

  32. Dong C, Yang N, Zhao R, Yang Y, Gu X, Fu T, Sun C, Gu Z. SVM-based model combining patients’ reported outcomes and lymphocyte phenotypes of depression in systemic lupus erythematosus. Biomolecules. 2023;13:723.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Song X, Liu X, Liu F, Wang C. Comparison of machine learning and logistic regression models in predicting acute kidney injury: a systematic review and meta-analysis. Int J Med Informatics. 2021;151: 104484.

    Article  Google Scholar 

  34. Lundberg SM, Nair B, Vavilala MS, Horibe M, Eisses MJ, Adams T, Liston DE, Low DK, Newman SF, Kim J, Lee SI. Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat Biomed Eng. 2018;2:749–60.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Ren Y, Wu D, Tong Y, López-DeFede A, Gareau S. Issue of data imbalance on low birthweight baby outcomes prediction and associated risk factors identification: establishment of benchmarking key machine learning models with data rebalancing strategies. J Med Internet Res. 2023;25: e44081.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Nápoles-Duarte JM, Biswas A, Parker MI, Palomares-Baez JP, Chávez-Rojo MA, Rodríguez-Valdez LM. Stmol: a component for building interactive molecular visualizations within streamlit web-applications. Front Mol Biosci. 2022;9: 990846.

    Article  PubMed  PubMed Central  Google Scholar 

  37. Huang J, Chen H, Deng J, Liu X, Shu T, Yin C, Duan M, Fu L, Wang K, Zeng S. Interpretable machine learning for predicting 28-day all-cause in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the ICU: a multi-center retrospective cohort study with internal and external cross-validation. Front Neurol. 2023;14:1185447.

    Article  PubMed  PubMed Central  Google Scholar 

  38. Ovbiagele B. Chronic kidney disease and risk of death during hospitalization for stroke. J Neurol Sci. 2011;301:46–50.

    Article  PubMed  Google Scholar 

  39. Laible M, Jenetzky E, Möhlenbruch MA, Bendszus M, Ringleb PA, Rizos T. The impact of post-contrast acute kidney injury on in-hospital mortality after endovascular thrombectomy in patients with acute ischemic stroke. Front Neurol. 2021;12: 665614.

    Article  PubMed  PubMed Central  Google Scholar 

  40. Arnold J, Sims D, Gill P, Cockwell P, Ferro C. Acute kidney injury calculated using admission serum creatinine underestimates 30-day and 1-year mortality after acute stroke. Clin Kidney J. 2020;13:46–54.

    Article  CAS  PubMed  Google Scholar 

Download references

Funding

This work was supported by the Taishan Scholar Program of Shandong Province (grant number tstp20230665); the National Natural Science Foundation of China (grant numbers 81970582 and 82270724); the Qingdao Key Health Discipline Development Fund; and the Qingdao Key Clinical Specialty Elite Discipline.

Author information

Authors and Affiliations

Authors

Contributions

LYX and CYL contributed equally to this work and should be considered the co-first authors. Conceptualization and design: LYX, CYL and YX; writing—original draft: LYX and CYL; methodology: LYX, JQZ, CYY, and LZ; artificial intelligence: LYX, CG, XFS, and NXZ; data curation and interpretation: TYL, BZ, and QDB; writing—review and editing: CYL and YX. All the authors read and approved the final manuscript.

Corresponding author

Correspondence to Yan Xu.

Ethics declarations

Ethics approval and consent to participate

This study obtained approval from the Ethics Committee of the Affiliated Hospital of Qingdao University (Approval No. QYFY WZLL 27276). The ethics committee supervised the complete de-identification process, resulting in the waiver of informed consent for individual patients.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

40001_2024_1940_MOESM1_ESM.pdf

Additional file 1. Fig. S1. ROC curves and AUROC values for mortality prediction in test set. Fig. S2. ROC (A) and DCA (B) curves of training and test sets for mortality prediction. Fig. S3. The SHAP summary plot of LightGBM model using “AKD grade” as a proxy for “renal function trajectories” in predicting mortality risk. Fig. S4. The SHAP interaction plot depicting the interactions among the top fifteen features of the lightGBM model for mortality prediction. Fig. S5. The SHAP dependence plots illustrate the correlations between key features in the prediction of mortality. A depicts the correlation between renal function trajectories and ACEI/ARB in predicting mortality. The x-axis represents the actual values of renal function trajectories, whereas the y-axis shows the SHAP values for trajectories, with values above zero suggesting an increased risk of mortality. Each dot represents an individual case, with the color transitioning from blue to red to indicate whether ACEI/ARB were taken or not. Specifically, the impact of ACEI/ARB on the mortality probability varies across different renal function trajectories. Among patients with normal kidney function, the use of ACEI/ARB is associated with a decrease in the risk of mortality. Conversely, for patients with AKD accompanied by AKI, the use of ACEI/ARB significantly increases the risk of mortality. B Illustrates the correlation between baseline eGFR and ACEI/ARB in predicting mortality. Among patients with lower baseline eGFR levels, the use of ACEI/ARB is associated with a slight increase in the risk of mortality. C Depicts the correlation between neutrophil count and renal function trajectories in predicting mortality. D Depicts the correlation between neutrophil count and antibiotics in predicting mortality. Fig. S6. The SHAP decision plots provided a detailed view of the inner workings of the lightGBM model. A, B provides personalized explanations for two cases with mortality probabilities below 10% and actual outcomes of survival. The direction of the line visualizes the decision process of the LightGBM model from the base value to the predicted value. The values adjacent to the line denote the measured values of the features. C, D provides personalized explanations for two cases with mortality probabilities exceeding 90% and actual outcomes of death. Table S1. Characteristics of hospital encounters in the study sample, overall and according to mortality [mean ± SD; n (%)]. Table S2. Performance of eight ML models for predicting mortality. Table S3. Performance of the LightGBM model for predicting mortality in the test set without data balancing.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xu, L., Li, C., Zhang, J. et al. Personalized prediction of mortality in patients with acute ischemic stroke using explainable artificial intelligence. Eur J Med Res 29, 341 (2024). https://doi.org/10.1186/s40001-024-01940-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s40001-024-01940-2

Keywords