Personalized prediction of mortality in patients with acute ischemic stroke using explainable artificial intelligence

Xu, Lingyu; Li, Chenyu; Zhang, Jiaqi; Guan, Chen; Zhao, Long; Shen, Xuefei; Zhang, Ningxin; Li, Tianyang; Yang, Chengyu; Zhou, Bin; Bu, Quandong; Xu, Yan

doi:10.1186/s40001-024-01940-2

Research
Open access
Published: 20 June 2024

European Journal of Medical Research volume 29, Article number: 341 (2024)

Abstract

Background

Research on acute kidney disease (AKD) after acute ischemic stroke (AIS) is scarce, and how clinical features influence its prognosis remains unknown. We aimed to employ interpretable machine learning (ML) models to study AIS and to clarify their decision-making process in identifying the risk of mortality.

Methods

We conducted a retrospective cohort study involving AIS patients from January 2020 to June 2021. Patient data were randomly divided into training and test sets. Eight ML algorithms were employed to construct predictive models for mortality. The performance of the best model was evaluated using various metrics. Furthermore, we created an artificial intelligence (AI)-driven web application that leveraged the top ten most crucial features for mortality prediction.

Results

The study cohort consisted of 1633 AIS patients, among whom 257 (15.74%) developed subacute AKD, 173 (10.59%) experienced AKI recovery, and 65 (3.98%) met criteria for both AKI and AKD. The mortality rate was 4.84%. The LightGBM model performed best, with an AUROC of 0.96 for mortality prediction. The top five features linked to mortality were ACEI/ARB, renal function trajectories, neutrophil count, diuretics, and serum creatinine. Moreover, we designed a web application using the LightGBM model to estimate mortality risk.

Conclusions

Complete renal function trajectories, including AKI and AKD, are vital for predicting mortality in AIS patients. An interpretable ML model effectively clarified its decision-making process for identifying AIS patients at risk of mortality. The AI-driven web application has the potential to contribute to the development of personalized early mortality prevention.

Keypoints

What was known: Acute kidney disease (AKD) describes acute kidney injury (AKI) or subacute loss of kidney function lasting more than 7 days, and thereby links AKI to subsequent chronic kidney disease (CKD). Little is known about the risk and prognosis of AKD in acute ischemic stroke (AIS) patients.
This study adds: Renal function trajectories, including both AKI and AKD, play a crucial role in predicting mortality in AIS patients. The LightGBM model elucidates decision processes by providing explanations at both global and local levels. The AI web application aids in reducing mortality rates and helps physicians make informed treatment decisions.
Potential impact: Defining AKD as renal failure lasting more than 7 days and up to 90 days is clinically important alongside the existing definitions of AKI and CKD. Research activities and clinical practice should also focus on AKD, which predicts prognosis, especially mortality, far more accurately.

Background

The global impact of stroke is substantial, ranking second in mortality and third in disability, with an estimated annual cost exceeding US$891 billion worldwide [1, 2]. Notably, ischemic strokes constituted over 60% of all stroke events [3]. Renal impairment is a critical adverse complication in AIS patients, often induced by factors such as mechanical thrombectomy, which increases the risk of mortality [4,5,6]. Existing research has primarily focused on AKI and CKD, with a scarcity of reports addressing the renal function trajectory during the 7–90 days following kidney injury [7, 8].

AKI and CKD do not represent distinct clinical syndromes but rather frequently present as a disease continuum [9]. No consensus exists for defining criteria to evaluate kidney recovery after AKI [10]. The 2012 Kidney Disease Improving Global Outcomes (KDIGO) guideline first introduced the term ‘Acute Kidney Diseases and Disorders’, defining it as abnormalities in kidney function and/or structure lasting less than 3 months, which includes AKI [11]. The 2017 Acute Disease Quality Initiative (ADQI) workgroup defines acute kidney disease (AKD) as acute or subacute damage and/or loss of kidney function persisting for 7 to 90 days following an AKI-triggering event [12]. Although the diagnostic criteria for AKD differ between the two guidelines, both stress the importance of considering AKD as a condition of equal significance to AKI.

Artificial intelligence (AI) is at the forefront of digital medicine [13]. Machine learning (ML), a fundamental branch of AI, excels in deciphering complex nonlinear associations among multidimensional features [14]. It has been extensively applied in healthcare, spanning areas such as medical diagnostics and the prediction of disease risks [15, 16]. Numerous studies employ ML models to predict mortality risk in patients with conditions such as heart failure, surgical interventions, and sepsis [17,18,19]. These studies predominantly utilize decision tree-based algorithms, which handle nonlinear features more effectively and mitigate overfitting compared to traditional regression models. In addition, ML significantly enhances outcome interpretability by elucidating influential variables, complex internal operations, and learned decision-making paths. SHapley Additive exPlanations (SHAP), a prominent interpretive method, quantifies the marginal contribution of each feature upon integration into a ‘black-box’ model, providing explanations at both global and local levels [20, 21]. Its strength lies in quantifying both the magnitude and the direction of each feature’s effect on the model’s output. In assessing mortality risk for AIS patients, research primarily focuses on those in the intensive care unit (ICU) [22, 23], which leaves a gap in prognostic evaluations for non-ICU AIS patients. Studies involving non-ICU AIS patients face challenges related to imbalanced data distribution, with a mortality rate of less than 5%, and this imbalance remains unaddressed [24]. Importantly, there is a dearth of research dedicated to predicting the impact of AKD on the mortality of AIS patients.

Hence, this study aimed to achieve the following objectives: (1) evaluate the incidence of AKI, AKD, and mortality among AIS patients; (2) assess mortality risk using various ML algorithms and identify the most optimal model; (3) utilize SHAP analysis to elucidate the contributions of individual features to the outcome and unveil the underlying decision-making process; (4) compare the predictive capabilities of using AKD independently or in combination with AKI for predicting mortality; (5) develop a user-friendly online prediction tool for estimating the probability of mortality in AIS patients.

Materials and methods

Study design

This retrospective cohort study involved 1633 patients diagnosed with AIS between January 2020 and June 2021. Patients were randomly split into two sets: 15% formed a held-out test set that was not seen during model development and was used only to assess the final model’s performance, and the remaining 85% formed the training set used for model building. During the training phase, we employed a grid search with tenfold cross-validation to fine-tune model hyperparameters and prevent overfitting [25].
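The split-then-tune procedure described above can be sketched as follows. This is a minimal illustration, not the authors' code: the data are synthetic, logistic regression stands in for the models actually compared, the parameter grid is hypothetical, and stratifying the split by outcome is an assumption (the text only says the split was random).

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, StratifiedKFold, train_test_split

# Synthetic stand-in for the cohort: 1633 rows, roughly 5% positive class.
X, y = make_classification(
    n_samples=1633, n_features=20, weights=[0.95], random_state=0
)

# Hold out 15% as the test set; stratification is an assumption of this sketch.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.15, stratify=y, random_state=0
)

# Tenfold cross-validated grid search, run on the training set only.
cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
search = GridSearchCV(
    estimator=LogisticRegression(max_iter=1000),  # stand-in estimator
    param_grid={"C": [0.01, 0.1, 1.0, 10.0]},     # hypothetical grid
    scoring="roc_auc",
    cv=cv,
)
search.fit(X_train, y_train)

# The held-out test set is scored once, after tuning is finished.
test_auc = search.score(X_test, y_test)
```

Keeping the test set out of the grid search is what makes the final score an estimate of performance on unseen patients rather than a by-product of tuning.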

Patients diagnosed with AIS were included according to the International Classification of Diseases version 10 (ICD-10). Individuals meeting any of the following criteria were excluded: (1) age < 18 years; (2) hospitalization duration < 24 h; (3) hospital-acquired or traumatic brain injury with concurrent stroke, or comorbid intracranial tumor, transient ischemic attack, or other intracranial disorders; (4) concurrent Stage 5 CKD, undergoing renal replacement therapy, or having undergone kidney transplant; and (5) patients with incomplete data recording.

Data collection

Clinical information was extracted using natural language processing and parsing methods applied to structured data within the electronic health record. Data pertaining to demographic characteristics, medical history, and comorbidities were collected upon admission. Medication records were compiled during hospitalization, with particular attention to instances where these medications were administered before the onset of kidney injury. Comprehensive blood counts, coagulation markers, blood chemistry analyses, and urine tests were conducted within 1 week of admission. Initially, we included 104 readily available features based on expert clinical opinions and literature reviews. Following the removal of features with a missing proportion greater than 15%, we retained 86 features for building the prediction models.
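The missing-data filter mentioned above (dropping features with more than 15% missing values) might look like the sketch below. The function name, column names, and toy values are hypothetical and serve only to illustrate the threshold rule.

```python
import numpy as np
import pandas as pd


def drop_sparse_features(df: pd.DataFrame, max_missing: float = 0.15) -> pd.DataFrame:
    """Keep only columns whose fraction of missing values is <= max_missing."""
    missing_fraction = df.isna().mean()
    keep = missing_fraction[missing_fraction <= max_missing].index
    return df.loc[:, keep]


# Toy data: "scr" is 10% missing (kept), "d_dimer" is 30% missing (dropped).
toy = pd.DataFrame({
    "age": [70, 65, 81, 59, 77, 68, 73, 62, 80, 66],
    "scr": [0.9, 1.1, np.nan, 1.0, 0.8, 1.3, 0.7, 1.2, 0.9, 1.0],
    "d_dimer": [np.nan, 0.4, np.nan, 0.6, np.nan, 0.5, 0.3, 0.7, 0.2, 0.4],
})
kept = drop_sparse_features(toy)
```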

Outcome definitions

The study investigated AKI and AKD as short-term outcomes, and mortality as a long-term outcome. AKI was defined in accordance with the 2012 KDIGO criteria as either a rise in serum creatinine (Scr) of at least 0.3 mg/dL from baseline within 48 h or an increase to at least 1.5 times the baseline value within 7 days [11]. As stipulated by the 2017 ADQI guidelines, AKD was characterized by acute or subacute impairment and/or loss of kidney function occurring within 7 to 90 days following an AKI event [12]. Based on the diagnostic criteria for AKI and AKD, patients exhibited three distinct renal function trajectories following kidney injury: (1) AKI recovery, indicating that Scr returned to baseline value within 7 days; (2) subacute AKD, denoting a slow increase in Scr levels lasting more than 7 days (AKD without AKI); and (3) AKD with AKI, representing the persistence of stage 1 or greater AKI for ≥ 7 days after an AKI initiating event (AKI progressing to AKD). The final classification encompassed four categories: (1) no kidney disease (NKD), (2) AKI recovery, (3) subacute AKD, and (4) AKD with AKI. Mortality was defined by the vital status (survival or death) at the last follow-up. Clinical features, including renal function trajectories, were used to develop a risk prediction model with mortality as the binary endpoint.
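The four-category trajectory labelling can be expressed as a small decision rule. The sketch below is illustrative only: the function and argument names are hypothetical, the creatinine check uses the KDIGO thresholds (at least 0.3 mg/dL within 48 h, or at least 1.5 times baseline within 7 days) and ignores urine output, and the flag `dysfunction_beyond_7d` is assumed to summarize whether the AKD criteria were met.

```python
from enum import Enum


class Trajectory(Enum):
    NKD = "no kidney disease"
    AKI_RECOVERY = "AKI recovery"
    SUBACUTE_AKD = "subacute AKD"
    AKD_WITH_AKI = "AKD with AKI"


def meets_aki(baseline_scr: float, peak_scr_48h: float, peak_scr_7d: float) -> bool:
    """KDIGO-style serum creatinine check (values in mg/dL)."""
    absolute_rise = peak_scr_48h - baseline_scr >= 0.3
    relative_rise = peak_scr_7d >= 1.5 * baseline_scr
    return absolute_rise or relative_rise


def classify_trajectory(has_aki: bool, dysfunction_beyond_7d: bool) -> Trajectory:
    """Map the two diagnostic flags onto the four trajectory categories."""
    if has_aki and dysfunction_beyond_7d:
        return Trajectory.AKD_WITH_AKI   # AKI persisting for 7 days or more
    if has_aki:
        return Trajectory.AKI_RECOVERY   # Scr back to baseline within 7 days
    if dysfunction_beyond_7d:
        return Trajectory.SUBACUTE_AKD   # AKD without a preceding AKI
    return Trajectory.NKD


# Example: Scr rises from 1.0 to 1.6 mg/dL within 7 days and stays elevated.
example = classify_trajectory(meets_aki(1.0, 1.1, 1.6), dysfunction_beyond_7d=True)
```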

The baseline Scr level was defined as the initial Scr measurement obtained upon hospital admission. The timing of AKI and AKD diagnosis was determined when patients initially met the respective diagnostic criteria. Each patient underwent a minimum of three Scr tests, which included two tests during their hospitalization and one at their first follow-up appointment. If elevated Scr levels did not return to baseline, additional tests were performed weekly during hospitalization or at the subsequent follow-up. The estimated glomerular filtration rate (eGFR) was calculated using the Chronic Kidney Disease Epidemiology Collaboration (CKD-EPI) creatinine formula [26].
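For reference, the 2009 CKD-EPI creatinine equation cited above [26] can be written as a short function. This is a sketch rather than the study's code: the function name is hypothetical, and whether the race coefficient was applied in this cohort is not stated, so it is exposed as an optional argument that defaults to off.

```python
def ckd_epi_2009(scr_mg_dl: float, age_years: float, female: bool, black: bool = False) -> float:
    """Estimated GFR in mL/min/1.73 m^2 from the 2009 CKD-EPI creatinine equation."""
    kappa = 0.7 if female else 0.9          # sex-specific creatinine scaling
    alpha = -0.329 if female else -0.411    # exponent used when Scr <= kappa
    ratio = scr_mg_dl / kappa
    egfr = (
        141.0
        * min(ratio, 1.0) ** alpha
        * max(ratio, 1.0) ** -1.209
        * 0.993 ** age_years
    )
    if female:
        egfr *= 1.018
    if black:
        egfr *= 1.159  # race coefficient of the 2009 equation; use here is an assumption
    return egfr


# Example: a 70-year-old man with Scr 1.2 mg/dL versus the same man at 2.0 mg/dL.
egfr_mild = ckd_epi_2009(1.2, 70, female=False)
egfr_high_scr = ckd_epi_2009(2.0, 70, female=False)
```

Higher serum creatinine yields a lower estimate, so `egfr_high_scr` is smaller than `egfr_mild`.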

Model development and interpretation

The following eight ML models were trained on the data: (1) light gradient boosting machine (LightGBM), (2) GBM, (3) random forest (RF), (4) K-nearest neighbors (KNN), (5) multi-layer perceptron (MLP), (6) naive Bayes (NB), (7) support vector machine (SVM), and (8) logistic regression (LR). LightGBM and GBM are gradient-based learning frameworks that employ decision trees and boosting. LightGBM, in comparison to GBM, shortens training times and reduces memory usage by partitioning data using histograms [27]. RF constructs individual decision trees using random subsets of the training data and combines their results through majority voting for classification [28]. KNN is a frequently used supervised learning algorithm that conducts classification or regression based on feature similarity among neighboring data points [29]. MLP relies on the stacking of multiple layers of neurons, employing layer-wise propagation and nonlinear activation functions to learn and represent intricate data relationships [30]. NB is rooted in Bayes’ theorem and performs classification by calculating the posterior probabilities of different categories under given feature conditions [31]. SVM is a supervised learning algorithm that makes predictions by identifying the optimal separating hyperplane [32]. LR is a linear model that predicts probabilities based on the logistic function [33]. All models used the same dataset and applied consistent imputation and scaling techniques.

SHAP was used to interpret the results of the top-performing model. Features with positive SHAP values enhance the output, with larger numerical values indicating more significant contributions [34]. SHAP summary plots offer visualizations of essential feature rankings and the overarching relationships and directions concerning features and outcomes. SHAP force and decision plots offer an intuitive visualization of how distinct features influence an individual prediction.
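The idea of a per-feature marginal contribution can be made concrete with an exact Shapley computation on a toy model. The sketch below is not the SHAP library and not the study's model: the three-feature function is invented for illustration, and "removing" a feature is simplified to setting it to a fixed baseline of zero, whereas SHAP averages over background data.

```python
from itertools import combinations
from math import factorial


def shapley_values(value_fn, n_features):
    """Exact Shapley values for a set function value_fn(frozenset) -> float."""
    phi = [0.0] * n_features
    for i in range(n_features):
        others = [j for j in range(n_features) if j != i]
        for size in range(len(others) + 1):
            # Weight of a coalition of this size in the Shapley average.
            weight = (
                factorial(size) * factorial(n_features - size - 1) / factorial(n_features)
            )
            for subset in combinations(others, size):
                s = frozenset(subset)
                phi[i] += weight * (value_fn(s | {i}) - value_fn(s))
    return phi


def toy_model(x):
    # Hypothetical model with one interaction term between features 0 and 2.
    return 2.0 * x[0] + 3.0 * x[1] + x[0] * x[2]


instance = (1.0, 1.0, 1.0)
baseline = (0.0, 0.0, 0.0)


def value_fn(present):
    # Features outside the coalition are replaced by their baseline value.
    x = [instance[j] if j in present else baseline[j] for j in range(3)]
    return toy_model(x)


phi = shapley_values(value_fn, 3)
```

The contributions sum to the difference between the prediction for the instance and the prediction for the baseline, and the interaction term is shared equally between the two features involved. This additivity is what lets force and decision plots stack feature effects for a single patient.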

Data balancing

Our data are imbalanced, as the mortality rate is approximately 5%. To address this imbalance, we utilized a weight rebalancing technique to adjust the weights of both the majority and minority classes [35]. Only the training dataset was balanced. The test datasets remained unaltered to evaluate model performance using representative data. The scikit-learn Python library includes a built-in parameter called “class_weight” or “weights” for LR, RF, LightGBM, SVM, and KNN. The model automatically assigns a weight to each class that is inversely proportional to its frequency. The balanced weight for each class is calculated using the equation: Class weight = total number of samples/(number of classes × class sample size). The class weight for mortality was 10.34, while the class weight for non-mortality was 0.53 when the “balanced” option was used. In the case of the NB classifier, we established a prior probability of 0.5 for each class to achieve group balance. In future work, we plan to adjust class weights in the MLP classifier by modifying the loss function’s weights.
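The class-weight equation above is easy to check by hand. In the sketch below the function name is hypothetical, and the class counts (79 deaths and 1554 survivors) are the full-cohort figures from the Results; the counts in the training split are not reported, so using the full-cohort counts is an assumption that happens to reproduce the quoted weights.

```python
def balanced_class_weights(class_counts):
    """Class weight = total samples / (number of classes * class sample size)."""
    total = sum(class_counts.values())
    n_classes = len(class_counts)
    return {label: total / (n_classes * count) for label, count in class_counts.items()}


# Full-cohort counts: 79 deaths among 1633 patients (assumed here, see lead-in).
weights = balanced_class_weights({"mortality": 79, "non-mortality": 1633 - 79})
rounded = {label: round(w, 2) for label, w in weights.items()}
```

With these counts the minority class receives a weight roughly twenty times that of the majority class, so each misclassified death costs about as much during training as twenty misclassified survivors.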

AI-driven web application

A web-based calculator for predicting mortality among AIS patients was developed using the “Streamlit” application (https://share.streamlit.io/) to implement the optimal model [36]. To enhance the user-friendliness of the web calculator, this study introduced two panels: one for inputting model parameters and obtaining mortality probability, and another for providing a model introduction.

Statistical analysis

Features with missing values exceeding 15% were omitted from the dataset. Multiple imputation techniques were then applied to estimate the missing data. Utilizing LR to compute the required sample size with mortality as the outcome, we ascertained that a minimum of 801 patients is essential to achieve a statistical power of 90% for the detection of an effect size of 0.10 at a two-sided significance level (α) of 0.05. Normally distributed continuous features are reported as the mean ± standard deviation (SD) and were compared using the independent-samples t test. Non-normally distributed features are presented as the median (interquartile range) and were compared using the Mann–Whitney U test. Categorical features are reported as percentages and were compared using Pearson’s Chi-squared test. We evaluated the models’ predictive performance using a variety of metrics, including the area under the receiver operating characteristic curve (AUROC), precision, recall, accuracy, F1 score, Brier score loss (BSL), Matthews correlation coefficient, and decision curve analysis (DCA). The AUROC and F1 score were utilized to identify the optimal model. A two-tailed P value of less than 0.05 was considered statistically significant. Our analysis was conducted using the Python programming language (Python Software Foundation, version 3.9.13) within the integrated development environment Visual Studio Code 1.81.1.
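Several of the threshold-based metrics listed above follow directly from the confusion matrix. The sketch below uses a hypothetical helper and made-up predictions for ten patients; it is meant to show how the quantities relate, not to reproduce the study's results.

```python
from math import sqrt


def binary_metrics(y_true, y_prob, threshold=0.5):
    """Precision, recall, F1, accuracy, MCC and Brier score for binary labels."""
    y_pred = [1 if p >= threshold else 0 for p in y_prob]
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    accuracy = (tp + tn) / len(y_true)
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    # Brier score: mean squared error of the predicted probabilities.
    brier = sum((p - t) ** 2 for t, p in zip(y_true, y_prob)) / len(y_true)
    return {
        "precision": precision, "recall": recall, "f1": f1,
        "accuracy": accuracy, "mcc": mcc, "brier": brier,
    }


# Made-up example: two deaths among ten patients, one of them missed.
y_true = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
y_prob = [0.9, 0.2, 0.7, 0.1, 0.1, 0.2, 0.3, 0.1, 0.05, 0.4]
metrics = binary_metrics(y_true, y_prob)
```

In this toy case accuracy is 0.8 while F1 is only 0.5, which illustrates why accuracy alone is a weak guide when the outcome is rare and why F1 was used alongside AUROC for model selection.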

Results

Study cohort

A retrospective review of medical records was conducted for 1876 AIS patients from January 2020 to June 2021, of whom 1633 were eligible for further analysis (Fig. 1). Table 1 presents the baseline characteristics of the study population, and Table S1 stratifies the same cohort based on mortality. The incidence rates of AKI, AKD, and mortality were 14.57% (238/1633), 19.72% (322/1633), and 4.84% (79/1633), respectively. From the perspective of renal function trajectories, a total of 495 patients (30.31%) developed acute/subacute kidney dysfunction (meeting AKI and/or AKD criteria), comprising 257 patients (15.74%) with subacute AKD, 173 patients (10.59%) who experienced recovery from AKI, and 65 patients (3.98%) meeting both AKI and AKD criteria. Compared with survivors, patients who died were older (mean age: 73 vs. 68 years) and more often had fever (15.19% vs. 8.04%). Among the patients who died, 31.65% had AKD with AKI, compared with 13.92% with subacute AKD, 25.32% with AKI recovery, and 29.11% with NKD.
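The reported incidence figures fit together once the three trajectories are combined, as the short check below shows. It uses only the counts given in the paragraph above; the variable names are illustrative.

```python
cohort = 1633
deaths = 79
trajectory_counts = {"AKI recovery": 173, "subacute AKD": 257, "AKD with AKI": 65}

# AKI covers recovered AKI plus AKI that progressed to AKD.
aki_total = trajectory_counts["AKI recovery"] + trajectory_counts["AKD with AKI"]
# AKD covers subacute AKD plus AKD that followed AKI.
akd_total = trajectory_counts["subacute AKD"] + trajectory_counts["AKD with AKI"]
any_dysfunction = sum(trajectory_counts.values())
no_kidney_disease = cohort - any_dysfunction


def pct(n):
    """Percentage of the cohort, rounded to two decimals."""
    return round(100 * n / cohort, 2)


summary = {
    "AKI": (aki_total, pct(aki_total)),
    "AKD": (akd_total, pct(akd_total)),
    "any dysfunction": (any_dysfunction, pct(any_dysfunction)),
    "mortality": (deaths, pct(deaths)),
}
```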

Table 1 Baseline characteristics of inpatients [mean ± SD; n (%)]

Model performance

A comprehensive set of 86 features served as predictors for mortality and were integrated into the ML models. Among all ML models, the LightGBM model displayed the best performance, with an AUROC of 0.96 and an F1 score of 0.47 (Fig. S1, Table S1, and Table S2). After data balancing, the model showed no significant difference in AUROC and accuracy, but it achieved a better balance between precision and recall (Table 2 and Table S3). When the model incorporated only the top 10 features, the AUROC remained high at 0.93, while maintaining a balance between precision and recall. Consequently, the LightGBM model was utilized in later stages for result interpretation and the development of an AI-driven web application. DCA revealed that the LightGBM model possessed high clinical utility (Fig. S2). Additional information concerning various performance metrics, such as accuracy, BSL, and Matthews correlation coefficient, is available in Table 2 and Table S2.

Table 2 Performance of LightGBM model for predicting mortality*

SHAP interpreter for the model

Figure 2A, B illustrates the SHAP summary plot of the LightGBM model. The top five features associated with mortality were ACEI/ARB, renal function trajectories (including AKI recovery, subacute AKD, and AKD with AKI), neutrophil count, diuretic use, and Scr. Substituting “AKD grade” for “renal function trajectories” in predicting the risk of mortality decreased the model’s AUROC to 0.92, lower than that of the predictive model constructed by combining AKI and AKD. Furthermore, the importance ranking of “AKD grade” fell outside the top 15, so it was not a primary feature for predicting mortality (Fig. S3).

The SHAP interaction plot visually elucidates the interplays among the top 15 features in mortality model (Fig. S4). SHAP dependence plots illustrate the impact of a single feature or the interaction between two features on mortality prediction (Fig. S5). The force plots (Fig. 2C, D) depict the prediction process for two representative patients. The cases shown in Fig. S6 illustrate patients with similar predicted probabilities, yet the constituent feature compositions leading to these predictions differ.

AI-driven web application

Employing LightGBM for mortality prediction, we created an AI-driven web application within the Streamlit framework. In the test set (Table 2), compared to the LightGBM model built with all features, the model constructed with the top ten features showed no significant decrease in accuracy (0.89 vs. 0.91) or AUROC (0.93 vs. 0.96), with a slight increase in the F1 score (0.50 vs. 0.47). Therefore, this study used the top ten features to construct the online predictive model. When users visit the website, they input feature data, which are then encoded and sent to the server for real-time mortality prediction. No private data are required besides feature information, and all input is promptly deleted after generating the prediction result. The calculator is accessible at https://strokemortalityapppy-gupkbhhnwkoghqnhvtul8b.streamlit.app/.

Discussion

To the best of our knowledge, this study is the first to develop and compare multiple ML models for predicting mortality in AIS patients using AKD data. Among 1633 AIS patients, the mortality rate was 4.84%, and 30.31% of patients developed acute/subacute kidney dysfunction. Of these, 65 (3.98%) met both AKI and AKD criteria, 257 (15.74%) developed subacute AKD, and 173 (10.59%) experienced recovery from AKI. LightGBM demonstrated the strongest predictive performance, achieving an AUROC of 0.96 for mortality prediction. The five most important features for assessing mortality risk were ACEI/ARB, renal function trajectories, neutrophil count, diuretic use, and Scr. Compared to using AKD alone, the combined use of AKI and AKD enhanced the model’s predictive performance. We further employed various SHAP plots to interpret the “black box model” at both the global and local levels. Ultimately, an AI-driven web application based on the LightGBM model was created for inputting patient data to facilitate clinicians’ assessment of mortality in AIS patients.

Huang et al. applied various ML algorithms, including eXtreme Gradient Boosting (XGBoost), to develop a mortality prediction model for severe stroke patients [37]. XGBoost outperforms traditional regression models, especially in handling imbalanced and high-dimensional data. Our study compared different ML models using AUROC and F1 scores, and LightGBM demonstrated superior predictive performance. In contrast to XGBoost, LightGBM effectively mitigates overfitting through gradient-based one-side sampling and exclusive feature bundling. In addition, it enhances computational speed and reduces memory usage by employing histogram techniques and a leaf-wise growth strategy [27].

The prediction of mortality risk in AIS patients primarily focuses on ICU patients [22, 23, 37]. Wang et al. developed a mortality prediction model for non-ICU AIS patients using various ML algorithms [24]. However, this study encountered data imbalance issues that remained unaddressed. Several investigations employing regression models have identified AKI and CKD as significant risk factors for mortality in AIS patients [38,39,40]. The impact of renal function trajectory between 7 and 90 days on mortality remains unclear. This study marks the first attempt to analyze the relationship between AKD and mortality in AIS patients. It underscores that comprehensive renal function trajectories encompassing both AKI and AKD are more vital and precise in predicting mortality risk compared to isolated AKD. This highlights the importance of monitoring the renal function trajectory from 7 to 90 days, even when AIS patients have subacute kidney dysfunction or experience rapid kidney function recovery within 7 days after AKI.

Our study utilized a variety of SHAP plots to address the challenge of the ‘black box’ in mortality risk assessment. Among these, the SHAP summary plot prioritized features based on their importance, identifying ACEI/ARB and renal function trajectories as the two most critical indicators for predicting mortality. SHAP dependence plots demonstrated that patients with acute or subacute kidney injury, particularly those with AKD and AKI, showed an increased risk of mortality associated with ACEI/ARB use. SHAP force plots and decision plots revealed variations in feature contributions for patients with similar predicted probabilities, effectively enhancing the personalization and transparency of the decision-making process.

Our study has some limitations to acknowledge. First, this study lacks specific stroke-related information that could influence mortality, such as the NIHSS score. Second, the follow-up period was too brief to ascertain whether patients developed CKD. Consequently, this study did not assess the influence of AKD on the emergence of new-onset CKD. Third, we have no data specifying the time interval between AIS onset and Scr measurement. However, patients with acute strokes are usually promptly admitted to the hospital, and blood samples are drawn shortly after their arrival. Consequently, the time lapse is unlikely to exceed a few hours. Fourth, the AI-driven web application is designed to assist clinicians in identifying AIS patients with elevated risk of mortality, rather than serving as a replacement for clinical diagnosis. Due to the retrospective nature of data collection, it is crucial to undertake additional validation using an independent population to ensure robust predictive validity across diverse usage scenarios. Fifth, our study is limited to a single center. To enhance the robustness of our findings and ensure their applicability across various scenarios, we will validate our results using an independent population.

Conclusions

In summary, AKD plays a crucial role in evaluating the mortality risk of AIS patients. Comprehensive renal function trajectories, encompassing both AKI and AKD, are of paramount importance for predicting mortality. The LightGBM model exhibited robust performance as a tool for mortality prediction in AIS patients. The utilization of this AI-driven web application has the potential to significantly reduce mortality rates and assist physicians in making informed treatment decisions.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

ACEI/ARB: Angiotensin-converting enzyme inhibitor/angiotensin receptor blocker
ADQI: Acute disease quality initiative
AI: Artificial intelligence
AIS: Acute ischemic stroke
AKD: Acute kidney disease
AKI: Acute kidney injury
AUROC: Area under the receiver operating characteristic curve
BSL: Brier score loss
CKD: Chronic kidney disease
CKD-EPI: The Chronic Kidney Disease Epidemiology Collaboration
DCA: Decision curve analysis
eGFR: Estimated glomerular filtration rate
GBM: Gradient boosting machine
ICU: Intensive care unit
KDIGO: Kidney disease improving global outcomes
KNN: K-nearest neighbors
LightGBM: Light gradient boosting machine
LR: Logistic regression
ML: Machine learning
MLP: Multi-layer perceptron
NB: Naive Bayes
NKD: No kidney disease
RF: Random forest
Scr: Serum creatinine
SD: Standard deviation
SHAP: Shapley additive explanations
SVM: Support vector machine
XGBoost: Extreme gradient boosting

References

Feigin VL, Owolabi MO. Pragmatic solutions to reduce the global burden of stroke: a World Stroke Organization-Lancet Neurology Commission. Lancet Neurol. 2023. https://doi.org/10.1016/S1474-4422(23)00277-6.
Zhang X, Li H, Wang H, Zhang Q, Deng X, Zhang S, Wang L, Guo C, Zhao F, Yin Y, Zhou T, Zhong J, Feng H, Chen W, Zhang J, Feng H, Hu R. Iron/ROS/Itga3 mediated accelerated depletion of hippocampal neural stem cell pool contributes to cognitive impairment after hemorrhagic stroke. Redox Biol. 2024;71: 103086.
Feigin VL, Stark BA, Johnson CO, Roth GA, Bisignano C, Abady GG, Abbasifard M, Abbasi-Kangevari M, Abd-Allah F, Abedi V, Abualhasan A. Global, regional, and national burden of stroke and its risk factors, 1990–2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet Neurol. 2021;20:795–820.
Hojs Fabjan T, Penko M, Hojs R. Renal dysfunction predicts mortality in type 2 diabetic patients suffering from an acute ischemic stroke. Eur J Inter Med. 2018;52:e22–4.
Yao QY, Fu ML, Zhao Q, Zheng XM, Tang K, Cao LM. Image-based visualization of stents in mechanical thrombectomy for acute ischemic stroke: preliminary findings from a series of cases. World J Clin Cases. 2023;11:5047–55.
Zhang C, Ge H, Zhang S, Liu D, Jiang Z, Lan C, Li L, Feng H, Hu R. Hematoma evacuation via image-guided para-corticospinal tract approach in patients with spontaneous intracerebral hemorrhage. Neurol Ther. 2021;10:1001–13.
Wu HH, Chang TY, Liu CH, Lin JR, Liou CW, Lee JD, Peng TI, Lee M, Lee TH. Impact of chronic kidney disease severity on causes of death after first-ever stroke: a population-based study using nationwide data linkage. PLoS ONE. 2020;15: e0241891.
Zorrilla-Vaca A, Ziai W, Connolly ES Jr, Geocadin R, Thompson R, Rivera-Lara L. Acute kidney injury following acute ischemic stroke and intracerebral hemorrhage: a meta-analysis of prevalence rate and mortality risk. Cerebrovasc Dis (Basel, Switzerland). 2018;45:1–9.
Chawla LS, Eggers PW, Star RA, Kimmel PL. Acute kidney injury and chronic kidney disease as interconnected syndromes. N Engl J Med. 2014;371:58–66.
Sawhney S, Ball W, Bell S, Black C, Christiansen CF, Heide-Jørgensen U, Jensen SK, Lambourg E, Ronksley PE, Tan Z, Tonelli M, James MT. Recovery of kidney function after acute kidney disease—a multi-cohort analysis. Nephrol Dial Transplant. 2023. https://doi.org/10.1093/ndt/gfad180.
Palevsky PM, Liu KD, Brophy PD, Chawla LS, Parikh CR, Thakar CV, Tolwani AJ, Waikar SS, Weisbord SD. KDOQI US commentary on the 2012 KDIGO clinical practice guideline for acute kidney injury. Am J Kidney Dis. 2013;61:649–72.
Chawla LS, Bellomo R, Bihorac A, Goldstein SL, Siew ED, Bagshaw SM, Bittleman D, Cruz D, Endre Z, Fitzgerald RL, Forni L, Kane-Gill SL, Hoste E, Koyner J, Liu KD, Macedo E, Mehta R, Murray P, Nadim M, Ostermann M, Palevsky PM, Pannu N, Rosner M, Wald R, Zarbock A, Ronco C, Kellum JA. Acute kidney disease and renal recovery: consensus report of the Acute Disease Quality Initiative (ADQI) 16 Workgroup. Nat Rev Nephrol. 2017;13:241–57.
Yu KH, Beam AL, Kohane IS. Artificial intelligence in healthcare. Nat Biomed Eng. 2018;2:719–31.
Nadkarni GN. Introduction to artificial intelligence and machine learning in nephrology. Clin J Am Soc Nephrol. 2023;18:392–3.
Barrera FJ, Brown EDL, Rojo A, Obeso J, Plata H, Lincango EP, Terry N, Rodríguez-Gutiérrez R, Hall JE, Shekhar S. Application of machine learning and artificial intelligence in the diagnosis and classification of polycystic ovarian syndrome: a systematic review. Front Endocrinol. 2023;14:1106625.
Yang T, Martinez-Useros J, Liu J, Alarcón I, Li C, Li W, Xiao Y, Ji X, Zhao Y, Wang L, Morales-Conde S, Yang Z. A retrospective analysis based on multiple machine learning models to predict lymph node metastasis in early gastric cancer. Front Oncol. 2022;12:1023110.
Jawadi Z, He R, Srivastava PK, Fonarow GC, Khalil SO, Krishnan S, Eskin E, Chiang JN, Nsair A. Predicting in-hospital mortality among patients admitted with a diagnosis of heart failure: a machine learning approach. ESC Heart Fail. 2024. https://doi.org/10.1002/ehf2.14796.
Mosfeldt M, Jørgensen HL, Lauritzen JB, Jansson K. Development and internal validation of a multivariable prediction model for mortality after hip fracture with machine learning techniques. Calcif Tissue Int. 2024;114:568–82.
Park SW, Yeo NY, Kang S, Ha T, Kim TH, Lee D, Kim D, Choi S, Kim M, Lee D, Kim D, Kim WJ, Lee SJ, Heo YJ, Moon DH, Han SS, Kim Y, Choi HS, Oh DK, Lee SY, Park M, Lim CM, Heo J. Early prediction of mortality for septic patients visiting emergency room based on explainable machine learning: a real-world multicenter study. J Korean Med Sci. 2024;39: e53.
Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B, Katz R, Himmelfarb J, Bansal N, Lee SI. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. 2020;2:56–67.
Ali S, Akhlaq F, Imran AS, Kastrati Z, Daudpota SM, Moosa M. The enlightening role of explainable artificial intelligence in medical and healthcare domains: a systematic literature review. Comput Biol Med. 2023;166:107555.
Liu W, Ma W, Bai N, Li C, Liu K, Yang J, Zhang S, Zhu K, Zhou Q, Liu H, Guo J, Li L. Identification of key predictors of hospital mortality in critically ill patients with embolic stroke using machine learning. Biosci Rep. 2022;42:BSR20220995.
Kurtz P, Peres IT, Soares M, Salluh JIF, Bozza FA. Hospital length of stay and 30-day mortality prediction in stroke: a machine learning analysis of 17,000 ICU admissions in Brazil. Neurocrit Care. 2022;37:313–21.
Wang K, Gu L, Liu W, Xu C, Yin C, Liu H, Rong L, Li W, Wei X. The predictors of death within 1 year in acute ischemic stroke patients based on machine learning. Front Neurol. 2023;14:1092534.
Adnan M, Alarood AAS, Uddin MI, Ur Rehman I. Utilizing grid search cross-validation with adaptive boosting for augmenting performance of machine learning models. PeerJ Comput Sci. 2022;8:e803.
Levey AS, Stevens LA, Schmid CH, Zhang YL, Castro AF 3rd, Feldman HI, Kusek JW, Eggers P, Van Lente F, Greene T, Coresh J. A new equation to estimate glomerular filtration rate. Ann Intern Med. 2009;150:604–12.
Rufo DD, Debelee TG, Ibenthal A, Negera WG. Diagnosis of diabetes mellitus using gradient boosting machine (LightGBM). Diagnostics (Basel). 2021;11:1714.
Hu J, Szymczak S. A review on longitudinal data analysis with random forest. Brief Bioinform. 2023;24:bbad002.
Uddin S, Haque I, Lu H, Moni MA, Gide E. Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction. Sci Rep. 2022;12:6256.
Liu R, Li Y, Tao L, Liang D, Zheng HT. Are we ready for a new paradigm shift? A survey on visual deep MLP. Patterns (New York, NY). 2022;3:100520.
Harada D, Asanoi H, Noto T, Takagawa J. Naive Bayes prediction of the development of cardiac events in heart failure with preserved ejection fraction in an outpatient clinic - beyond B-type natriuretic peptide. Circ J. 2021;86:37–46.
Dong C, Yang N, Zhao R, Yang Y, Gu X, Fu T, Sun C, Gu Z. SVM-based model combining patients’ reported outcomes and lymphocyte phenotypes of depression in systemic lupus erythematosus. Biomolecules. 2023;13:723.
Song X, Liu X, Liu F, Wang C. Comparison of machine learning and logistic regression models in predicting acute kidney injury: a systematic review and meta-analysis. Int J Med Informatics. 2021;151:104484.
Lundberg SM, Nair B, Vavilala MS, Horibe M, Eisses MJ, Adams T, Liston DE, Low DK, Newman SF, Kim J, Lee SI. Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nat Biomed Eng. 2018;2:749–60.
Ren Y, Wu D, Tong Y, López-DeFede A, Gareau S. Issue of data imbalance on low birthweight baby outcomes prediction and associated risk factors identification: establishment of benchmarking key machine learning models with data rebalancing strategies. J Med Internet Res. 2023;25:e44081.
Nápoles-Duarte JM, Biswas A, Parker MI, Palomares-Baez JP, Chávez-Rojo MA, Rodríguez-Valdez LM. Stmol: a component for building interactive molecular visualizations within streamlit web-applications. Front Mol Biosci. 2022;9:990846.
Huang J, Chen H, Deng J, Liu X, Shu T, Yin C, Duan M, Fu L, Wang K, Zeng S. Interpretable machine learning for predicting 28-day all-cause in-hospital mortality for hypertensive ischemic or hemorrhagic stroke patients in the ICU: a multi-center retrospective cohort study with internal and external cross-validation. Front Neurol. 2023;14:1185447.
Ovbiagele B. Chronic kidney disease and risk of death during hospitalization for stroke. J Neurol Sci. 2011;301:46–50.
Laible M, Jenetzky E, Möhlenbruch MA, Bendszus M, Ringleb PA, Rizos T. The impact of post-contrast acute kidney injury on in-hospital mortality after endovascular thrombectomy in patients with acute ischemic stroke. Front Neurol. 2021;12:665614.
Arnold J, Sims D, Gill P, Cockwell P, Ferro C. Acute kidney injury calculated using admission serum creatinine underestimates 30-day and 1-year mortality after acute stroke. Clin Kidney J. 2020;13:46–54.

Funding

This work was supported by the Taishan Scholar Program of Shandong Province (grant number tstp20230665); the National Natural Science Foundation of China (grant numbers 81970582 and 82270724); the Qingdao Key Health Discipline Development Fund; and the Qingdao Key Clinical Specialty Elite Discipline.

Author information

Lingyu Xu and Chenyu Li have contributed equally to this work.

Authors and Affiliations

Department of Nephrology, The Affiliated Hospital of Qingdao University, 16 Jiangsu Road, Qingdao, 266003, China
Lingyu Xu, Chenyu Li, Chen Guan, Long Zhao, Xuefei Shen, Ningxin Zhang, Tianyang Li, Chengyu Yang, Bin Zhou, Quandong Bu & Yan Xu
Division of Nephrology, Medizinische Klinik Und Poliklinik IV, Klinikum der Universität, Munich, Germany
Chenyu Li
Yidu Central Hospital of Weifang, Weifang, China
Jiaqi Zhang

Contributions

LYX and CYL contributed equally to this work and should be considered co-first authors. Conceptualization and design: LYX, CYL, and YX; writing (original draft): LYX and CYL; methodology: LYX, JQZ, CYY, and LZ; artificial intelligence: LYX, CG, XFS, and NXZ; data curation and interpretation: TYL, BZ, and QDB; writing (review and editing): CYL and YX. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Yan Xu.

Ethics declarations

Ethics approval and consent to participate

This study obtained approval from the Ethics Committee of the Affiliated Hospital of Qingdao University (Approval No. QYFY WZLL 27276). The ethics committee supervised the complete de-identification process, resulting in the waiver of informed consent for individual patients.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

40001_2024_1940_MOESM1_ESM.pdf

Additional file 1.

Fig. S1. ROC curves and AUROC values for mortality prediction in the test set.

Fig. S2. ROC (A) and DCA (B) curves of the training and test sets for mortality prediction.

Fig. S3. SHAP summary plot of the LightGBM model using “AKD grade” as a proxy for “renal function trajectories” in predicting mortality risk.

Fig. S4. SHAP interaction plot depicting the interactions among the top fifteen features of the LightGBM model for mortality prediction.

Fig. S5. SHAP dependence plots illustrating the correlations between key features in the prediction of mortality. A depicts the correlation between renal function trajectories and ACEI/ARB in predicting mortality. The x-axis represents the actual values of renal function trajectories, whereas the y-axis shows the SHAP values for trajectories, with values above zero suggesting an increased risk of mortality. Each dot represents an individual case, with the color, ranging from blue to red, indicating whether ACEI/ARB were taken. The impact of ACEI/ARB on the mortality probability varies across renal function trajectories: among patients with normal kidney function, the use of ACEI/ARB is associated with a decrease in the risk of mortality, whereas for patients with AKD accompanied by AKI, the use of ACEI/ARB significantly increases the risk of mortality. B illustrates the correlation between baseline eGFR and ACEI/ARB in predicting mortality. Among patients with lower baseline eGFR levels, the use of ACEI/ARB is associated with a slight increase in the risk of mortality. C depicts the correlation between neutrophil count and renal function trajectories in predicting mortality. D depicts the correlation between neutrophil count and antibiotics in predicting mortality.

Fig. S6. SHAP decision plots providing a detailed view of the inner workings of the LightGBM model. A, B provide personalized explanations for two cases with mortality probabilities below 10% and actual outcomes of survival. The direction of the line visualizes the decision process of the LightGBM model from the base value to the predicted value. The values adjacent to the line denote the measured values of the features. C, D provide personalized explanations for two cases with mortality probabilities exceeding 90% and actual outcomes of death.

Table S1. Characteristics of hospital encounters in the study sample, overall and according to mortality [mean ± SD; n (%)].

Table S2. Performance of eight ML models for predicting mortality.

Table S3. Performance of the LightGBM model for predicting mortality in the test set without data balancing.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Xu, L., Li, C., Zhang, J. et al. Personalized prediction of mortality in patients with acute ischemic stroke using explainable artificial intelligence. Eur J Med Res 29, 341 (2024). https://doi.org/10.1186/s40001-024-01940-2

Received: 08 March 2024
Accepted: 17 June 2024
Published: 20 June 2024
DOI: https://doi.org/10.1186/s40001-024-01940-2
