Zum Hauptinhalt springen

Data analytics approach for short- and long-term mortality prediction following acute non-ST-elevation myocardial infarction (NSTEMI) and Unstable Angina (UA) in Asians.

Kasim, S ; Amir Rudin, PNF ; et al.
In: PloS one, Jg. 19 (2024-02-15), Heft 2, S. e0298036
Online academicJournal

Data analytics approach for short- and long-term mortality prediction following acute non-ST-elevation myocardial infarction (NSTEMI) and Unstable Angina (UA) in Asians  Introduction

Background: Traditional risk assessment tools often lack accuracy when predicting the short- and long-term mortality following a non-ST-segment elevation myocardial infarction (NSTEMI) or Unstable Angina (UA) in specific population. Objective: To employ machine learning (ML) and stacked ensemble learning (EL) methods in predicting short- and long-term mortality in Asian patients diagnosed with NSTEMI/UA and to identify the associated features, subsequently evaluating these findings against established risk scores. Methods: We analyzed data from the National Cardiovascular Disease Database for Malaysia (2006–2019), representing a diverse NSTEMI/UA Asian cohort. Algorithm development utilized in-hospital records of 9,518 patients, 30-day data from 7,133 patients, and 1-year data from 7,031 patients. This study utilized 39 features, including demographic, cardiovascular risk, medication, and clinical features. In the development of the stacked EL model, four base learner algorithms were employed: eXtreme Gradient Boosting (XGB), Support Vector Machine (SVM), Naive Bayes (NB), and Random Forest (RF), with the Generalized Linear Model (GLM) serving as the meta learner. Significant features were chosen and ranked using ML feature importance with backward elimination. The predictive performance of the algorithms was assessed using the area under the curve (AUC) as a metric. Validation of the algorithms was conducted against the TIMI for NSTEMI/UA using a separate validation dataset, and the net reclassification index (NRI) was subsequently determined. Results: Using both complete and reduced features, the algorithm performance achieved an AUC ranging from 0.73 to 0.89. The top-performing ML algorithm consistently surpassed the TIMI risk score for in-hospital, 30-day, and 1-year predictions (with AUC values of 0.88, 0.88, and 0.81, respectively, all p < 0.001), while the TIMI scores registered significantly lower at 0.55, 0.54, and 0.61. This suggests the TIMI score tends to underestimate patient mortality risk. The net reclassification index (NRI) of the best ML algorithm for NSTEMI/UA patients across these periods yielded an NRI between 40–60% (p < 0.001) relative to the TIMI NSTEMI/UA risk score. Key features identified for both short- and long-term mortality included age, Killip class, heart rate, and Low-Molecular-Weight Heparin (LMWH) administration. Conclusions: In a broad multi-ethnic population, ML approaches outperformed conventional TIMI scoring in classifying patients with NSTEMI and UA. ML allows for the precise identification of unique characteristics within individual Asian populations, improving the accuracy of mortality predictions. Continuous development, testing, and validation of these ML algorithms holds the promise of enhanced risk stratification, thereby revolutionizing future management strategies and patient outcomes.

While modern techniques in percutaneous coronary intervention (PCI) have evolved, incorporating innovations like drug-eluting stents and the increased use of pharmacological agents such as beta-blockers and renin-angiotensin system blockers, acute myocardial infarction (AMI) still ranks as a major cause of mortality in Asia [[1]].

AMI can be classified as either ST-elevation myocardial infarction (STEMI) or non-ST-elevation myocardial infarction (NSTEMI), depending on the results of an electrocardiogram (ECG). It is crucial to have the ability to predict morbidity and mortality in AMI patients in order to identify those individuals who are at an increased risk, who therefore require intense therapy and vigilant monitoring [[3]].

Frequently used risk assessment techniques, such as Thrombolysis in Myocardial Infarction (TIMI) and the Global Registry of Acute Coronary Events (GRACE), play an important role in predicting the likelihood of death and recurrent cardiovascular events in patients with AMI, particularly those with NSTEMI [[4]]. The improved GRACE risk predictor (version 2.0) has been thoroughly evaluated for its accuracy in predicting outcomes in NSTEMI patients, broadening the scope of risk assessment tools beyond standard models [[6]]. Additionally, other models such as Naples Score [[7]] and Intermountain Risk Score [[8]], have also been used to validate mortality risk in patients with STEMI. It is worth noting that these risk scores were originally established to predict mortality for illnesses such as cancer and other health outcomes, demonstrating their adaptability in measuring the probability of adverse occurrences in many clinical circumstances.

The limitation of commonly used risk assessments tools for NSTEMI such TIMI and GRACE were formulated based on data from Western Caucasian AMI patients during the 1990s to early 2000s. This occurred prior to the introduction of drug-eluting stents and the most recent iteration of antiplatelet medications. Significantly, these methods inadequately represent the Asian demography, despite Asia comprising around 60% of the world's population [[9]]. Furthermore, the prediction models' outcomes were limited to short-term mortality, such as in-hospital, 14-day, and 30-day death [[3], [10]].

Recent research indicates that incorporating machine learning (ML) and stacking ensemble learning (EL) techniques can improve the effectiveness of AMI prediction models, overcoming the limits of conventional risk score methodology. The conventional risk score has several shortcomings, including a major dependence on linearity, parametric assumptions, and a restricted capabilities to examine higher-order interactions [[12]].

ML is tailored to manage complex data, facilitating the development of precise risk-prediction models [[13]]. Several ML algorithms, including Support Vector Machine (SVM), Random Forest (RF), Extreme Gradient Boosting (XGBoost), and Naive Bayes (NB), have validated this concept [[14]–[17]]. The ML and EL algorithms have been implemented at a population-specific level in countries like Malaysia, Korea, and China for mortality predictions. These algorithms outperformed conventional risk scores such as TIMI and GRACE in terms of Area Under the Curve (AUC) for mortality prediction post-ACS, STEMI, and NSTEMI [[9], [13], [18]–[21]]. Accurate prediction is useful for determining the appropriate level of care as well as the appropriate pharmacological or invasive treatment; however, there is only a limited risk classification model on NSTEMI/UA [[22]–[25]], and none of them have been built specifically for NSTEMI/UA Asian patients as of yet.

Furthermore, although some individual ML models for predicting post-ACS mortality demonstrate exceptional outcomes, there are still several obstacles that need to be addressed. Firstly, no singular ML technique consistently surpasses others within the same field. Furthermore, combining multiple algorithms may result in enhanced performance, especially in the medical domain. Accuracy is of utmost importance in this field, as it directly impacts the efficiency of diagnoses and the survival outcomes of patients [[26]]. As a result, new prognostic methods for predicting mortality in Asian NSTEMI/UA patients must be considered throughout time.

Patients who are admitted with NSTEMI/UA in lower-income or Asian nations typically have a younger age, are frequently diabetic, and are current smokers. They also tend to have fewer chronic health conditions. Additionally, a significant majority (76.7%) of these patients satisfy the requirements for an early invasive surgery, indicating an exceedingly high risk [[27]]. These NSTEMI/UA patients exhibit a higher likelihood of adverse outcomes such as mortality, recurrent myocardial infarction, symptomatic ischemia, severe arrhythmias, heart failure, and stroke. According to published studies [[28]–[31]], although the in-hospital mortality rate is lower for patients with NSTEMI/UA compared to those with STEMI, the long-term risk of CVD is similar between them, making risk assessment difficult for this NSTEMI/UA population. Therefore, the therapeutic significance of risk stratification in patients with AMI, especially those with NSTEMI/UA, is substantial. This is due to the wide range of clinical manifestations and subsequent prognosis associated with these conditions [[32]].

The primary objective of this study is to discern features associated with mortality and to develop ML and EL models that can accurately predict the short- and long-term mortality risks in Asian NSTEMI/UA patients. To realize this objective, four established ML algorithms—SVM, RF, XGBoost, and NB—will be employed. Additionally, these algorithms will serve as base learners in the construction of the stacked EL model. Our research has made a substantial contribution by identifying the significant prognostic features correlated with in-hospital, 30-day, and 1-year mortality in a multi-ethnic Asian cohort diagnosed with NSTEMI/UA, achieved using the SVM feature selection approach.

According to existing literature [[34]], the implementation of feature selection techniques is critical for improving the predictive accuracy of ML algorithms. Consequently, the identification of mortality-associated factors is crucial for identifying high-risk patient characteristics, thereby facilitating improved patient care in clinical settings. To achieve this, the SVM feature selection method is employed, followed by the construction of the ML, and stacking EL models utilizing the selected features. Then, we compare the performance of individual ML models to that of stacked EL models with complete and selected feature sets. Lastly, we intend to validate the performance measures of the developed prediction models using multi-ethnic registry data on Asian NSTEMI/UA and TIMI NSTEMI/UA risk scores.

Methods

Study data

Patient data for this study was collected from the Malaysian National Cardiovascular Disease Database (NCVD-ACS) registry, encompassing multiethnic Asian individuals hospitalized with NSTEMI and UA from 2006 to 2019. The Medical Review & Ethics Committee (MREC) of the Ministry of Health, Malaysia, authorized the establishment of the NCVD registry in 2007, with the approval code NMRR-07-20-250. An exemption from informed consent for participants was permitted by the MREC for this NCVD study. The registry includes standardized clinical, demographic, procedural, and patient outcomes from all consecutive patients at associated centers [[36]]. Ethical clearance for data collection was also obtained from the UITM Ethics Committee (Reference number: 600-TNCPI (5/1/6)) and the National Heart Association of Malaysia (NHAM).

The data was gathered using a standardized case report that covered the period from the patients' admission to the hospital until their discharge, as well as the subsequent follow-up. The major outcome of interest was the patients' survival status, which was classified as alive or dead. The dataset included information on patients' demographics, clinical presentation, initial diagnostic tests, electrocardiography results, therapeutic procedures, and pharmaceutical medication. Each individual patient was assigned a unique identification number to avoid any instances of duplication. The data used in this study were anonymized before analysis since our research focuses entirely on the values and features of the data without gaining access to any personal information about the patients.

The study included all patients who were enrolled in the NSTEMI/UA registry, without any events of exclusion. Patients who received reperfusion treatments such as primary percutaneous coronary intervention (PCI), angiography demonstrating spontaneous reperfusion, further fibrinolysis, or urgent coronary artery bypass grafting (CABG) were included in this group. Within this context, NSTEMI was defined as an incident of AMI with evidence of a positive cardiac marker but no ST-segment elevation in electrocardiographic (ECG) measurements. This study used a complete set of datasets, including a total of 39 features, in adherence to established clinical criteria. This study's features included sociodemographic characteristics, CVD diagnosis and severity, risk factors associated with CVD, comorbidities linked to CVD, comorbidities unrelated to CVD, biomarkers, and medication consumption. The yearly verification of mortality was carried out by establishing data links with the Malaysian National Registration Department. The following data points were originally meant to collect these features; however, they were unfortunately omitted from the research due to a significant number of missing values. The primary goal of this research was the development of an algorithm aimed at modifying policy endpoints, with a special emphasis on severe endpoints such as death, in order to augment the study's importance. This methodology is similarly used in other academic publications [[9], [38]].

Classification and sample pre-processing

The development of ML and stacked EL model was based on complete cases data to ensure the robustness and validity of the research findings. Complete cases refer to instances when there is no missing data related to the predictors. A total of 54,926 cases of NSTEMI/UA were obtained from the registry. Among these cases, 23,682 were classified as complete cases. From these, subsets of 9,518 for in-hospital, 7,133 for 30-day, and 7,031 for 1-year outcomes. The study used an extensive set of 39 features, consisting of 8 continuous and 31 categorical features. The frequency of missing data across these features is detailed in S1 Table.

For the secondary analysis, we included an additional 31,244 instances that had missing data. This resulted in a comprehensive dataset that consists of 54,926 patients. This missing cases was imputed through multivariate imputation by chained equations, utilizing predictive mean matching from the MICE package in R [[40]]. This imputation approach estimates missing values by utilizing actual data from similar cases, thus maintaining the integrity of the dataset.

An "incomplete dataset" is defined in the study as a dataset in which the missing data for any particular feature does not exceed 30%. It is crucial to emphasize that the missing data only refers to patient demographics and clinical features and does not include information on treatment outcomes. Given the prospective design of our dataset and the retrospective approach to data management, it is imperative to recognize that the missing values across all features occurred randomly and were not influenced by the researchers. This randomness of missing data, affirmed by its classification as Missing Completely at Random (MCAR), ensures that these missing values are uncorrelated with any of the features used or unused in the analysis, supporting the assumption that the missingness is independent of the observed or unobserved data. This randomness of missing data, as indicated by its classification as Missing Completely at Random (MCAR), ensures that these missing values are uncorrelated with any of the features used or omitted in the study, supporting the assumption that missingness is independent of observed or unobserved data.

Data splitting

In line with Kuhn and Johnson's methodology [[41]], the dataset was subjected to stratified random sampling and divided into two groups: 70% for model training and 30% for validation. This separation applied to both complete and imputed cases to avoid data leakage, as recommended by best practices [[42]]. To avoid potential duplications due to multiple admissions, patient data were aggregated under unique patient identities, ensuring that all records for a particular patient were solely included in either the training or validation set. For this study, a randomly generated patient ID system was used to replace actual patient identifiers, preserving data anonymity, and conforming to confidentiality procedures.

Data pre-processing techniques, including imputation for handling missing instances and balancing for both full and missing cases, were exclusively used for the training data. Normalization techniques were applied to the training data, and the same scaling parameters were then used to normalize the testing dataset, ensuring the integrity of the model evaluation process, and preventing data leakage.

In the study, distinct normalizing techniques were applied to the training and testing datasets in a different manner. The performance of the resulting model and the TIMI risk score for NSTEMI/UA was assessed using a validation dataset, which constituted 30% of the total data and was entirely excluded from the model training phase to preserve the integrity of the model's performance assessment.

Data balancing

Class imbalance between the minority non-survival group and the majority survival group was addressed by pre-processing the training dataset. To rectify the imbalance and provide an equal representation of both classes, a combined method of up sampling the minority class and down sampling the majority class was used, utilizing the ROSE package as described by Lunardon et al. (2014) [[43]]. This balanced the classes to prevent bias in the subsequent ML model training.

Data normalization

Normalization approaches were used in the study to ensure that all continuous features contribute equally to the model's performance, preventing larger scale features from dominating the pattern recognition process [[41]]. In this study, standardization, also known as z-score normalization, was applied to continuous features such as age, heart rate, systolic and diastolic blood pressure, total cholesterol, HDL, LDL, triglycerides, and fasting blood glucose to ensure a consistent scale across these variables.

Development of risk models

ML algorithms

Several ML techniques were used to develop prediction models for NSTEMI/ UA using the R programming language. These algorithms include Support Vector Machine (SVM) [[44]], Random Forest (RF) [[45]], Extreme Gradient Boosting (XGBoost) [[46]], and Naive Bayes [[47]].

The SVM is a robust ML algorithm known for its ability to classify by finding the optimal hyperplane that maximizes the margin between two classes. For our study, the SVM model utilized two types of kernels: a linear kernel for linearly separable data and a Radial Basis Function (RBF) kernel for non-linear data separation. RF is an EL technique that builds multiple decision trees during training. It utilizes the bagging technique to determine the classification outcome based on the majority vote from individual trees, improving prediction robustness and accuracy. XGBoost is part of the gradient boosting algorithm family. It is well-known for its regularization strategies, which are intended to prevent overfitting, which is a common problem in complex models, resulting in a more generalized model that performs better on unseen data. The Naive Bayes (NB) algorithm is a probabilistic classifier that applies Bayes' Theorem while making the 'naive' assumption of conditional independence between every pair of features given the class label. Despite its simplicity, NB can perform remarkably well, particularly when the independence assumption is obeyed. The selection of these algorithms was based on prior studies pertaining to mortality associated with cardiovascular disease [[13], [17], [48]–[51]]. A combination of random search and manual adjustment was used to fine-tune the hyperparameters of the ML and stacked EL models, ensuring that the models are optimally configured for the task. Details on the hyperparameters can be found in S2 Table.

Ensemble stacking

Stacking is a meta-learning technique that uses the outputs of various base learners as input for a subsequent meta-learner, which then makes the final prediction. This ensemble approach is well-known for its ability to outperform any individual classifier or regressor. We used several kinds of base learners for developing our stacked EL model, including SVM with both linear and RBF kernels, RF, XGBoost, and NB. The predictions from these base learners were combined using a meta-learner, a Generalized Linear Model (GLM), which combines the individual predictions into a final, ensemble prediction. This method relies on the strengths of each base learner and often generates improved predictive accuracy [[52]–[54]].

Feature selection and model interpretation

Previous research [[19], [49], [51]] has shown that features selected by SVM improve model performance when compared to other ML techniques. This led to the prioritization of SVM-ranked features in our feature selection procedure. The Backward Feature Elimination (BFE) method was used to systematically eliminate less significant features, ranking them ascendingly based on their contribution to the predictive abilities of the model [[55]]. This elimination was applied to the SVM-ranked features [[56]]. The models were trained and evaluated for each feature subset iteration using the 30% hold-out validation set, which was independent of the model training stage. This step was crucial in determining and choosing models that provided optimal performance while utilizing the fewest features possible, resulting in a more generalizable reliable model. These optimized feature sets were then used to develop the final ML and stacked EL models. To reduce the risk of overfitting, 10-fold cross-validation was used on the training dataset during the model training process [[57]].

The integration of ML models into clinical practice is impeded given their 'black-box' nature. The behavior of ML models, which are inherently agnostic, can be deduced by modifying inputs and analyzing the resulting output [[58]]. Changes can be made to align these inputs with human comprehensibility. Shapley Additive Explanations (SHAP) is employed in this study for interpreting the most optimal ML model. Shapley values are used to assess the impact of input features on the output of an ML model. The SHAP values indicate the level of significance that a model assigns to specific features, allowing for the establishment of a feature ranking [[59]].

Model evaluation, validation, and performance measures

In order to assess the calibration of the models, standardized metrics were utilized [[60]]. The area under the curve (AUC) was used as a predictor of performance due to its independence from class distribution. Additional performance metrics, including accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV), were used to further evaluate the calibration of the models. A paired resampled t-test was carried out as a means to comparison of the predictive performance of ML and stacked EL models [[61]]. In addition, the net reclassification improvement (NRI) was computed to assess the extent to which the enhanced ML model improved the accurate classification of positive and negative cases, as compared to the TIMI risk score for NSTEMI/UA [[62]].

Comparison with conventional method TIMI score

The TIMI scores for NSTEMI/UA obtained from the NCVD registry were used to validate the dataset's performance. In terms of predictive performance, we compared the TIMI score to our developed ML model and the stacked EL model. The area under the curve (AUC) was used as the primary metric to assess the predictive accuracy of the models. This assessment was carried out on a validation set that comprised 30% of the total data, ensuring an accurate assessment. A performance breakdown graph was generated as well to examine the TIMI score for NSTEMI/UA application in clinical practice with regard to established literature thresholds [[62]]. The ML model's high-risk group in this study is defined as those with a mortality probability greater than 50%, which corresponds to a TIMI score greater than 5.

Additional statistics

In our study, we presented continuous variables as the mean ± standard deviation, while categorical variables were displayed as frequency and percentage. The Chi-square test was used to analyze categorical features like Killip class and statin usage. The Man-Whitney test was used for non-normally distributed continuous variables, and the two-sided independent t-test was used for normally distributed continuous features. To determine the normality of the data, the Shapiro-Wilk Test was used, with a p-value less than 0.05 indicating a non-normal distribution. Both the ML and stacked EL models were evaluated using pairwise corrected resampled t-tests [[45], [57]], with a p-value of less than 0.001 considered statistically significant. Fig 1 provides a detailed overview of the workflow and methods used in this study.

Graph: Fig 1 info:doi/10.1371/journal.pone.0298036.g001

Results

Patient characteristics

Table 1 summarizes the baseline demographic and clinical characteristics of the NSTEMI/UA patient cohort, stratified by survival status at the in-hospital, 30-day, and 1-year time points. The data depict the patients' mean age, which is uniformly distributed around the age of 61, with a significantly higher proportion of males (approximately 73%). The data reveals a progressive increase in mortality rates among the patients, with in-hospital mortality at 3.8%, rising to 6.6% at 30 days, and reaching 16.5% by the 1-year follow-up. A statistically significant difference between survivor and non-survivor groups is observed across multiple variables, including age, smoking status, and a history of various comorbidities. Clinical measures such as heart rate, blood pressure, and Killip class, alongside laboratory parameters including cholesterol levels, glucose, and specific ECG changes, also exhibited significant differences between the two groups (p < 0.001 for all). This table highlights the differences in clinical features and outcomes for NSTEMI/UA patients and provides a statistical basis for further model development and analysis.

Graph

Table 1 Comprehensive baseline characteristics and outcome correlates of the NSTEMI/UA patient population.

Features (39)DescriptionIn-Hospital30-Days1-Year
All Cases (9518)Survival (9156)Non-survival (362)p-valueAll Cases (7133)Survival (6659)Non-survival (474)p-valueAll Cases (7031)Survival (5868)Non-survival (1163)p-value
Age61.22 ± 12.1860.96 ± 12.1567.92 ± 11.12<0.00161.37 ± 12.3660.86 ± 12.2668.54 ± 11.53<0.00161.35 ± 12.3860.02 ± 12.1668.03 ± 11.26<0.001
Gender1: Male6865 (72.13)6622 (72.32)243 (67.13)0.0355208 (73.01)4881 (73.3)327 (68.99)0.0475129 (72.95)4317 (73.57)812 (69.82)0.01
2: Female2653 (27.87)2534 (27.68)119 (32.87)1925 (26.99)1778 (26.7)147 (31.01)1902 (27.05)1551 (26.43)351 (30.18)
Race1: Malay4157 (43.68)3995 (43.63)162 (44.75)0.1913139 (44.01)2922 (43.88)217 (45.78)0.0083089 (43.93)2563 (43.68)526 (45.23)0.011
2: Chinese2711 (28.48)2595 (28.34)116 (32.04)2015 (28.25)1861 (27.95)154 (32.49)1985 (28.23)1641 (27.97)344 (29.58)
3: Indian1697 (17.83)1642 (17.93)55 (15.19)1152 (16.15)1085 (16.29)67 (14.14)1141 (16.23)950 (16.19)191 (16.42)
4: Others953 (10.01)924 (10.09)29 (8.01)827 (11.59)791 (11.88)36 (7.59)816 (11.61)714 (12.17)102 (8.77)
Smoking status1: Never4723 (49.62)4516 (49.32)207 (57.18)<0.0013534 (49.54)3261 (48.97)273 (57.59)<0.0013490 (49.64)2833 (48.28)657 (56.49)<0.001
2: Former2244 (23.58)2152 (23.5)92 (25.41)1600 (22.43)1492 (22.41)108 (22.78)1570 (22.33)1298 (22.12)272 (23.39)
3: Current2551 (26.8)2488 (27.17)63 (17.4)1999 (28.02)1906 (28.62)93 (19.62)1971 (28.03)1737 (29.6)234 (20.12)
History of Dyslipidemia1: Yes4624 (48.58)4469 (48.81)155 (42.82)0.0293431 (48.1)3220 (48.36)211 (44.51)0.1173376 (48.02)2817 (48.01)559 (48.07)0.996
2: No4894 (51.42)4687 (51.19)207 (57.18)3702 (51.9)3439 (51.64)263 (55.49)3655 (51.98)3051 (51.99)604 (51.93)
History of Diabetes1: Yes4484 (47.11)4287 (46.82)197 (54.42)0.0053352 (46.99)3077 (46.21)275 (58.02)<0.0013306 (47.02)2602 (44.34)704 (60.53)<0.001
2: No5034 (52.89)4869 (53.18)165 (45.58)3781 (53.01)3582 (53.79)199 (41.98)3725 (52.98)3266 (55.66)459 (39.47)
History of Hypertension1: Yes6878 (72.26)6595 (72.03)283 (78.18)0.0125118 (71.75)4746 (71.27)372 (78.48)0.0015034 (71.6)4093 (69.75)941 (80.91)<0.001
2: No2640 (27.74)2561 (27.97)79 (21.82)2015 (28.25)1913 (28.73)102 (21.52)1997 (28.4)1775 (30.25)222 (19.09)
Family History of Premature Cardiovascular Disease1: Yes1501 (15.77)1472 (16.08)29 (8.01)<0.0011147 (16.08)1111 (16.68)36 (7.59)01138 (16.19)1040 (17.72)98 (8.43)<0.001
2: No8017 (84.23)7684 (83.92)333 (91.99)5986 (83.92)5548 (83.32)438 (92.41)5893 (83.81)4828 (82.28)1065 (91.57)
History of Myocardial Infarction1: Yes1772 (18.62)1701 (18.58)71 (19.61)0.6691147 (16.08)1060 (15.92)87 (18.35)0.1831115 (15.86)904 (15.41)211 (18.14)0.022
2: No7746 (81.38)7455 (81.42)291 (80.39)5986 (83.92)5599 (84.08)387 (81.65)5916 (84.14)4964 (84.59)952 (81.86)
New onset angina (<2 weeks)1: Yes6270 (65.88)6046 (66.03)224 (61.88)0.1144849 (67.98)4546 (68.27)303 (63.92)0.0564794 (68.18)4070 (69.36)724 (62.25)<0.001
2: No3248 (34.12)3110 (33.97)138 (38.12)2284 (32.02)2113 (31.73)171 (36.08)2237 (31.82)1798 (30.64)439 (37.75)
History of Heart Failure1: Yes732 (7.69)673 (7.35)59 (16.3)<0.001521 (7.3)458 (6.88)63 (13.29)<0.001511 (7.27)326 (5.56)185 (15.91)<0.001
2: No8786 (92.31)8483 (92.65)303 (83.7)6612 (92.7)6201 (93.12)411 (86.71)6520 (92.73)5542 (94.44)978 (84.09)
History of Chronic Lung Disease1: Yes438 (4.6)415 (4.53)23 (6.35)0.135336 (4.71)316 (4.75)20 (4.22)0.682329 (4.68)263 (4.48)66 (5.67)0.092
2: No9080 (95.4)8741 (95.47)339 (93.65)6797 (95.29)6343 (95.25)454 (95.78)0.6826702 (95.32)5605 (95.52)1097 (94.33)0.092
History of Renal Disease1: Yes1051 (11.04)975 (10.65)76 (20.99)<0.001811 (11.37)699 (10.5)112 (23.63)<0.001796 (11.32)511 (8.71)285 (24.51)<0.001
2: No8467 (88.96)8181 (89.35)286 (79.01)6322 (88.63)5960 (89.5)362 (76.37)6235 (88.68)5357 (91.29)878 (75.49)
History of Cerebrovascular Disease1: Yes437 (4.59)409 (4.47)28 (7.73)0.005332 (4.65)298 (4.48)34 (7.17)0.010329 (4.68)246 (4.19)83 (7.14)<0.001
2: No9081 (95.41)8747 (95.53)334 (92.27)6801 (95.35)6361 (95.52)440 (92.83)6702 (95.32)5622 (95.81)1080 (92.86)
Heart rate (bpm)84.33 ± 20.7683.79 ± 20.3597.94 ± 25.79<0.00184.93 ± 21.0784.16 ± 20.6195.74 ± 24.28<0.00184.85 ± 21.0183.39 ± 20.3892.21 ± 22.54<0.001
Systolic Blood Pressure (mmHg)143.47 ± 27.39144.07 ± 27128.18 ± 32.36<0.001143.16 ± 27.35144.06 ± 26.78130.59 ± 31.77<0.001143.33 ± 27.26144.25 ± 26.46138.67 ± 30.54<0.001
Diastolic Blood Pressure (mmHg)81.61 ± 16.7781.9 ± 16.6374.14 ± 18.49<0.00181.27 ± 17.0381.72 ± 16.8575.03 ± 18.27<0.00181.34 ± 17.0281.99 ± 16.6978.08 ± 18.27<0.001
Killip Classification1: Killip I7166 (75.29)7070 (77.22)96 (26.52)<0.0015409 (75.83)5237 (78.65)172 (36.29)<0.0015362 (76.26)4789 (81.61)573 (49.27)<0.001
2: Killip II1407 (14.78)1340 (14.64)67 (18.51)929 (13.02)844 (12.67)85 (17.93)895 (12.73)644 (10.97)251 (21.58)
3: Killip III611 (6.42)540 (5.9)71 (19.61)504 (7.07)420 (6.31)84 (17.72)494 (7.03)312 (5.32)182 (15.65)
4: Killip IV334 (3.51)206 (2.25)128 (35.36)291 (4.08)158 (2.37)133 (28.06)280 (3.98)123 (2.1)157 (13.5)
Total Cholesterol (mmol/L)4.79 ± 1.344.8 ± 1.334.34 ± 1.34<0.0014.7 ± 1.344.72 ± 1.334.4 ± 1.46<0.0014.7 ± 1.344.75 ± 1.34.46 ± 1.48<0.001
HDL (mmol/L)1.08 ± 0.361.09 ± 0.351.05 ± 0.42<0.0011.08 ± 0.351.08 ± 0.351.03 ± 0.4<0.0011.08 ± 0.351.08 ± 0.351.05 ± 0.36<0.001
LDL (mmol/L)2.97 ± 1.22.98 ± 1.22.65 ± 1.17<0.0012.91 ± 1.212.92 ± 1.192.74 ± 1.34<0.0012.91 ± 1.22.94 ± 1.182.75 ± 1.29<0.001
Fasting Blood Glucose (mmol/L)7.65 ± 3.567.57 ± 3.469.7 ± 5.21<0.0017.7 ± 3.637.58 ± 3.59.38 ± 4.81<0.0017.7 ± 3.627.49 ± 3.438.76 ± 4.32<0.001
ST-segment Depression ≥ 0.5mm in ≥ 2 Contiguous Leads1: Selected3241 (34.05)3025 (33.04)216 (59.67)<0.0012348 (32.92)2100 (31.54)248 (52.32)<0.0012295 (32.64)1759 (29.98)536 (46.09)<0.001
2: Not selected6277 (65.95)6131 (66.96)146 (40.33)4785 (67.08)4559 (68.46)226 (47.68)4736 (67.36)4109 (70.02)627 (53.91)
T-wave inversion ≥1mm1: Selected3504 (36.81)3400 (37.13)104 (28.73)0.0012597 (36.41)2457 (36.9)140 (29.54)0.0022563 (36.45)2177 (37.1)386 (33.19)0.013
2: Not selected6014 (63.19)5756 (62.87)258 (71.27)4536 (63.59)4202 (63.1)334 (70.46)4468 (63.55)3691 (62.9)777 (66.81)0.013
Bundle Branch Block (BBB)1: Selected514 (5.4)484 (5.29)30 (8.29)0.018364 (5.1)326 (4.9)38 (8.02)0.004355 (5.05)268 (4.57)87 (7.48)<0.001
2: Not selected9004 (94.6)8672 (94.71)332 (91.71)6769 (94.9)6333 (95.1)436 (91.98)6676 (94.95)5600 (95.43)1076 (92.52)
Inferior Leads: II, III, aVF1: Selected1859 (19.53)1777 (19.41)82 (22.65)0.1441327 (18.6)1237 (18.58)90 (18.99)0.8721307 (18.59)1096 (18.68)211 (18.14)0.699
2: Not selected7659 (80.47)7379 (80.59)280 (77.35)5806 (81.4)5422 (81.42)384 (81.01)5724 (81.41)4772 (81.32)952 (81.86)0.699
Anterior Leads: V1 to V41: Selected2910 (30.57)2755 (30.09)155 (42.82)<0.0012048 (28.71)1873 (28.13)175 (36.92)<0.0012008 (28.56)1609 (27.42)399 (34.31)<0.001
2: Not selected6608 (69.43)6401 (69.91)207 (57.18)5085 (71.29)4786 (71.87)299 (63.08)5023 (71.44)4259 (72.58)764 (65.69)
Lateral Leads: 1, aVL, V5 to V61: Selected3488 (36.65)3338 (36.46)150 (41.44)0.0612626 (36.81)2422 (36.37)204 (43.04)0.0042583 (36.74)2046 (34.87)537 (46.17)<0.001
2: Not selected6030 (63.35)5818 (63.54)212 (58.56)4507 (63.19)4237 (63.63)270 (56.96)4448 (63.26)3822 (65.13)626 (53.83)
Cardiac Catheterization1: Yes2992 (31.44)2928 (31.98)64 (17.68)<0.0012337 (32.76)2238 (33.61)99 (20.89)<0.0012290 (32.57)2061 (35.12)229 (19.69)<0.001
2: No6526 (68.56)6228 (68.02)298 (82.32)4796 (67.24)4421 (66.39)375 (79.11)4741 (67.43)3807 (64.88)934 (80.31)
Percutaneous Coronary Intervention (PCI)1: Yes1623 (17.05)1592 (17.39)31 (8.56)<0.0011255 (17.59)1206 (18.11)49 (10.34)<0.0011239 (17.62)1139 (19.41)100 (8.6)<0.001
2: No7895 (82.95)7564 (82.61)331 (91.44)5878 (82.41)5453 (81.89)425 (89.66)5792 (82.38)4729 (80.59)1063 (91.4)
LMWH1: Yes3140 (32.99)2972 (32.46)168 (46.41)<0.0011489 (20.87)1298 (19.49)191 (40.3)<0.0011433 (20.38)1004 (17.11)429 (36.89)<0.001
2: No6378 (67.01)6184 (67.54)194 (53.59)5644 (79.13)5361 (80.51)283 (59.7)5598 (79.62)4864 (82.89)734 (63.11)
Beta-blocker1: Yes5903 (62.02)5785 (63.18)118 (32.6)<0.0014206 (58.97)4008 (60.19)198 (41.77)<0.0014126 (58.68)3475 (59.22)651 (55.98)0.043
2: No3615 (37.98)3371 (36.82)244 (67.4)2927 (41.03)2651 (39.81)276 (58.23)2905 (41.32)2393 (40.78)512 (44.02)
ACE Inhibitor1: Yes4881 (51.28)4811 (52.54)70 (19.34)<0.0013392 (47.55)3279 (49.24)113 (23.84)<0.0013338 (47.48)2959 (50.43)379 (32.59)<0.001
2: No4637 (48.72)4345 (47.46)292 (80.66)3741 (52.45)3380 (50.76)361 (76.16)3693 (52.52)2909 (49.57)784 (67.41)
Angiotensin II Receptor Blocker1: Yes762 (8.01)755 (8.25)7 (1.93)<0.001524 (7.35)510 (7.66)14 (2.95)<0.001515 (7.32)457 (7.79)58 (4.99)0.001
2: No8756 (91.99)8401 (91.75)355 (98.07)6609 (92.65)6149 (92.34)460 (97.05)6516 (92.68)5411 (92.21)1105 (95.01)
Statin1: Yes8948 (94.01)8638 (94.34)310 (85.64)<0.0016727 (94.31)6309 (94.74)418 (88.19)<0.0016630 (94.3)5564 (94.82)1066 (91.66)<0.001
2: No570 (5.99)518 (5.66)52 (14.36)406 (5.69)350 (5.26)56 (11.81)401 (5.7)304 (5.18)97 (8.34)
Diuretics1: Yes3018 (31.71)2818 (30.78)200 (55.25)<0.0012327 (32.62)2069 (31.07)258 (54.43)<0.0012269 (32.27)1637 (27.9)632 (54.34)<0.001
2: No6500 (68.29)6338 (69.22)162 (44.75)4806 (67.38)4590 (68.93)216 (45.57)4762 (67.73)4231 (72.1)531 (45.66)
Calcium antagonist1: Yes2114 (22.21)2066 (22.56)48 (13.26)<0.0011601 (22.44)1528 (22.95)73 (15.4)<0.0011589 (22.6)1338 (22.8)251 (21.58)0.384
2: No7404 (77.79)7090 (77.44)314 (86.74)5532 (77.56)5131 (77.05)401 (84.6)5442 (77.4)4530 (77.2)912 (78.42)
Oral Hypoglycemic Agent1: Yes2722 (28.6)2696 (29.45)26 (7.18)<0.0011978 (27.73)1912 (28.71)66 (13.92)<0.0011957 (27.83)1662 (28.32)295 (25.37)0.043
2: No6796 (71.4)6460 (70.55)336 (92.82)5155 (72.27)4747 (71.29)408 (86.08)5074 (72.17)4206 (71.68)868 (74.63)
Insulin1: Yes2114 (22.21)1998 (21.82)116 (32.04)<0.0011639 (22.98)1474 (22.14)165 (34.81)<0.0011613 (22.94)1231 (20.98)382 (32.85)<0.001
2: No7404 (77.79)7158 (78.18)246 (67.96)5494 (77.02)5185 (77.86)309 (65.19)5418 (77.06)4637 (79.02)781 (67.15)

1 • An asterisk (*) denotes that a p-value of less than 0.001 signifies a statistically significant difference between the features of survival and non-survival.

2 • Features with statistically significant values are highlighted in bold.

Algorithm performance on complete cases

Table 2 presents a comprehensive comparison of the AUC scores of the TIMI risk score with different ML models. The ML models, trained using both the entire set of 39 features and SVM selected subset, demonstrated better predictive abilities when compared to the TIMI NSTEMI/UA risk score in the validation dataset. Specifically, for NSTEMI/UA patient predictions, all ML models, particularly when applying feature selection, outperformed the full-feature models and the conventional TIMI risk scores. The XGBoost model exhibited the highest AUC scores for in-hospital and 30-day predictions, whereas the stacked EL model performed exceptionally well in the 1-year prediction (p-value < 0.001 for all models). Table 3 describes the detailed performance of ML models in relation to the TIMI risk score for NSTEMI/UA.

Graph

Table 2 The AUC for full and selected feature ML models relative to TIMI score in NSTEMI/UA based on a 30% validation dataset.

ModelsThe area under the ROC Curve (95% CI)
In-Hospital30-Days1-Year
LinSVM0.865 (0.827–0.902)0.845 (0.81–0.88)0.807 (0.783–0.831)
LinSVM-BFE0.881 (0.854–0.917)0.850 (0.816–0.885)0.802 (0.778–0.826)
RadSVM0.856 (0.817–0.896)0.841 (0.806–0.875)0.725 (0.696–0.755)
RadSVM-BFE0.856 (0.819–0.893)0.866 (0.833–0.899)0.801 (0.777–0.825)
RF0.853 (0.842–0.913)0.840 (0.822–0.878)0.824 (0.8–0.848)
RF-BFE0.868 (0.835–0.902)0.862 (0.83–0.894)0.802 (0.778–0.826)
XGB0.877 (0.844–0.91)0.867 (0.835–0.898)0.801 (0.777–0.826)
XGB-BFE0.883 (0.851–0.915)0.874 (0.843–0.905)0.798 (0.774–0.823)
NB0.874 (0.838–0.91)0.854 (0.82–0.888)0.796 (0.771–0.821)
NB-BFE0.867 (0.831–0.903)0.862 (0.831–0.893)0.8 (0.776–0.824)
EnsembleGLM0.837 (0.792–0.882)0.865 (0.834–0.897)0.796 (0.771–0.82)
EnsembleGLM-BFE0.862 (0.827–0.896)0.873 (0.857–0.903)0.806 (0.782–0.83)
TIMI (NSTEMI/UA)0.553 (0.502–0.603)0.54 (0.495–0.586)0.602 (0.57–0.634)

  • 3 Abbreviations
  • 4 LinSVM = Support Vector Machine with Linear Kernel
  • 5 LinSVM-BFE = LinSVM with Backward Feature Elimination
  • 6 RadSVM = Support Vector Machine with Radial Kernel
  • 7 RadSVM-BFE = RadSVM classifier with Backward Feature Elimination
  • 8 RF = Random Forest
  • 9 RF-BFE = RF with Backward Feature Elimination
  • 10 XGB = XGBoost
  • 11 XGB-BFE = XGB with Backward Feature Elimination
  • 12 NB = Naive Bayes
  • 13 NB-BFE = NB with Backward Feature Elimination
  • 14 EnsembleGLM = Stacked Ensemble Learning with GLM meta learner
  • 15 EnsembleGLM-BFE = EnsembleGLM with Backward Feature Elimination

Graph

Table 3 Detailed performance metrics for full and selected feature ML models relative to TIMI score in NSTEMI/UA based on a 30% validation dataset.

ModelsAccuracy (95% CI)SensitivitySpecificityPPVNPVMcnemar's test (p-value)
In-hospital
LinSVM0.852 (0.838–0.865)0.7340.8570.1690.988<0.001
LinSVM-BFE0.839 (0.825–0.852)0.7340.8430.1560.988<0.001
RadSVM0.959 (0.951–0.966)0.2110.9880.4180.969<0.001
RadSVM-BFE0.862 (0.848–0.874)0.6150.8710.160.983<0.001
RF0.945 (0.936–0.953)0.5690.960.360.982<0.001
RF-BFE0.882 (0.87–0.894)0.670.8910.1960.986<0.001
XGB0.936 (0.927–0.945)0.5140.9530.3030.98<0.001
XGB-BFE0.874 (0.861–0.886)0.7160.880.1910.987<0.001
NB0.862 (0.848–0.874)0.7250.8670.1780.988<0.001
NB-BFE0.87 (0.857–0.882)0.6790.8770.180.986<0.001
EnsembleGLM0.961 (0.953–0.968)0.220.9910.480.97<0.001
EnsembleGLM-BFE0.878 (0.865–0.89)0.6330.8880.1830.984<0.001
TIMI (NSTEMI/UA)0.946 (0.938–0.954)0.0280.9830.060.962<0.001
30-Days
LinSVM0.796 (0.779–0.813)0.7550.7990.2120.979<0.001
LinSVM-BFE0.79 (0.772–0.807)0.7410.7940.2050.977<0.001
RadSVM0.9 (0.887–0.913)0.5870.9230.3530.969<0.001
RadSVM-BFE0.807 (0.789–0.823)0.7060.8140.2140.975<0.001
RF0.892 (0.878–0.905)0.6360.910.3370.972<0.001
RF-BFE0.812 (0.795–0.828)0.720.8190.2220.976<0.001
XGB0.869 (0.854–0.883)0.6780.8820.2920.975<0.001
XGB-BFE0.815 (0.798–0.832)0.7480.820.230.978<0.001
NB0.864 (0.849–0.879)0.6990.8760.2880.976<0.001
NB-BFE0.898 (0.884–0.91)0.580.920.3430.968<0.001
EnsembleGLM0.91 (0.897–0.922)0.5660.9340.3820.968<0.001
EnsembleGLM-BFE0.817 (0.8–0.833)0.7130.8250.2260.976<0.001
TIMI (NSTEMI/UA)0.917 (0.905–0.929)0.0210.9810.0750.933<0.001
1-Year
LinSVM0.754 (0.735–0.772)0.7110.7630.3720.93<0.001
LinSVM-BFE0.752 (0.733–0.77)0.6760.7670.3650.923<0.001
RadSVM0.789 (0.771–0.806)0.3720.8720.3650.875<0.001
RadSVM-BFE0.728 (0.708–0.747)0.7110.7310.3440.927<0.001
RF0.802 (0.784–0.819)0.6590.830.4350.925<0.001
RF-BFE0.751 (0.732–0.77)0.6960.7620.3670.927<0.001
XGB0.784 (0.766–0.801)0.6190.8170.4010.915<0.001
XGB-BFE0.734 (0.714–0.752)0.7020.740.3490.926<0.001
NB0.829 (0.813–0.845)0.4470.9050.4830.892<0.001
NB-BFE0.817 (0.8–0.833)0.4960.8810.4520.898<0.001
EnsembleGLM0.81 (0.793–0.827)0.4580.880.4310.891<0.001
EnsembleGLM-BFE0.73 (0.711–0.749)0.7220.7320.3480.93<0.001
TIMI (NSTEMI/UA)0.83 (0.813–0.846)0.0490.9850.3860.839<0.001

This study conducted a secondary analysis to evaluate the predictive performance of the best-performing ML models that were trained on imputed datasets. The analysis showed that the XGBoost model achieved an AUC of 0.89 (95% CI: 0.859–0.918) for predicting in-hospital outcomes and 0.86 (95% CI: 0.83–0.897) for predicting outcomes within 30 days. On the other hand, the stacked EL model achieved an AUC of 0.798 (95% CI: 0.774–0.822) for predicting outcomes within 1 year. The models trained on imputed data consistently performed well, with AUC values of 0.88 (95% CI: 0.851–0.915) for in-hospital mortality, 0.87 (95% CI: 0.843–0.905) for 30-day mortality, and 0.81 (95% CI: 0.782–0.83) for 1-year mortality, when compared to the results from the complete cases dataset. The models' robustness is emphasized by the statistical significance of their performance in predicting short-term and long-term outcomes, both in imputed and complete case datasets (p<0.001 for all comparisons). The results confirm the reliability of utilizing imputed data when complete data is not accessible, thereby enabling the wider application of predictive modeling in clinical environments.

Feature selection

As detailed in the methods section, BFE was combined with SVM algorithms to create models with optimized predictive accuracy. Table 4 displays a comparison of the features identified through ML feature selection and those utilized in the TIMI risk score for NSTEMI/UA. ML consistently identified patient age, Killip class, heart rate, and the use of Low-Molecular-Weight Heparin (LMWH) as significant factors for predicting outcomes within the hospital, at 30 days, and at 1 year. Age was the only predictor that was common to both the ML-selected features and the TIMI risk score across all evaluated timeframes.

Graph

Table 4 Feature selection comparison across timeframes: ML algorithms versus TIMI risk score for NSTEMI/UA.

TIMI–NSTEMI/UAML SelectedML SelectedML Selected
In-hospital30-Days1-Year
(7 predictors)(13 predictors)(11 predictors)(12 predictors)
• Patient Age • Risk factors of CAD • Use of Aspirin • Known CAD • Angina episodes • ST-segment deviation • Elevated cardiac markers• Patient Age • Gender • History of hypertension • History of MI • History of Chronic Lung Disease • Heart rate • Systolic Blood Pressure • Killip Class • Fasting Blood Glucose • ST-segment Depression ≥ 0.5mm LMWH • Angiotensin II Receptor Blocker • Beta-blocker• Patient Age • Chronic renal disease • Heart rate • Systolic Blood Pressure • Killip Class • HDLC • ECG bundle branch block • ECG-location inferior lead • Cardiac catheterization • LMWH • Oral hypoglycemic agent• Patient Age • Chronic angina (≥ 2 weeks) • Chronic renal disease • Heart rate • Killip Class • LDLC • ST-segment Depression ≥ 0.5mm • ECG bundle branch block • ECG-location lateral lead • PCI • Cardiac catheterization • LMWH • Diuretics

Model interpretation

The SHAP summary plots (Figs 2–4) demonstrate the distribution of SHAP values for the significant features identified through SVM-based feature selection. These features are used to predict in-hospital, 30-day, and 1-year outcomes for NSTEMI/UA patients.

Graph: Fig 2 info:doi/10.1371/journal.pone.0298036.g002

Graph: Fig 3 info:doi/10.1371/journal.pone.0298036.g003

Graph: Fig 4 info:doi/10.1371/journal.pone.0298036.g004

The vertical axis of each plot displays the features in a descending order of their influence on the model's predictions. The x-axis corresponds to the SHAP value, which measures the influence of each feature on the model's output. The color gradient indicates the original value of the feature, with red representing higher values and blue representing lower values.

In the in-hospital prediction model (Fig 2), factors such as Killip class, age, and fasting blood glucose have a significant impact on the model's output. Higher values of these features tend to increase the SHAP value, which indicates a more profound risk of negative outcomes. In the 30-day prediction model (Fig 3), systolic blood pressure and heart rate are identified as significant factors, with higher values indicating an increased risk. Furthermore, the 1-year prognostic model (Fig 4) indicates that chronic ailments like renal disease and prior myocardial infarction play a substantial role in influencing the predictions. The presence or elevated levels of these conditions are closely linked to heightened risk.

The plots clearly demonstrate a consistent association between higher Killip classes and elevated heart rates with higher SHAP values across all models. This suggests that these features are strong predictors of mortality at various stages after NSTEMI/UA. On the other hand, the use of medications such as Low-Molecular-Weight Heparin (LMWH) is shown to have lower SHAP values, suggesting a possible preventative effect.

Comparison with TIMI conventional risk score

The mortality rate distributions predicted by the TIMI risk score, and the best performing ML models developed for this study are illustrated in Figs 5–7. These figures are crucial for illustrating the models' reliability in forecasting the mortality risk of NSTEMI/UA patients over various time periods. The ML risk score, determined through SVM feature selection, classifies patients into two groups: low-risk (with a mortality probability below 50%) and high-risk (with a mortality probability of 50% or higher). This classification aligns with the established standards, where a TIMI score of 5 or less indicates low risk, and a TIMI score greater than 5 indicates high risk [[63]].

Graph: Fig 5 info:doi/10.1371/journal.pone.0298036.g005

Graph: Fig 6 info:doi/10.1371/journal.pone.0298036.g006

Graph: Fig 7 info:doi/10.1371/journal.pone.0298036.g007

Table 5 displays the mortality percentages categorized by the TIMI risk score and the best performing ML models that include SVM feature selection. The data classifies patients into two categories based on their risk of mortality: low risk being equivalent to a TIMI score ≤5 and ML probabilities < 0.5, and high risk corresponding to a TIMI score > 5 and ML probabilities ≥ 0.5.

Graph

Table 5 Comparative mortality risk stratification by ML models and TIMI score in NSTEMI/UA patients.

ModelLow Risk (%)High Risk (%)
In-Hospital
TIMI risk scores3.786
XGB-BFE1.2719.12
EnsembleGLM-BFE1.6118.25
30-Days
TIMI risk scores6.677.5
XGB-BFE2.1522.96
EnsembleGLM-BFE2.4322.57
1-Year
TIMI risk scores16.0738.64
XGB-BFE7.3934.85
EnsembleGLM-BFE734.81

The ML models, specifically the XGB and stacked EL models, which employ SVM-selected features, exhibit a more precise classification of patients into risk groups. When compared to the conventional TIMI scoring system, they demonstrate significantly lower rates of low-risk in-hospital mortality predictions and higher rates of high-risk predictions across all timeframes. This demonstrates an improved predictive capability, as the ML models are able to more accurately identify patients who are at a higher risk of mortality. For instance, the XGB model demonstrates a mortality prediction of 1.27% for low-risk in-hospital cases, which is notably lower than the TIMI score's prediction of 3.78%. Additionally, the XGB model predicts a high-risk rate of 19.12%, while the TIMI score predicts a rate of 6%. Similarly, the stacked EL model provides comparable discrimination, with a 1.61% rate for low-risk and an 18.25% rate for high-risk in-hospital cases. The consistency of these findings is evident in both the 30-day and 1-year forecasts, further confirming the reliability of the ML models in predicting outcomes for patients with NSTEMI/UA over different time frames.

Net reclassification index (NRI) analysis

The Net Reclassification Index (NRI) Analysis quantifies the improvement in risk stratification established by ML compared to the conventional TIMI risk scores for predicting in-hospital, 30-day, and 1-year mortality in patients with NTEMI/UA. Tables 6–8 demonstrate a breakdown of the NRI for these respective timeframes, demonstrating a significant enhancement in the accuracy of risk classification. The ML models illustrate an evident improvement in reclassification: 51% for in-hospital, 56.6% for 30-day, and 42% for 1-year outcomes, highlighting their superior predictive abilities. The improvements in classification accuracy are statistically significant, as indicated by a p-value of less than 0.00001. This emphasizes the reliability of the ML approach in accurately identifying patients at different levels of risk, surpassing the TIMI score for NSTEMI/UA.

Graph

Table 6 In-hospital mortality reclassification: NRI of ML models versus TIMI score.

In-Hospital
Individuals with Events (Died) (n = 109)
Number of IndividualsReclassificationNet Correctly Reclassified (%)
Machine LearningIncreased RiskDecreased Risk
Low RiskHigh Risk
TIMI Score
Low Risk386868260.55%
High Risk21
Individuals without Events (Alive) (n = 2747)
Number of IndividualsReclassificationNet Correctly Reclassified (%)
Machine LearningIncreased RiskDecreased Risk
Low RiskHigh Risk
TIMI Score
Low Risk240329729735-9.54%
High Risk3512
NRI (%)60.55–9.54 = 51.01
Z, p-valueZ =

51.0168+21092+297+3528472

= 662.13 662.13, p < 0.00001
ConclusionThe ML approach demonstrates a higher predictive ability in forecasting mortality in NSTEMI/UA patients compared to the TIMI Risk Score, resulting in a noticeable 51.01% increase in accurate classification.

Graph

Table 7 30-days mortality reclassification: NRI of ML models versus TIMI score.

30-Days
Individuals with Events (Died) (n = 143)
Number of IndividualsReclassificationNet Correctly Reclassified (%)
Machine LearningIncreased RiskDecreased Risk72.73%
Low RiskHigh Risk1040
TIMI Score
Low Risk36104
High Risk03
Individuals without Events (Alive) (n = 1997)
Number of IndividualsReclassificationNet Correctly Reclassified (%)
Machine LearningIncreased RiskDecreased Risk-16.12%
Low RiskHigh Risk34321
TIMI Score
Low Risk1617343
High Risk2116
NRI (%)72.73–16.12 = 56.6
Z, p-valueZ =

56.6104+01432+343+2119972

= 786.68 786.86, p < 0.00001
ConclusionThe ML approach demonstrates a higher predictive ability in forecasting mortality in NSTEMI/UA patients compared to the TIMI Risk Score, resulting in a noticeable 56.6% increase in accurate classification.

Graph

Table 8 1-year mortality reclassification: NRI of ML models versus TIMI score.

1-Year
Individuals with Events (Died) (n = 349)
Number of IndividualsReclassificationNet Correctly Reclassified (%)
Machine LearningIncreased RiskDecreased Risk67.34%
Low RiskHigh Risk2361
TIMI Score
Low Risk96236
High Risk116
Individuals without Events (Alive) (n = 1761)
Number of IndividualsReclassificationNet Correctly Reclassified (%)
Machine LearningIncreased RiskDecreased Risk
Low RiskHigh Risk
TIMI Score
Low Risk127845645611-25.27%
High Risk1116
NRI (%)67.34–25.27 = 42.07
Z, p-valueZ =

42.07236+1349+456+1117612

= 918.74 918.74, p < 0.00001
ConclusionThe ML approach demonstrates a higher predictive ability in forecasting mortality in NSTEMI/UA patients compared to the TIMI Risk Score, resulting in a noticeable 42.07% increase in accurate classification.

Discussion

This groundbreaking study represents the initial application of employing ML with stacked EL models to enhance the prediction performance of in-hospital, 30-days and 1-year mortality risk in Asian NSTEMI/UA patients, demonstrating the superiority of over the conventional TIMI scoring method. The ML techniques have been improved using a feature set chosen by SVM and further optimized through backward elimination. The outcomes of this study are: i) Using a selected feature set, the XGBoost model achieved an AUC of 0.883 for in-hospital mortality predictions and 0.874 for 30-day outcomes, while the stacked EL model achieved an AUC of 0.806 for one-year mortality, indicating that models developed with a selected feature set outperform those developed with a full feature set ii) Individual ML models and stacked EL models both outperformed the TIMI scoring system for NSTEMI/UA, with the ML approach resulting in a significant increase in classification accuracy.

ML algorithms have been shown to improve prognostic models for ACS patients using specific datasets from various regions, including the United States [[64]–[67]], Sweden [[32]], Sweden [[35]], Poland [[68]], Israel [[38]], China [[20]], Korea [[9]], and Singapore [[69]]. Despite these advances, ML research on NSTEMI/UA patients in the Asian demographic is limited. In this context, Lee et al. (2021) [[24]] discovered that ML outperformed the TIMI risk score in predicting outcomes for Korean NSTEMI/UA patients, demonstrating higher AUC values (In-hospital: 0.89 vs 0.67, 3 months: 0.85 vs 67, 1-year: 0.86 vs 0.68), which is consistent with the findings of this study.

The TIMI risk score for NSTEMI/UA was developed to estimate the 14-day mortality risk and has remained unchanged for over two decades [[5], [24], [62]]. Although it demonstrated an AUC of 0.80 for 30-day mortality prediction in Western populations [[70]], its applicability in Asian cohorts has been less convincing. Kumar et al. (2021) found a TIMI score AUC of 0.78 in Pakistan [[71]], while a study in China found AUCs of 0.52, 0.50, and 0.52 for 6-month, 1-year, and 2-year mortality risks [[72]]. According to Indonesian research, the 14-day mortality prediction AUC for NSTEMI was 0.61 and 0.63 for UA [[73]], while Malaysian data revealed an in-hospital mortality prediction AUC of only 0.55 [[51]]. The disparities in performance can be attributed to the TIMI score's origins in predominantly Caucasian populations, with a lack of representation of Asians, who frequently present with ACS at younger ages and have higher rates of diabetes, hypertension, renal failure, and delays in seeking medical care [[74]–[76]]. Furthermore, the TIMI score has been criticized for underestimating mortality in high-risk groups [[19], [77]], which is consistent with the current study's findings.

ML models have been improved in the NSTEMI/UA patient group with a selected group of features that proved to be most effective for improving performance—13 features for in-hospital, 11 for 30-day, and 12 for 1-year predictions. These models, which were optimized using Backward Feature Elimination (BFE), outperformed other ML models that included the entire set of features, as well as the TIMI risk score. The importance of feature selection algorithms in mortality prediction utilizing ML is emphasized, as combining these algorithms with classification methods has been shown to produce better results than using classifiers alone [[35], [48], [78]]. Feature selection not only improves ML model performance by reducing the number of predictors to a manageable number, but it also reduces the predictor space's complexity [[79]]. Streamlining predictors improved model performance in this study. This is due to the remaining features being clinically significant for mortality prediction, whereas the excluded features were determined to be non-contributory to the predictive accuracy of the models [[80]].

Treatment strategies were integrated into the best performing ML models for mortality prediction, yielding significant results. In particular, invasive procedures such as cardiac catheterization has been associated with improved 30-day survival rates in NSTEMI/UA patients [[81]]. Depending on the clinical stage of the condition, treatment pathways may differ, with catheter-based interventions frequently being the preferred course of action. Traditional ACS management has focused on antiplatelet and anticoagulant therapies, anti-anginal medications, aggressive lipid-lowering treatments, and risk factor modification. The introduction of thrombolytic therapy transformed the ACS therapeutic landscape. The current study's findings, as validated by SHAP analysis, are consistent with these well-established clinical practices and outcomes.

In this study, non-invasive treatment predictors, specifically pharmacotherapy encompassing antihypertensives (ACE inhibitors, beta-blockers, diuretics, ARBs), antidiabetics (oral hypoglycemic agents, insulin), and antiplatelet medications, were used to predict in-hospital, 30-day, and 1-year mortality. This is consistent with the findings of the SHAP analysis. These medications are typically given as an acute treatment to improve neurohumoral control in response to negative left ventricular remodeling. The requirement for such treatments may indicate a more compromised cardiac state, which explains their strong association with an increased risk of death [[83]].

The Malaysian NCVD registry frequently reports a high incidence of CVD risk factors such as hypertension, diabetes, dyslipidemia, smoking, and obesity [[84]]. Over 95% of people hospitalized for ACS have at least one established CVD risk factor. Concurrently, patients with chronic kidney disease have a threefold increased risk of myocardial infarction, as well as higher morbidity and mortality rates [[85]]. Uncontrolled CVD risk factors cause negative systemic changes that can lead to a variety of cardiovascular diseases such as heart failure, cerebrovascular renal failure, and peripheral arterial disease [[86]].

This study shows that stacked EL has the potential to predict one-year mortality in Asian NSTEMI/UA patients. Even minor improvements in predictive model performance can have a significant impact in healthcare [[87]]. However, it is important to note that the stacked EL method does not always outperform individual ML models; its effectiveness is heavily dependent on the diversity of the base learners and models used [[88]]. The current study supports this by demonstrating that the performance gains of the stacked EL model were not significantly higher than those of the single ML models that served as its base, which is consistent with previous research [[13], [58]].

Data imputation was used to ensure the accuracy of our analysis. Cases with more than 30% missing information were initially excluded to avoid potential biases caused by imputation. Due to the large size of the remaining dataset, this exclusion criterion was not deemed restrictive. The dataset's completeness for all follow-up intervals enabled the development of risk calculators for both ML models and the TIMI risk score. Using complete cases to identify features that predict short- and long-term mortality, on the other hand, is likely to yield more reliable results. Following that, analysis using both incomplete and imputed data produced similar results, confirming the findings' validity.

In line with the findings by Hayıroğlu and Altay (2023) [[89]], our research demonstrates AI's rising impact on healthcare, particularly in cardiology. Artificial intelligence algorithms have demonstrated effectiveness in predicting outcomes such as atrial fibrillation recurrence and measuring cardiovascular risk using complete data analysis that includes medical history and genetic information. These capabilities complement the findings our study, in which ML methods, specifically the stacked EL methodology, enhanced the accuracy of mortality prediction for Asian NSTEMI/UA patients. This is especially important given the complexity of cardiovascular conditions and the need for precise risk classification, which potentially dictate better clinical decision-making and eventually lead to better patient care outcomes.

There are several limitations to consider in this investigation. It primarily compares the performance of ML models across in-hospital, 30-day, and 1-year time periods to the TIMI risk score for NSTEMI/UA, which was designed to predict 14-day mortality. Furthermore, because the study focused on a multi-ethnic Malaysian cohort, the findings may not be directly applicable to countries with predominantly Caucasian populations. Another limitation is the lack of comparisons with other prognostic models such as HEART and GRACE, a decision influenced by the dataset's structure and the large amount of missing data in certain variables. Furthermore, the possibility of selection bias in patient registries should not be overlooked. As a result, future empirical studies in real-world settings will be required to validate these findings.

The risk scores generated by this study have the potential to be integrated into hospital electronic health record systems for clinician use. Instead of relying solely on administrative databases, future research could aim to implement and validate these risk scores within clinical registries. Given the demographic diversity of the NCVD registry, which includes Malay, Chinese, and Indian people, the study's findings are particularly relevant to the Asian context. This is especially important for Malaysia, Brunei, and Singapore, as well as other populous countries like China and India [[90]].

Conclusion

Our study demonstrates that the combination of ML and stacked EL, along with feature selection techniques, can improve the accuracy of mortality prediction in Asian NSTEMI/UA patients. This approach complements and enhances the conventional risk scoring methods, such as the TIMI risk score. With this predictive tool, healthcare professionals can identify patients at high risk right away in the emergency department, allowing for targeted and effective management based on their risk profiles. Future research will focus on integrating this algorithm into online platforms to improve accessibility and practical application in clinical settings.

Supporting information

S1 Table

The rates of missing data for each feature used in this study.

(PDF)

S2 Table

The parameters setting values for optimum ML model performance.

(PDF)

We would like to acknowledge the National Heart Association Malaysia for sharing us with the data for this study.

Footnotes 1 The authors have declared that no competing interests exist. References Kim Y, Ahn Y, Cho MC, Kim CJ, Kim YJ, Jeong MH. Current status of acute myocardial infarction in Korea. The Korean journal of internal medicine. (2019); 34(1):1. doi: 10.3904/kjim.2018.381, 30612415 2 Reddy K, Khaliq A, Henning RJ. Recent advances in the diagnosis and treatment of acute myocardial infarction. World journal of cardiology. (2015); 7(5):243. doi: 10.4330/wjc.v7.i5.243, 26015857 3 Castro-Dominguez Y, Dharmarajan K, McNamara RL. Predicting death after acute myocardial infarction. Trends in Cardiovascular Medicine. (2018); 28(2):102–9. doi: 10.1016/j.tcm.2017.07.011, 28826668 4 Eagle KA, Lim MJ, Dabbous OH, Pieper KS, Goldberg RJ, Van de Werf F, et al. A validated prediction model for all forms of acute coronary syndrome: estimating the risk of 6-month postdischarge death in an international registry. Jama. (2004); 291(22):2727–33. doi: 10.1001/jama.291.22.2727, 15187054 5 Morrow DA, Antman EM, Charlesworth A, Cairns R, Murphy SA, de Lemos JA, et al. TIMI risk score for ST-elevation myocardial infarction: a convenient, bedside, clinical score for risk assessment at presentation: an intravenous nPA for treatment of infarcting myocardium early II trial substudy. Circulation. (2000); 102(17):2031–7. doi: 10.1161/01.cir.102.17.2031, 11044416 6 Akyuz S, Yazici S, Bozbeyoglu E, Onuk T, Yildirimturk O, Karacimen D, et al. Validity of the updated GRACE risk predictor (version 2.0) in patients with non-ST-elevation acute coronary syndrome. Revista Portuguesa de Cardiologia. (2016); 35(1):25–31. doi: 10.1016/j.repc.2015.07.010, 26718493 7 Şaylık F, Çınar T, Selçuk M, Akbulut T, Hayıroğlu M, Tanboğa İ H. Evaluation of Naples Score for Long-Term Mortality in Patients With ST-Segment Elevation Myocardial Infarction Undergoing Primary Percutaneous Coronary Intervention. Angiology. (2023):33197231170982. doi: 10.1177/00033197231170982, 37058422 8 Çınar T, Şaylık F, Akbulut T, Korkmaz Y, Çiçek V, Asal S, et al. Evaluation of Intermountain Risk Score for Short- and Long-Term Mortality in ST Elevation Myocardial Infarction Patients. Angiology. (2023); 74(4):357–64. doi: 10.1177/00033197221105753, 35635200 9 Kwon J-m Jeon K-H, Kim HM Kim MJ, Lim S Kim K-H, et al. Deep-learning-based risk stratification for mortality of patients with acute myocardial infarction. PloS one. (2019); 14(10):e0224502. doi: 10.1371/journal.pone.0224502, 31671144 McNamara RL, Kennedy KF, Cohen DJ, Diercks DB, Moscucci M, Ramee S, et al. Predicting In-Hospital Mortality in Patients With Acute Myocardial Infarction. Journal of the American College of Cardiology. (2016); 68(6):626–35. McNamara RL, Wang Y, Partovian C, Montague J, Mody P, Eddy E, et al. Development of a Hospital Outcome Measure Intended for Use With Electronic Health Records: 30-Day Risk-standardized Mortality After Acute Myocardial Infarction. Medical Care. (2015); 53(9). doi: 10.1097/MLR.0000000000000402, 26225445 Gibson WJ, Nafee T, Travis R, Yee M, Kerneis M, Ohman M, et al. Machine learning versus traditional risk stratification methods in acute coronary syndrome: a pooled randomized clinical trial analysis. Journal of Thrombosis and Thrombolysis. (2020); 49(1):1–9. doi: 10.1007/s11239-019-01940-8, 31535314 Zheng H, Sherazi SWA, Lee JY. A Stacking Ensemble Prediction Model for the Occurrences of Major Adverse Cardiovascular Events in Patients with Acute Coronary Syndrome on Imbalanced Data. IEEE Access. (2021); 9:113692–704. Chang W, Liu Y, Xiao Y, Yuan X, Xu X, Zhang S, et al. A machine-learning-based prediction method for hypertension outcomes based on medical data. Diagnostics. (2019); 9(4):178. doi: 10.3390/diagnostics9040178, 31703364 Davagdorj K, Lee JS, Pham VH, Ryu KH. A comparative analysis of machine learning methods for class imbalance in a smoking cessation intervention. Applied Sciences. (2020); 10(9):3307. Saqlain M, Jargalsaikhan B, Lee JY. A voting ensemble classifier for wafer map defect patterns identification in semiconductor manufacturing. IEEE Transactions on Semiconductor Manufacturing. (2019); 32(2):171–82. Sherazi SWA, Jeong YJ, Jae MH, Bae J-W, Lee JY. A machine learning–based 1-year mortality prediction model after hospital discharge for clinical patients with acute coronary syndrome. Health informatics journal. (2020); 26(2):1289–304. doi: 10.1177/1460458219871780, 31566458 Jargalsaikhan B, Saqlain M, Abbas SSW, Jae MH, Kang IU, Ali S, et al. The early prediction acute myocardial infarction in real-time data using an ensemble machine learning model. Advances in Intelligent Information Hiding and Multimedia Signal Processing: Springer; (2020). p. 259–64. Kasim S, Malek S, Cheen S, Safiruz MS, Ahmad WAW, Ibrahim KS, et al. In-hospital risk stratification algorithm of Asian elderly patients. Scientific Reports. (2022); 12(1):17592. doi: 10.1038/s41598-022-18839-9, 36266376 Li X, Liu H, Yang J, Xie G, Xu M, Yang Y. Using machine learning models to predict in-hospital mortality for ST-elevation myocardial infarction patients. MEDINFO 2017: Precision Healthcare through Informatics: IOS Press; (2017). p. 476–80., 29295140 Patel B, Sengupta P. Machine learning for predicting cardiac events: what does the future hold? Expert review of cardiovascular therapy. (2020); 18(2):77–84. doi: 10.1080/14779072.2020.1732208, 32066289 Myers PD, Scirica BM, Stultz CM. Machine Learning Improves Risk Stratification After Acute Coronary Syndrome. Scientific Reports. (2017); 7(1):12692. doi: 10.1038/s41598-017-12951-x, 28978948 Massalha E, Sharon A, Oren D, Fishman B, Segev A, Matetzky S, et al. A GRADIENT BOOSTING MACHINE LEARNING MODEL ENHANCES MORTALITY PREDICTION IN NON-ST-SEGMENT ELEVATION MYOCARDIAL INFARCTION. Journal of the American College of Cardiology. (2022); 79(9_Supplement):1131–. Lee W, Lee J, Woo S-I, Choi SH, Bae J-W, Jung S, et al. Machine learning enhances the performance of short and long-term mortality prediction model in non-ST-segment elevation myocardial infarction. Scientific Reports. (2021); 11(1):12886. doi: 10.1038/s41598-021-92362-1, 34145358 Fu R, Song C, Yang J, Wang Y, Li B, Xu H, et al. CAMI-NSTEMI Score ― China Acute Myocardial Infarction Registry-Derived Novel Tool to Predict In-Hospital Death in Non-ST Segment Elevation Myocardial Infarction Patients ―. Circulation Journal. (2018); 82(7):1884–91. Sherazi SWA, Bae J-W, Lee JY. A soft voting ensemble classifier for early prediction and diagnosis of occurrences of major adverse cardiovascular events for STEMI and NSTEMI during 2-year follow-up in patients with acute coronary syndrome. PloS one. (2021); 16(6):e0249338. doi: 10.1371/journal.pone.0249338, 34115750 Nadarajah R, Ludman P, Laroche C, Appelman Y, Brugaletta S, Budaj A, et al. Presentation, care, and outcomes of patients with NSTEMI according to World Bank country income classification: the ACVC-EAPCI EORP NSTEMI Registry of the European Society of Cardiology. European Heart Journal—Quality of Care and Clinical Outcomes. (2023):qcad008. doi: 10.1093/ehjqcco/qcad008, 36737420 Cannon CP, McCabe CH, Stone PH, Rogers WJ, Schactman M, Thompson BW, et al. The electrocardiogram predicts one-year outcome of patients with unstable angina and non–Q wave myocardial infarction: results of the TIMI III Registry ECG Ancillary Study. Journal of the American College of Cardiology. (1997); 30(1):133–40. Damman P, Holmvang L, Tijssen JG, Lagerqvist B, Clayton TC, Pocock SJ, et al. Usefulness of the admission electrocardiogram to predict long-term outcomes after non-ST-elevation acute coronary syndrome (from the FRISC II, ICTUS, and RITA-3 [FIR] Trials). The American journal of cardiology. (2012); 109(1):6–12. doi: 10.1016/j.amjcard.2011.08.002, 21944677 Kaul P, Fu Y, Chang W-C, Harrington RA, Wagner GS, Goodman SG, et al. Prognostic value of ST segment depression in acute coronary syndromes: insights from PARAGON-A applied to GUSTO-IIb. Journal of the American College of Cardiology. (2001); 38(1):64–71. Savonitto S, Ardissino D, Granger CB, Morando G, Prando MD, Mafrici A, et al. Prognostic value of the admission electrocardiogram in acute coronary syndromes. Jama. (1999); 281(8):707–13. doi: 10.1001/jama.281.8.707, 10052440 Montalescot G, Dallongeville J, Van Belle E, Rouanet S, Baulac C, Degrandsart A, et al. STEMI and NSTEMI: are they so different? 1 year outcomes in acute myocardial infarction as defined by the ESC/ACC definition (the OPERA registry). Eur Heart J. (2007); 28(12):1409–17. doi: 10.1093/eurheartj/ehm031, 17412730 Wu CC, Hsu WD, Wang YC, Kung WM, Tzeng IS, Huang CW, et al. An Innovative Scoring System for Predicting Major Adverse Cardiac Events in Patients With Chest Pain Based on Machine Learning. IEEE Access. (2020); 8:124076–83. Chen X, Ishwaran H. Random forests for genomic data analysis. Genomics. (2012); 99(6):323–9. doi: 10.1016/j.ygeno.2012.04.003, 22546560 Wallert J, Tomasoni M, Madison G, Held C. Predicting two-year survival versus non-survival after first myocardial infarction using machine learning and Swedish national register data. BMC medical informatics and decision making. (2017); 17(1):1–11. Ahmad WAW, Ali RM, Khanom M, Han CK, Bang LH, Yip AFY, et al. The journey of Malaysian NCVD—PCI (National Cardiovascular Disease Database—Percutaneous Coronary Intervention) Registry: A summary of three years report. International journal of cardiology. (2013); 165(1):161–4. doi: 10.1016/j.ijcard.2011.08.015, 21920614 Ahmad WAW, Zambahari R, Ismail O, Sinnadurai J, Rosman A, Piaw CS, et al. Malaysian national cardiovascular disease database (NCVD)–acute coronary syndrome (ACS) registry: how are we different? CVD Prevention and Control. (2011); 6(3):81–9. Shouval R, Hadanny A, Shlomo N, Iakobishvili Z, Unger R, Zahger D, et al. Machine learning for prediction of 30-day mortality after ST elevation myocardial infraction: An Acute Coronary Syndrome Israeli Survey data mining study. International journal of cardiology. (2017); 246:7–13. doi: 10.1016/j.ijcard.2017.05.067, 28867023 Yang J, Li X, Chen T, Li Y, Xie G, Yang Y. Machine learning models to predict in-hospital mortality for ST-elevation myocardial infarction: from china acute myocardial infarction (cami) registry. Journal of the American College of Cardiology. (2018); 71(11S):A236–A. Van Buuren S, Groothuis-Oudshoorn K. mice: Multivariate imputation by chained equations in R. Journal of statistical software. (2011); 45:1–67. Kuhn M, Johnson K. Applied predictive modeling: Springer; (2013). Kapoor S, Narayanan A. Leakage and the reproducibility crisis in ML-based science. arXiv preprint arXiv:220707048. (2022). Lunardon N, Menardi G, Torelli N. ROSE: a package for binary imbalanced learning. R journal. (2014); 6(1). Vapnik V, Guyon I, Hastie T. Support vector machines. Mach Learn. (1995); 20(3):273–97. Breiman L.Random forests. Machine learning. (2001); 45(1):5–32. Chen T, Guestrin C, editors. Xgboost: A scalable tree boosting system. Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining; (2016). Rish I, editor An empirical study of the naive Bayes classifier. IJCAI 2001 workshop on empirical methods in artificial intelligence; (2001). Aziida N, Malek S, Aziz F, Ibrahim KS, Kasim S. Predicting 30-day mortality after an acute coronary syndrome (ACS) using machine learning methods for feature selection, classification and visualisation. Sains Malaysiana. (2021); 50(3):753–68. Aziz F, Malek S, Ibrahim KS, Raja Shariff RE, Wan Ahmad WA, Ali RM, et al. Short-and long-term mortality prediction after an acute ST-elevation myocardial infarction (STEMI) in Asians: A machine learning approach. PloS one. (2021); 16(8):e0254894. doi: 10.1371/journal.pone.0254894, 34339432 Cho S-Y, Kim S-H, Kang S-H, Lee KJ, Choi D, Kang S, et al. Pre-existing and machine learning-based models for cardiovascular risk prediction. Scientific Reports. (2021); 11(1):8886. doi: 10.1038/s41598-021-88257-w, 33903629 Kasim S, Malek S, Song C, Wan Ahmad WA, Fong A, Ibrahim KS, et al. In-hospital mortality risk stratification of Asian ACS patients with artificial intelligence algorithm. Plos one. (2022); 17(12):e0278944. doi: 10.1371/journal.pone.0278944, 36508425 Nath A, Sahu GK. Exploiting ensemble learning to improve prediction of phospholipidosis inducing potential. Journal of Theoretical Biology. (2019); 479:37–47. doi: 10.1016/j.jtbi.2019.07.009, 31310757 Tama BA, Im S, Lee S. Improving an Intelligent Detection System for Coronary Heart Disease Using a Two-Tier Classifier Ensemble. BioMed Research International. (2020); 2020:9816142. doi: 10.1155/2020/9816142, 32420387 Zaman SMM, Qureshi WM, Raihan MMS, Shams AB, Sultana S, editors. Survival Prediction of Heart Failure Patients using Stacked Ensemble Machine Learning Algorithm. 2021 IEEE International Women in Engineering (WIE) Conference on Electrical and Computer Engineering (WIECON-ECE); (2021)4–5 Dec. 2021. Dunkler D, Plischke M, Leffondré K, Heinze G. Augmented backward elimination: a pragmatic and purposeful way to develop statistical models. PloS one. (2014); 9(11):e113677. doi: 10.1371/journal.pone.0113677, 25415265 Genuer R, Poggi J-M, Tuleau-Malot C. Variable selection using random forests. Pattern recognition letters. (2010); 31(14):2225–36. Schaffer C.Selecting a classification method by cross-validation. Machine learning. (1993); 13(1):135–43. Zhang Z, Chen L, Xu P, Hong Y. Predictive analytics with ensemble modeling in laparoscopic surgery: a technical note. Laparoscopic, Endoscopic and Robotic Surgery. (2022); 5(1):25–34. Marcílio WE, Eler DM, editors. From explanations to feature selection: assessing SHAP values as feature selection mechanism. 2020 33rd SIBGRAPI conference on Graphics, Patterns and Images (SIBGRAPI); (2020): Ieee. Steyerberg EW, Vickers AJ, Cook NR, Gerds T, Gonen M, Obuchowski N, et al. Assessing the performance of prediction models: a framework for some traditional and novel measures. Epidemiology (Cambridge, Mass). (2010); 21(1):128. Kuhn M, Johnson K, Kuhn M, Johnson K. Classification trees and rule-based models. Applied predictive modeling. (2013):369–413. Antman EM, Cohen M, Bernink PJ, McCabe CH, Horacek T, Papuchis G, et al. The TIMI risk score for unstable angina/non-ST elevation MI: A method for prognostication and therapeutic decision making. Jama. (2000); 284(7):835–42. doi: 10.1001/jama.284.7.835, 10938172 Correia LC, Garcia G, Kalil F, Ferreira F, Carvalhal M, Oliveira R, et al. Prognostic value of TIMI score versus GRACE score in ST-segment elevation myocardial infarction. Arquivos brasileiros de cardiologia. (2014); 103:98–106. doi: 10.5935/abc.20140095, 25029471 VanHouten JP, Starmer JM, Lorenzi NM, Maron DJ, Lasko TA. Machine learning for risk prediction of acute coronary syndrome. AMIA Annu Symp Proc. (2014); 2014:1940–9., 25954467 Al'Aref SJ, Singh G, Rosendael ARv, Kolli KK, Ma X, Maliakal G, et al. Determinants of In‐Hospital Mortality After Percutaneous Coronary Intervention: A Machine Learning Approach. Journal of the American Heart Association. (2019); 8(5):e011160. doi: 10.1161/JAHA.118.011160, 30834806 Mansoor H, Elgendy IY, Segal R, Bavry AA, Bian J. Risk prediction model for in-hospital mortality in women with ST-elevation myocardial infarction: A machine learning approach. Heart & Lung. (2017); 46(6):405–11. doi: 10.1016/j.hrtlng.2017.09.003, 28992993 Emakhu J, Monplaisir L, Aguwa C, Arslanturk S, Masoud S, Nassereddine H, et al. Acute coronary syndrome prediction in emergency care: A machine learning approach. Computer Methods and Programs in Biomedicine. (2022); 225:107080. doi: 10.1016/j.cmpb.2022.107080, 36037605 Pieszko K, Hiczkiewicz J, Budzianowski P, Budzianowski J, Rzeźniczak J, Pieszko K, et al. Predicting Long-Term Mortality after Acute Coronary Syndrome Using Machine Learning Techniques and Hematological Markers. Disease Markers. (2019); 2019:9056402. doi: 10.1155/2019/9056402, 30838085 Bulluck H, Zheng H, Chan MY, Foin N, Foo DC, Lee CW, et al. Independent Predictors of Cardiac Mortality and Hospitalization for Heart Failure in a Multi-Ethnic Asian ST-segment Elevation Myocardial Infarction Population Treated by Primary Percutaneous Coronary Intervention. Scientific Reports. (2019); 9(1):10072. doi: 10.1038/s41598-019-46486-0, 31296912 Bradshaw PJ, Ko DT, Newman AM, Donovan LR, Tu JV. Validation of the Thrombolysis In Myocardial Infarction (TIMI) risk index for predicting early mortality in a population-based cohort of STEMI and non-STEMI patients. Canadian Journal of Cardiology. (2007); 23(1):51–6. doi: 10.1016/s0828-282x(07)70213-1, 17245483 Kumar D, Saghir T, Zahid M, Ashok A, Kumar M, Shah AA, et al. Validity of TIMI score for predicting 14-day mortality of non-st elevation myocardial infarction patients. Cureus. (2021); 13(1). doi: 10.7759/cureus.12518, 33564522 Chen Y-H, Huang S-S, Lin S-J. TIMI and GRACE risk scores predict both short-term and long-term outcomes in Chinese patients with acute myocardial infarction. Acta Cardiologica Sinica. (2018); 34(1):4. doi: 10.6515/ACS.201801_34(1).20170730B, 29375219 Karim K, Fadhilah VK, Aulia MS, Pradisa IS, Setyasari A, Yosephina S, et al. NEUTROPHIL TO LYMPHOCYTE RATIO AS MORTALITY PREDICTOR IN ACUTE CORONARY SYNDROME PATIENTS. Teo BW, Chan GC, Leo CCH, Tay JC, Chia YC, Siddique S, et al. Hypertension and chronic kidney disease in Asian populations. The Journal of Clinical Hypertension. (2021); 23(3):475–80. doi: 10.1111/jch.14188, 33538081 Mills KT, Bundy JD, Kelly TN, Reed J, Kearney PM, Reynolds K, et al. Global disparities of hypertension prevalence and control: a systematic analysis of population-based studies from 90 countries. Circulation. (2015); 132(suppl_3):A16828–A. Joshi P, Islam S, Pais P, Reddy S, Dorairaj P, Kazmi K, et al. Risk factors for early myocardial infarction in South Asians compared with individuals in other countries. Jama. (2007); 297(3):286–94. doi: 10.1001/jama.297.3.286, 17227980 Yanqiao L, Shen L, Yutong M, Linghong S, Ben H. Comparison of GRACE and TIMI risk scores in the prediction of in-hospital and long-term outcomes among East Asian non-ST-elevation myocardial infarction patients. BMC Cardiovascular Disorders. (2022); 22(1):4. doi: 10.1186/s12872-021-02311-z, 34996365 Liu N, Koh ZX, Goh J, Lin Z, Haaland B, Ting BP, et al. Prediction of adverse cardiac events in emergency department patients with chest pain using machine learning for variable selection. BMC Medical Informatics and Decision Making. (2014); 14(1):75. doi: 10.1186/1472-6947-14-75, 25150702 Vomlel J, Kruzık H, Tuma P, Precek J, Hutyra M. Machine learning methods for mortality prediction in patients with st elevation myocardial infarction. Proceedings of WUPES. (2012); 2012:204–13. Vázquez B, Fuentes G, Garcia F, Borrayo G, Prohias J. Risk markers by sex and age group for in-hospital mortality in patients with STEMI or NSTEMI: an approach based on machine learning. CoRR. (2021). Dworeck C, Redfors B, Völz S, Haraldsson I, Angerås O, Råmunddal T, et al. Radial artery access is associated with lower mortality in patients undergoing primary PCI: a report from the SWEDEHEART registry. European Heart Journal: Acute Cardiovascular Care. (2020); 9(4):323–32. doi: 10.1177/2048872620908032, 33025815 Feldkamp T, Luedemann M, Spehlmann ME, Freitag-Wolf S, Gaensbacher J, Schulte K, et al. Radial access protects from contrast media induced nephropathy after cardiac catheterization procedures. Clinical Research in Cardiology. (2018); 107:148–57. doi: 10.1007/s00392-017-1166-2, 28939956 Seong AC, John CKM. A review of coronary artery disease research in Malaysia. Med J Malaysia. (2016); 71(Supplement 1):46. Ahmad W.National Cardiovascular Disease Database (NCVD): annual report of the Acute Coronary Syndrome (ACS) Registry 2014–2015. Kuala Lumpur: National Heart Association of Malaysia (NHAM) and Ministry of Health Malaysia. (2017). Saad M, Karam B, Faddoul G, Douaihy YE, Yacoub H, Baydoun H, et al. Is kidney function affecting the management of myocardial infarction? A retrospective cohort study in patients with normal kidney function, chronic kidney disease stage III–V, and ESRD. International journal of nephrology and renovascular disease. (2016):5–10. doi: 10.2147/IJNRD.S91567, 26858529 Ministry of Health Malaysia. Clinical Practice Guidelines on Primary & Secondary Prevention of Cardiovascular Disease 2017. In: Malaysia MoH, editor. (2017). p. 182. H2O Tutorials [updated 2020, May 6]. Available from: https://github.com/h2oai/h2o-tutorials/blob/master/tutorials/ensembles-stacking/README.md. Alahmar A, Mohammed E, Benlamri R, editors. Application of data mining techniques to predict the length of stay of hospitalized patients with diabetes. 2018 4th International Conference on Big Data Innovations and Applications (Innovate-Data); (2018): IEEE. Hayıroğlu M, Altay S. The Role of Artificial Intelligence in Coronary Artery Disease and Atrial Fibrillation. Balkan Med J. (2023); 40(3):151–2. doi: 10.4274/balkanmedj.galenos.2023.06042023, 37025078 Laufer‐Perl M, Shacham Y, Letourneau‐Shesaf S, Priesler O, Keren G, Roth A, et al. Gender‐Related Mortality and In‐Hospital Complications Following ST‐Segment Elevation Myocardial Infarction: Data From a Primary Percutaneous Coronary Intervention Cohort. Clinical cardiology. (2015); 38(3):145–9. doi: 10.1002/clc.22363, 25728563

By Sazzli Kasim; Putri Nur Fatin Amir Rudin; Sorayya Malek; Firdaus Aziz; Wan Azman Wan Ahmad; Khairul Shafiq Ibrahim; Muhammad Hanis Muhmad Hamidi; Raja Ezman Raja Shariff; Alan Yean Yip Fong and Cheen Song

Reported by Author; Author; Author; Author; Author; Author; Author; Author; Author; Author

Titel:
Data analytics approach for short- and long-term mortality prediction following acute non-ST-elevation myocardial infarction (NSTEMI) and Unstable Angina (UA) in Asians.
Autor/in / Beteiligte Person: Kasim, S ; Amir Rudin, PNF ; Malek, S ; Aziz, F ; Wan Ahmad, WA ; Ibrahim, KS ; Muhmad Hamidi, MH ; Raja Shariff, RE ; Fong, AYY ; Song, C
Link:
Zeitschrift: PloS one, Jg. 19 (2024-02-15), Heft 2, S. e0298036
Veröffentlichung: San Francisco, CA : Public Library of Science, 2024
Medientyp: academicJournal
ISSN: 1932-6203 (electronic)
DOI: 10.1371/journal.pone.0298036
Schlagwort:
  • Humans
  • Heparin, Low-Molecular-Weight
  • Data Science
  • Bayes Theorem
  • Angina, Unstable
  • Risk Assessment
  • Arrhythmias, Cardiac
  • Non-ST Elevated Myocardial Infarction diagnosis
  • ST Elevation Myocardial Infarction
Sonstiges:
  • Nachgewiesen in: MEDLINE
  • Sprachen: English
  • Publication Type: Journal Article
  • Language: English
  • [PLoS One] 2024 Feb 15; Vol. 19 (2), pp. e0298036. <i>Date of Electronic Publication: </i>2024 Feb 15 (<i>Print Publication: </i>2024).
  • MeSH Terms: Non-ST Elevated Myocardial Infarction* / diagnosis ; ST Elevation Myocardial Infarction* ; Humans ; Heparin, Low-Molecular-Weight ; Data Science ; Bayes Theorem ; Angina, Unstable ; Risk Assessment ; Arrhythmias, Cardiac
  • References: Circulation. 2000 Oct 24;102(17):2031-7. (PMID: 11044416) ; Int J Cardiol. 2013 Apr 30;165(1):161-4. (PMID: 21920614) ; Biomed Res Int. 2020 Apr 27;2020:9816142. (PMID: 32420387) ; J Am Coll Cardiol. 1997 Jul;30(1):133-40. (PMID: 9207634) ; Diagnostics (Basel). 2019 Nov 07;9(4):. (PMID: 31703364) ; J Am Coll Cardiol. 2001 Jul;38(1):64-71. (PMID: 11451297) ; J Am Coll Cardiol. 2016 Aug 9;68(6):626-635. (PMID: 27491907) ; PLoS One. 2021 Jun 11;16(6):e0249338. (PMID: 34115750) ; Int J Cardiol. 2017 Nov 1;246:7-13. (PMID: 28867023) ; Circulation. 2016 Aug 9;134(6):441-50. (PMID: 27502908) ; J Thromb Thrombolysis. 2020 Jan;49(1):1-9. (PMID: 31535314) ; JAMA. 2004 Jun 9;291(22):2727-33. (PMID: 15187054) ; Acta Cardiol Sin. 2018 Jan;34(1):4-12. (PMID: 29375219) ; Int J Nephrol Renovasc Dis. 2016 Jan 22;9:5-10. (PMID: 26858529) ; J Am Heart Assoc. 2019 Mar 5;8(5):e011160. (PMID: 30834806) ; Heart Lung. 2017 Nov - Dec;46(6):405-411. (PMID: 28992993) ; Dis Markers. 2019 Jan 30;2019:9056402. (PMID: 30838085) ; Trends Cardiovasc Med. 2018 Feb;28(2):102-109. (PMID: 28826668) ; Can J Cardiol. 2007 Jan;23(1):51-6. (PMID: 17245483) ; JAMA. 1999 Feb 24;281(8):707-13. (PMID: 10052440) ; Eur Heart J. 2007 Jun;28(12):1409-17. (PMID: 17412730) ; Sci Rep. 2019 Jul 11;9(1):10072. (PMID: 31296912) ; Sci Rep. 2021 Apr 26;11(1):8886. (PMID: 33903629) ; PLoS One. 2021 Aug 2;16(8):e0254894. (PMID: 34339432) ; BMC Med Inform Decis Mak. 2014 Aug 23;14:75. (PMID: 25150702) ; PLoS One. 2022 Dec 12;17(12):e0278944. (PMID: 36508425) ; Med Care. 2015 Sep;53(9):818-26. (PMID: 26225445) ; JAMA. 2007 Jan 17;297(3):286-94. (PMID: 17227980) ; JAMA. 2000 Aug 16;284(7):835-42. (PMID: 10938172) ; J Clin Hypertens (Greenwich). 2021 Mar;23(3):475-480. (PMID: 33538081) ; Am J Cardiol. 2012 Jan 1;109(1):6-12. (PMID: 21944677) ; Sci Rep. 2021 Jun 18;11(1):12886. (PMID: 34145358) ; Angiology. 2023 Apr 14;:33197231170982. (PMID: 37058422) ; BMC Med Inform Decis Mak. 2017 Jul 5;17(1):99. (PMID: 28679442) ; Arq Bras Cardiol. 2014 Aug;103(2):98-106. (PMID: 25029471) ; Expert Rev Cardiovasc Ther. 2020 Feb;18(2):77-84. (PMID: 32066289) ; PLoS One. 2014 Nov 21;9(11):e113677. (PMID: 25415265) ; Angiology. 2023 Apr;74(4):357-364. (PMID: 35635200) ; Med J Malaysia. 2016 Jun;71(Suppl 1):42-57. (PMID: 27801387) ; PLoS One. 2019 Oct 31;14(10):e0224502. (PMID: 31671144) ; Sci Rep. 2017 Oct 4;7(1):12692. (PMID: 28978948) ; AMIA Annu Symp Proc. 2014 Nov 14;2014:1940-9. (PMID: 25954467) ; Comput Methods Programs Biomed. 2022 Oct;225:107080. (PMID: 36037605) ; Rev Port Cardiol. 2016 Jan;35(1):25-31. (PMID: 26718493) ; Eur Heart J Qual Care Clin Outcomes. 2023 Sep 12;9(6):552-563. (PMID: 36737420) ; Stud Health Technol Inform. 2017;245:476-480. (PMID: 29295140) ; Genomics. 2012 Jun;99(6):323-9. (PMID: 22546560) ; World J Cardiol. 2015 May 26;7(5):243-76. (PMID: 26015857) ; Korean J Intern Med. 2019 Jan;34(1):1-10. (PMID: 30612415) ; Clin Res Cardiol. 2018 Feb;107(2):148-157. (PMID: 28939956) ; Eur Heart J Acute Cardiovasc Care. 2020 Jun;9(4):323-332. (PMID: 33025815) ; J Theor Biol. 2019 Oct 21;479:37-47. (PMID: 31310757) ; Clin Cardiol. 2015 Mar;38(3):145-9. (PMID: 25728563) ; Patterns (N Y). 2023 Aug 04;4(9):100804. (PMID: 37720327) ; BMC Cardiovasc Disord. 2022 Jan 7;22(1):4. (PMID: 34996365) ; Sci Rep. 2022 Oct 20;12(1):17592. (PMID: 36266376) ; Circ J. 2018 Jun 25;82(7):1884-1891. (PMID: 29887577) ; Cureus. 2021 Jan 6;13(1):e12518. (PMID: 33564522) ; Epidemiology. 2010 Jan;21(1):128-38. (PMID: 20010215) ; Balkan Med J. 2023 Apr 07;40(3):151-152. (PMID: 37025078) ; Health Informatics J. 2020 Jun;26(2):1289-1304. (PMID: 31566458)
  • Substance Nomenclature: 0 (Heparin, Low-Molecular-Weight)
  • Entry Date(s): Date Created: 20240215 Date Completed: 20240219 Latest Revision: 20240219
  • Update Code: 20240219
  • PubMed Central ID: PMC10868757

Klicken Sie ein Format an und speichern Sie dann die Daten oder geben Sie eine Empfänger-Adresse ein und lassen Sie sich per Email zusenden.

oder
oder

Wählen Sie das für Sie passende Zitationsformat und kopieren Sie es dann in die Zwischenablage, lassen es sich per Mail zusenden oder speichern es als PDF-Datei.

oder
oder

Bitte prüfen Sie, ob die Zitation formal korrekt ist, bevor Sie sie in einer Arbeit verwenden. Benutzen Sie gegebenenfalls den "Exportieren"-Dialog, wenn Sie ein Literaturverwaltungsprogramm verwenden und die Zitat-Angaben selbst formatieren wollen.

xs 0 - 576
sm 576 - 768
md 768 - 992
lg 992 - 1200
xl 1200 - 1366
xxl 1366 -