Mortality prediction

死亡率预测
  • 文章类型: Journal Article
    机器学习在医疗保健中的应用通常需要使用分层代码,例如国际疾病分类(ICD)和解剖治疗化学(ATC)系统。这些代码对疾病和药物进行分类,分别,从而形成广泛的数据维度。无监督特征选择解决了“维度的诅咒”,并通过减少无关或冗余特征的数量并避免过度拟合,有助于提高监督学习模型的准确性和性能。无监督特征选择技术,比如过滤器,包装器,和嵌入式方法,被实现为选择具有最内在信息的最重要的功能。然而,由于ICD和ATC代码的庞大数量以及这些系统的层次结构,他们面临挑战。
    本研究的目的是比较冠状动脉疾病患者ICD和ATC代码数据库的几种无监督特征选择方法的性能和复杂性的不同方面,并选择代表这些患者的最佳特征集。
    我们比较了艾伯塔省51,506名冠状动脉疾病患者的2个ICD和1个ATC代码数据库的几种无监督特征选择方法,加拿大。具体来说,我们用拉普拉斯分数,多集群数据的无监督特征选择,自动编码器启发的无监督特征选择,主要特征分析,和混凝土自动编码器有和没有ICD或ATC树的重量调整,从超过9000ICD和2000ATC代码中选择100个最佳功能。我们根据其重建初始特征空间和预测出院后90天死亡率的能力评估了选定的特征。我们还通过ICD或ATC树中的平均代码级别比较了所选特征的复杂性,以及使用Shapley分析的死亡率预测任务中特征的可解释性。
    在特征空间重构和死亡率预测中,具体的基于自动编码器的方法优于其他技术。特别是,权重调整后的混凝土自动编码器变体展示了改进的重建精度和显著的预测性能增强,经DeLong和McNemar检验证实(P<0.05)。混凝土自动编码器首选更通用的代码,他们一致准确地重建了所有特征。此外,与大多数替代方案相比,通过重量调整的混凝土自动编码器选择的特征在死亡率预测中产生了更高的Shapley值。
    这项研究在无监督的背景下仔细检查了ICD和ATC代码数据集中的5种特征选择方法。我们的发现强调了具体的自动编码器方法在选择代表整个数据集的显着特征方面的优越性,为后续机器学习研究提供潜在资产。我们还为专门为ICD和ATC代码数据集量身定制的具体自动编码器提供了一种新颖的权重调整方法,以增强所选功能的可泛化性和可解释性。
    UNASSIGNED: The application of machine learning in health care often necessitates the use of hierarchical codes such as the International Classification of Diseases (ICD) and Anatomical Therapeutic Chemical (ATC) systems. These codes classify diseases and medications, respectively, thereby forming extensive data dimensions. Unsupervised feature selection tackles the \"curse of dimensionality\" and helps to improve the accuracy and performance of supervised learning models by reducing the number of irrelevant or redundant features and avoiding overfitting. Techniques for unsupervised feature selection, such as filter, wrapper, and embedded methods, are implemented to select the most important features with the most intrinsic information. However, they face challenges due to the sheer volume of ICD and ATC codes and the hierarchical structures of these systems.
    UNASSIGNED: The objective of this study was to compare several unsupervised feature selection methods for ICD and ATC code databases of patients with coronary artery disease in different aspects of performance and complexity and select the best set of features representing these patients.
    UNASSIGNED: We compared several unsupervised feature selection methods for 2 ICD and 1 ATC code databases of 51,506 patients with coronary artery disease in Alberta, Canada. Specifically, we used the Laplacian score, unsupervised feature selection for multicluster data, autoencoder-inspired unsupervised feature selection, principal feature analysis, and concrete autoencoders with and without ICD or ATC tree weight adjustment to select the 100 best features from over 9000 ICD and 2000 ATC codes. We assessed the selected features based on their ability to reconstruct the initial feature space and predict 90-day mortality following discharge. We also compared the complexity of the selected features by mean code level in the ICD or ATC tree and the interpretability of the features in the mortality prediction task using Shapley analysis.
    UNASSIGNED: In feature space reconstruction and mortality prediction, the concrete autoencoder-based methods outperformed other techniques. Particularly, a weight-adjusted concrete autoencoder variant demonstrated improved reconstruction accuracy and significant predictive performance enhancement, confirmed by DeLong and McNemar tests (P<.05). Concrete autoencoders preferred more general codes, and they consistently reconstructed all features accurately. Additionally, features selected by weight-adjusted concrete autoencoders yielded higher Shapley values in mortality prediction than most alternatives.
    UNASSIGNED: This study scrutinized 5 feature selection methods in ICD and ATC code data sets in an unsupervised context. Our findings underscore the superiority of the concrete autoencoder method in selecting salient features that represent the entire data set, offering a potential asset for subsequent machine learning research. We also present a novel weight adjustment approach for the concrete autoencoders specifically tailored for ICD and ATC code data sets to enhance the generalizability and interpretability of the selected features.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:跌倒后出现在急诊科(ED)的老年患者越来越普遍。跌倒与功能衰退和死亡有关。预测短期死亡率的生物标志物可能有助于有关资源分配和处置的决策。D-二聚体水平用于排除血栓栓塞性疾病,而和肽素和肾上腺髓质素(MR-proADM)可用作患者压力水平的量度。选择这些非特异性生物标志物作为死亡率的潜在预测因子。
    方法:前瞻性,国际,多中心,在德国和瑞士的两家三级医院和两家地区医院进行了横断面观察.纳入了在跌倒后出现在ED的65岁或以上的患者。人口统计数据,日常生活活动(ADL),和D-二聚体在呈递时收集。从冷冻样品中测定和肽素和MR-proADM水平。主要结局为30天死亡率;次要结局为90、180和365天死亡率。
    结果:纳入572例患者。年龄中位数为83[IQR78,89]岁,236名(67.7%)为女性。总体死亡率为3.1%(30d),5.4%(90天),7.5%(180天),和13.8%(365天),分别。非幸存者年龄较大,ADL指数较低,所有三种生物标志物水平较高。MR-proADM和D-二聚体水平升高与较高的死亡风险相关。MR-proADM和D-二聚体对短期死亡率的敏感性高,阴性似然比低,而copeptin没有。
    结论:D-二聚体和MR-proADM水平可能作为老年患者跌倒后出现ED的预后标志物,通过确定短期死亡率低风险的患者。
    背景:ClinicalTrials.gov标识符:NCT02244983。
    BACKGROUND: Older patients presenting to the emergency department (ED) after falling are increasingly prevalent. Falls are associated with functional decline and death. Biomarkers predicting short-term mortality might facilitate decisions regarding resource allocation and disposition. D-dimer levels are used to rule out thromboembolic disease, while copeptin and adrenomedullin (MR-proADM) may be used as measures of the patient`s stress level. These nonspecific biomarkers were selected as potential predictors for mortality.
    METHODS: Prospective, international, multicenter, cross-sectional observation was performed in two tertiary and two regional hospitals in Germany and Switzerland. Patients aged 65 years or older presenting to the ED after a fall were enrolled. Demographic data, Activities of Daily Living (ADL), and D-dimers were collected upon presentation. Copeptin and MR-proADM levels were determined from frozen samples. Primary outcome was 30-day mortality; and secondary outcomes were mortality at 90, 180, and 365 days.
    RESULTS: Five hundred and seventy-two patients were included. Median age was 83 [IQR 78, 89] years, 236 (67.7%) were female. Mortality overall was 3.1% (30 d), 5.4% (90 d), 7.5% (180 d), and 13.8% (365 d), respectively. Non-survivors were older, had a lower ADL index and higher levels of all three biomarkers. Elevated levels of MR-proADM and D-dimer were associated with higher risk of mortality. MR-proADM and D-dimer showed high sensitivity and low negative likelihood ratio regarding short-term mortality, whereas copeptin did not.
    CONCLUSIONS: D-dimer and MR-proADM levels might be useful as prognostic markers in older patients presenting to the ED after a fall, by identifying patients at low risk of short-term mortality.
    BACKGROUND: ClinicalTrials.gov Identifier: NCT02244983.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目标:现有的公平性评估方法往往忽视了健康的社会决定因素的系统性差异,比如人口统计学和社会经济学,在比较组中,可能导致不准确甚至矛盾的结论。这项研究旨在使用考虑系统差异的公平性检测方法来评估预测慢性病患者死亡率的种族差异。
    方法:我们从MassGeneralBrigham的电子健康记录(EHR)中创建了五个数据集,每个人都专注于不同的慢性病:充血性心力衰竭(CHF),慢性肾脏病(CKD),慢性阻塞性肺疾病(COPD),慢性肝病(CLD),和痴呆症。对于每个数据集,我们开发了单独的机器学习模型来预测1年死亡率,并通过比较黑人和白人个体的预测表现来检验种族差异.我们比较了总体黑人和白人与他们的同行之间的种族公平性评估,这些人是黑人和匹配的白人,通过倾向得分匹配确定,系统差异得到缓解。
    结果:我们发现黑人和白人在年龄上存在显著差异,性别,婚姻状况,教育水平,吸烟状况,健康保险类型,身体质量指数,和Charlson合并症指数(p值<0.001)。当检查通过倾向得分匹配确定的匹配的黑人和白人亚群时,特定协变量之间存在显著差异。我们观察到CHF队列中保险类型的显著性水平较弱(p=0.043),在CKD队列中,保险类型(p=0.005)和教育水平(p=0.016),和痴呆队列中的体重指数(p=0.041);其他协变量没有显着差异。在检查五个研究队列的死亡率预测模型时,我们对消除系统性差异前后的公平性评价进行了比较。我们揭示了CHF队列的显着差异,在AdaBoost模型的F1测量和敏感性方面,p值为0.021和0.001,就MLP模型的F1度量和灵敏度而言,p值为0.014和0.003,分别。
    结论:这项研究通过关注系统差异的检查,为公平性评估的研究做出了贡献,并强调了在临床环境中使用的机器学习模型中揭示种族偏见的可能性。
    OBJECTIVE: Existing approaches to fairness evaluation often overlook systematic differences in the social determinants of health, like demographics and socioeconomics, among comparison groups, potentially leading to inaccurate or even contradictory conclusions. This study aims to evaluate racial disparities in predicting mortality among patients with chronic diseases using a fairness detection method that considers systematic differences.
    METHODS: We created five datasets from Mass General Brigham\'s electronic health records (EHR), each focusing on a different chronic condition: congestive heart failure (CHF), chronic kidney disease (CKD), chronic obstructive pulmonary disease (COPD), chronic liver disease (CLD), and dementia. For each dataset, we developed separate machine learning models to predict 1-year mortality and examined racial disparities by comparing prediction performances between Black and White individuals. We compared racial fairness evaluation between the overall Black and White individuals versus their counterparts who were Black and matched White individuals identified by propensity score matching, where the systematic differences were mitigated.
    RESULTS: We identified significant differences between Black and White individuals in age, gender, marital status, education level, smoking status, health insurance type, body mass index, and Charlson comorbidity index (p-value < 0.001). When examining matched Black and White subpopulations identified through propensity score matching, significant differences between particular covariates existed. We observed weaker significance levels in the CHF cohort for insurance type (p = 0.043), in the CKD cohort for insurance type (p = 0.005) and education level (p = 0.016), and in the dementia cohort for body mass index (p = 0.041); with no significant differences for other covariates. When examining mortality prediction models across the five study cohorts, we conducted a comparison of fairness evaluations before and after mitigating systematic differences. We revealed significant differences in the CHF cohort with p-values of 0.021 and 0.001 in terms of F1 measure and Sensitivity for the AdaBoost model, and p-values of 0.014 and 0.003 in terms of F1 measure and Sensitivity for the MLP model, respectively.
    CONCLUSIONS: This study contributes to research on fairness assessment by focusing on the examination of systematic disparities and underscores the potential for revealing racial bias in machine learning models used in clinical settings.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:机器学习技术开始在各种医疗保健数据集中使用,以识别可能从干预中受益的体弱者。然而,与传统回归相比,关于机器学习技术性能的证据好坏参半。还不清楚哪些方法和数据库因素与性能相关。
    目的:本研究旨在比较各种机器学习分类器在不同场景下识别体弱老年人的死亡率预测准确性。
    方法:我们使用2012年1月1日至2016年12月31日在新西兰使用interRAI-HomeCare工具评估的老年人(65岁及以上)收集的去识别数据。总共使用138个InterRAI评估项目来预测6个月和12个月的死亡率。使用3个机器学习分类器(随机森林[RF],极端梯度增强[XGBoost],和多层感知器[MLP])和正则化逻辑回归。我们进行了一项模拟研究,比较了机器学习模型与逻辑回归和内部RAI家庭护理脆弱量表的性能,并检查了样本量的影响,功能的数量,和列车测试分流比。
    结果:共有95,042名老年人(平均年龄82.66岁,IQR77.92-88.76;n=37,462,39.42%男性)接受家庭护理。曲线下平均面积(AUC)和6个月死亡率预测的敏感性表明,机器学习分类器的表现并不优于正则逻辑回归。就AUC而言,正则化逻辑回归的性能优于XGBoost,MLP,当特征数量≤80且样本量≤16,000时,和RF;当特征数量≥40且样本量≥4000时,MLP在灵敏度方面优于正则逻辑回归。相反,在所有情况下,RF和XGBoost均表现出比正则逻辑回归更高的特异性。
    结论:研究表明,当使用不同的指标进行评估时,机器学习模型在预测性能方面表现出显著差异。正则逻辑回归是一种有效的模型,用于识别体弱的老年人接受家庭护理,如AUC所示,特别是当特征的数量和样本大小不太大时。相反,MLP显示出优越的灵敏度,而当特征数量和样本量大时,RF表现出优异的特异性。
    BACKGROUND: Machine learning techniques are starting to be used in various health care data sets to identify frail persons who may benefit from interventions. However, evidence about the performance of machine learning techniques compared to conventional regression is mixed. It is also unclear what methodological and database factors are associated with performance.
    OBJECTIVE: This study aimed to compare the mortality prediction accuracy of various machine learning classifiers for identifying frail older adults in different scenarios.
    METHODS: We used deidentified data collected from older adults (65 years of age and older) assessed with interRAI-Home Care instrument in New Zealand between January 1, 2012, and December 31, 2016. A total of 138 interRAI assessment items were used to predict 6-month and 12-month mortality, using 3 machine learning classifiers (random forest [RF], extreme gradient boosting [XGBoost], and multilayer perceptron [MLP]) and regularized logistic regression. We conducted a simulation study comparing the performance of machine learning models with logistic regression and interRAI Home Care Frailty Scale and examined the effects of sample sizes, the number of features, and train-test split ratios.
    RESULTS: A total of 95,042 older adults (median age 82.66 years, IQR 77.92-88.76; n=37,462, 39.42% male) receiving home care were analyzed. The average area under the curve (AUC) and sensitivities of 6-month mortality prediction showed that machine learning classifiers did not outperform regularized logistic regressions. In terms of AUC, regularized logistic regression had better performance than XGBoost, MLP, and RF when the number of features was ≤80 and the sample size ≤16,000; MLP outperformed regularized logistic regression in terms of sensitivities when the number of features was ≥40 and the sample size ≥4000. Conversely, RF and XGBoost demonstrated higher specificities than regularized logistic regression in all scenarios.
    CONCLUSIONS: The study revealed that machine learning models exhibited significant variation in prediction performance when evaluated using different metrics. Regularized logistic regression was an effective model for identifying frail older adults receiving home care, as indicated by the AUC, particularly when the number of features and sample sizes were not excessively large. Conversely, MLP displayed superior sensitivity, while RF exhibited superior specificity when the number of features and sample sizes were large.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    全球,创伤病例具有显著的发病率和死亡率。因此,已经设计了各种评分系统来改善创伤病例的预后。创伤和损伤严重度评分(TRISS)是广泛用于预测死亡率的模型之一;然而,它有一定的局限性。我们旨在评估新模型TRISS-氧饱和度(SpO2)的生存预测,并与创伤研究参与者的原始TRISS评分进行比较。
    这是一项前瞻性队列研究,对2021年1月20日至2021年11月28日入住外科的380名创伤研究参与者进行。所提出的模型包括TRISS-SpO2,它代替了脉冲SpO2,而不是原始TRISS评分中的修正创伤评分。使用从主要创伤结果研究分析的Walker-Duncan回归分析得出的系数,计算了两个模型的生存概率(Ps)。使用接收器工作特性曲线分析来预测模型性能并计算精度。
    本研究的死亡率为30(7.9%)。基于呼吸频率计算Ps的原始TRISS评分的预测准确率为97.11%,对于基于SpO2计算Ps的TRISS评分模型,发现97.11%,因此,在性能上没有显着差异。
    新提出的模型TRISS-SpO2显示出与原始TRISS分数相似的良好准确性。然而,新工具TRISS-SpO2可能更易于使用,在临床环境中具有稳健的性能.
    UNASSIGNED: Globally, trauma cases have significant morbidity and mortality. Hence, various scoring systems have been designed to improve the prognosis in trauma cases. Trauma and Injury Severity Score (TRISS) is one of the widely used models to predict mortality; however, it has certain limitation. We have aimed to evaluate the survival prediction of new model TRISS-oxygen saturation (SpO2) and to compare with original TRISS score in trauma study participants.
    UNASSIGNED: This was a prospective cohort study conducted on 380 trauma study participants admitted to the surgery department from January 20, 2021, to November 28, 2021. The proposed model includes TRISS-SpO2 which replaces pulse SpO2 instead of revised trauma score in the original TRISS score. Probability of survival (Ps) was calculated for both models using coefficients derived from Walker-Duncan regression analysis analyzed from the Major Trauma Outcome Study. Receiver operating characteristic curve analysis was used to predict model performance and the accuracy was calculated.
    UNASSIGNED: The mortality rate in the present study was 30 (7.9%). The predictive accuracy of original TRISS score which calculated Ps based on respiratory rate was 97.11%, and for the proposed model of TRISS score which calculated Ps based on SpO2 was found 97.11%, and thus there is no significant difference in the performance.
    UNASSIGNED: The new proposed model TRISS-SpO2 showed a good accuracy which is similar to original TRISS score. However, the new tool TRISS-SpO2 might be easier to use for robust performance in the clinical setting.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:早期可靠地识别具有高死亡风险的脓毒症患者对于改善临床结局很重要。然而,人工智能(AI)模型的三大障碍,包括缺乏可解释性,普遍性的困难,以及自动化偏差的风险,阻碍人工智能模型在临床实践中的广泛采用。
    目的:本研究旨在开发和验证(内部和外部)危重患者败血症死亡风险的适形预测因子,利用人工智能辅助预测建模。所提出的方法能够解释模型输出并评估其置信水平。
    方法:我们从BethIsraelDeaconess医学中心教学医院收集的数据库中,回顾性提取了成人脓毒症患者的数据,用于模型训练和内部验证。PhilipseICU研究所的大型多中心重症监护数据库用于外部验证。入院后第一天共提取103个临床特征。我们使用梯度提升机开发了AI模型来预测败血症的死亡风险,并使用Mondrian适形预测来估计预测不确定性。采用Shapley加性解释方法对模型进行了解释。
    结果:共有16,746名(80%)来自贝斯以色列女执事医疗中心的患者用于训练模型。当在4187(20%)患者的内部验证人群中进行测试时,该模型实现了受试者工作特性曲线下的面积为0.858(95%CI0.845-0.871),当从PhilipseICU数据库中对10,362名患者进行外部验证时,该指数降至0.800(95%CI0.789-0.811).在内部验证队列的指定置信水平为90%时,错误预测(n=438)占所有预测(n=4187)的百分比为10.5%,1229(29.4%)的预测需要临床医生审查。相比之下,没有适形预测的人工智能模型产生了1449个(34.6%)的误差。当外部验证时,由于数据库间的异质性,更多的预测(n=4004,38.6%)被标记为临床医师审查.然而,与AI的点预测相比,该模型的错误率仍然显著降低(n=1221,11.8%vsn=4540,43.8%).在该预测模型中确定的最重要的预测因子是急性生理学评分III,年龄,尿量,血管升压药,和肺部感染。还检查了导致单个患者的临床相关风险因素,以显示风险是如何产生的。
    结论:通过结合模型解释和适形预测,基于AI的系统可以更好地转化为临床决策的医疗实践。
    BACKGROUND: Early and reliable identification of patients with sepsis who are at high risk of mortality is important to improve clinical outcomes. However, 3 major barriers to artificial intelligence (AI) models, including the lack of interpretability, the difficulty in generalizability, and the risk of automation bias, hinder the widespread adoption of AI models for use in clinical practice.
    OBJECTIVE: This study aimed to develop and validate (internally and externally) a conformal predictor of sepsis mortality risk in patients who are critically ill, leveraging AI-assisted prediction modeling. The proposed approach enables explaining the model output and assessing its confidence level.
    METHODS: We retrospectively extracted data on adult patients with sepsis from a database collected in a teaching hospital at Beth Israel Deaconess Medical Center for model training and internal validation. A large multicenter critical care database from the Philips eICU Research Institute was used for external validation. A total of 103 clinical features were extracted from the first day after admission. We developed an AI model using gradient-boosting machines to predict the mortality risk of sepsis and used Mondrian conformal prediction to estimate the prediction uncertainty. The Shapley additive explanation method was used to explain the model.
    RESULTS: A total of 16,746 (80%) patients from Beth Israel Deaconess Medical Center were used to train the model. When tested on the internal validation population of 4187 (20%) patients, the model achieved an area under the receiver operating characteristic curve of 0.858 (95% CI 0.845-0.871), which was reduced to 0.800 (95% CI 0.789-0.811) when externally validated on 10,362 patients from the Philips eICU database. At a specified confidence level of 90% for the internal validation cohort the percentage of error predictions (n=438) out of all predictions (n=4187) was 10.5%, with 1229 (29.4%) predictions requiring clinician review. In contrast, the AI model without conformal prediction made 1449 (34.6%) errors. When externally validated, more predictions (n=4004, 38.6%) were flagged for clinician review due to interdatabase heterogeneity. Nevertheless, the model still produced significantly lower error rates compared to the point predictions by AI (n=1221, 11.8% vs n=4540, 43.8%). The most important predictors identified in this predictive model were Acute Physiology Score III, age, urine output, vasopressors, and pulmonary infection. Clinically relevant risk factors contributing to a single patient were also examined to show how the risk arose.
    CONCLUSIONS: By combining model explanation and conformal prediction, AI-based systems can be better translated into medical practice for clinical decision-making.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:预后指数可以增强对健康负担的个性化预测。然而,一个简单的,缺乏临床使用的实用和可重复的工具。这项研究旨在开发一种基于机器学习的预后指数,用于预测社区居住的老年人的全因死亡率。
    方法:我们利用台湾健康老龄化纵向研究(HALST)队列,包括来自5663名参与者的数据。在5年的随访中,已确认447人死亡。使用基于机器学习技术的常见实验室测试开发了基于机器学习的常规血液检查预后指数(MARBE-PI)。根据MARBE-PI评分,通过特定层的似然比分析将参与者分为多个风险类别。随后,MARBE-PI与来自日本的独立基于人群的队列进行了外部验证。
    结果:超过年龄,性别,教育水平和BMI,六项实验室测试(LDL,白蛋白,AST,淋巴细胞计数,hsCRP,和肌酐)是通过逐步逻辑回归5年死亡率的关键预测因子。内部和外部验证数据集通过逻辑回归构建的MARBE-PI的AUC分别为0.799(95%CI:0.778-0.819)和0.756(95%CI:0.694-0.814)。在两个数据集中,延长的10年死亡率分别为0.801(95%CI:0.790-0.811)和0.809(95%CI:0.774-0.845),分别。根据MARBE-PI分层的风险类别显示出与死亡率一致的剂量反应相关性。MARBE-PI还与根据临床健康缺陷和/或实验室结果构建的指数进行比较。
    结论:在繁忙的临床环境中,MARBE-PI被认为是最适用的风险分层方法。它有可能确定老年人死亡率升高的风险,从而帮助临床决策。
    BACKGROUND: Prognostic indices can enhance personalized predictions of health burdens. However, a simple, practical, and reproducible tool is lacking for clinical use. This study aimed to develop a machine learning-based prognostic index for predicting all-cause mortality in community-dwelling older individuals.
    METHODS: We utilized the Healthy Aging Longitudinal Study in Taiwan (HALST) cohort, encompassing data from 5 663 participants. Over the 5-year follow-up, 447 deaths were confirmed. A machine learning-based routine blood examination prognostic index (MARBE-PI) was developed using common laboratory tests based on machine learning techniques. Participants were grouped into multiple risk categories by stratum-specific likelihood ratio analysis based on their MARBE-PI scores. The MARBE-PI was subsequently externally validated with an independent population-based cohort from Japan.
    RESULTS: Beyond age, sex, education level, and BMI, 6 laboratory tests (low-density lipoprotein, albumin, aspartate aminotransferase, lymphocyte count, high-sensitivity C-reactive protein, and creatinine) emerged as pivotal predictors via stepwise logistic regression (LR) for 5-year mortality. The area under curves of MARBE-PI constructed by LR were 0.799 (95% confidence interval [95% CI]: 0.778-0.819) and 0.756 (95% CI: 0.694-0.814) for the internal and external validation data sets, and were 0.801 (95% CI: 0.790-0.811) and 0.809 (95% CI: 0.774-0.845) for the extended 10-year mortality in both data sets, respectively. Risk categories stratified by MARBE-PI showed a consistent dose-response association with mortality. The MARBE-PI also performed comparably with indices constructed with clinical health deficits and/or laboratory results.
    CONCLUSIONS: The MARBE-PI is considered the most applicable measure for risk stratification in busy clinical settings. It holds potential to pinpoint older individuals at elevated mortality risk, thereby aiding clinical decision-making.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Multicenter Study
    背景:结直肠癌(CRC)是癌症相关死亡的第二大原因。本研究旨在使用机器学习(ML)方法预测CRC患者的生存结果。
    方法:回顾性分析包括2006年10月至2019年7月在伊朗三家著名三级医院收治的1853例CRC患者。六种ML方法,即逻辑回归(LR),朴素贝叶斯(NB),支持向量机(SVM)神经网络(NN),决策树(DT)和轻型梯度增压机(LGBM),进行了10倍交叉验证。特征选择采用基于平均减少GINI标准的随机森林方法。使用曲线下面积(AUC)评估模型性能。
    结果:诊断时间,年龄,肿瘤大小,转移状态,淋巴结受累,基于平均GINI下降,治疗类型成为生存的关键预测因子。NB(AUC=0.70,95%置信区间[CI]0.65-0.75)和LGBM(AUC=0.70,95%CI0.65-0.75)模型实现了CRC患者生存的最高预测AUC值。
    结论:这项研究强调了变量的重要性,包括从诊断开始的时间,年龄,肿瘤大小,转移状态,淋巴结受累,和治疗类型在预测CRC生存率中的作用。NB模型在死亡率预测中表现出最佳效果,保持平衡的灵敏度和特异性。政策建议包括对CRC患者的早期诊断和治疗。通过数字健康记录和标准化协议改进数据收集,支持临床决策中的预测分析集成,并在治疗指南中纳入已确定的预后变量,以提高患者的预后。
    Colorectal cancer (CRC) ranks as the second leading cause of cancer-related deaths. This study aimed to predict survival outcomes of CRC patients using machine learning (ML) methods.
    A retrospective analysis included 1853 CRC patients admitted to three prominent tertiary hospitals in Iran from October 2006 to July 2019. Six ML methods, namely logistic regression (LR), Naïve Bayes (NB), Support Vector Machine (SVM), Neural Network (NN), Decision Tree (DT), and Light Gradient Boosting Machine (LGBM), were developed with 10-fold cross-validation. Feature selection employed the Random Forest method based on mean decrease GINI criteria. Model performance was assessed using Area Under the Curve (AUC).
    Time from diagnosis, age, tumor size, metastatic status, lymph node involvement, and treatment type emerged as crucial predictors of survival based on mean decrease GINI. The NB (AUC = 0.70, 95% Confidence Interval [CI] 0.65-0.75) and LGBM (AUC = 0.70, 95% CI 0.65-0.75) models achieved the highest predictive AUC values for CRC patient survival.
    This study highlights the significance of variables including time from diagnosis, age, tumor size, metastatic status, lymph node involvement, and treatment type in predicting CRC survival. The NB model exhibited optimal efficacy in mortality prediction, maintaining a balanced sensitivity and specificity. Policy recommendations encompass early diagnosis and treatment initiation for CRC patients, improved data collection through digital health records and standardized protocols, support for predictive analytics integration in clinical decisions, and the inclusion of identified prognostic variables in treatment guidelines to enhance patient outcomes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:在重症监护病房(ICU)中,准确的死亡率预测对于有效的患者管理和资源分配至关重要。简化急性生理学评分II(SAPS-2),虽然常用,严重依赖全面的临床数据和血液样本。这项研究旨在开发一种人工智能(AI)模型,利用关键的血液动力学参数来预测最初24小时内的ICU死亡率,并评估其相对于SAPS-2的性能。
    方法:我们对选定的血流动力学参数和心率曲线结构进行了分析,以确定ICU死亡率的潜在预测因素。随后在不同的患者队列上训练和验证了机器学习模型。然后将AI算法的性能与SAPS-2进行比较,重点是分类准确性,校准,和普适性。
    结果:该研究包括从3月27日开始的1298名ICU患者,2015年3月27日,2017.从2022年到2023年的额外队列包括590名患者,导致1888名患者的总数据集。观察到的死亡率为24.0%。死亡率的关键决定因素是格拉斯哥昏迷量表评分,心率复杂度,患者年龄,舒张压低于50mmHg的持续时间,心率变异性,和特定的平均和收缩压阈值。AI模型,在这些决定因素的指导下,在预测死亡率方面表现出相当的表现,如果不是优越的,到SAPS-2.
    结论:AI模型,将心率和血压曲线分析与基本临床参数相结合,提供了一种方法来预测ICU患者的院内死亡率。该模型提供了现有工具的替代方案,这些工具依赖于广泛的临床数据和实验室输入。将其整合到ICU监测系统中可能会促进更简化的死亡率预测过程。
    BACKGROUND: In intensive care units (ICUs), accurate mortality prediction is crucial for effective patient management and resource allocation. The Simplified Acute Physiology Score II (SAPS-2), though commonly used, relies heavily on comprehensive clinical data and blood samples. This study sought to develop an artificial intelligence (AI) model utilizing key hemodynamic parameters to predict ICU mortality within the first 24 h and assess its performance relative to SAPS-2.
    METHODS: We conducted an analysis of select hemodynamic parameters and the structure of heart rate curves to identify potential predictors of ICU mortality. A machine-learning model was subsequently trained and validated on distinct patient cohorts. The AI algorithm\'s performance was then compared to the SAPS-2, focusing on classification accuracy, calibration, and generalizability.
    RESULTS: The study included 1298 ICU admissions from March 27th, 2015, to March 27th, 2017. An additional cohort from 2022 to 2023 comprised 590 patients, resulting in a total dataset of 1888 patients. The observed mortality rate stood at 24.0%. Key determinants of mortality were the Glasgow Coma Scale score, heart rate complexity, patient age, duration of diastolic blood pressure below 50 mmHg, heart rate variability, and specific mean and systolic blood pressure thresholds. The AI model, informed by these determinants, exhibited a performance profile in predicting mortality that was comparable, if not superior, to the SAPS-2.
    CONCLUSIONS: The AI model, which integrates heart rate and blood pressure curve analyses with basic clinical parameters, provides a methodological approach to predict in-hospital mortality in ICU patients. This model offers an alternative to existing tools that depend on extensive clinical data and laboratory inputs. Its potential integration into ICU monitoring systems may facilitate more streamlined mortality prediction processes.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目的:设计一种新模型,利用移植前数据预测中国人群移植后死亡率,并将其性能与现有模型进行比较。
    方法:在这项多中心研究中,在训练组中招募了544名非肿瘤适应症的肝移植接受者,在验证组中招募了276名患者。使用C统计量将新的简化死亡率预测得分(SMOPS)模型与MELD和四个现有模型进行了比较。
    结果:SMOPS模型使用从训练组中筛选的6个独立的移植前危险因素(慢性肝衰竭/器官衰竭评分,发烧>37.6℃,ABO血型相容性,动脉乳酸水平,白细胞计数和重新移植)。SMOPS准确预测了患者的30天,肝移植后90天和365天死亡率,它的分数比其他模型更准确。SMOPS产生了四个风险级别:低风险(<10分),中等风险(11-20分),高风险(21-25分)和无效风险(≥26分)。在MELD=40和MELD<40之间,所有风险水平内的生存率没有差异。适度内的生存-,高危或极端危型ALF在ALF和非ALF之间没有差异.
    结论:SMOPS模型使用移植前风险因素对移植后存活进行分层,优于目前中国人群的模型,并有可能有助于改进机构分配政策。
    OBJECTIVE: To designed a new model using pre-transplant data to predict post-transplant mortality for Chinese population and compared its performance to that of existing models.
    METHODS: In this multicenter study, 544 recipients of liver transplants for non-tumor indications were enrolled in the training group and 276 patients in the validation group. The new Simplified Mortality Prediction Scores (SMOPS) model was compared to the MELD and four existing models using the C-statistic.
    RESULTS: SMOPS model used 6 independent pre-transplantation risk factors screened from the training group (chronic liver failure/organ failure scores, fever > 37.6 ℃, ABO blood-type compatibility, arterial lactate level, leukocyte count and re-transplantation). The SMOPS accurately predicted patients\' 30-day, 90-day and 365-day mortality following liver transplantation, and its\' scores were more accurate than those of the other models. The SMOPS generated four levels of risk: low risk (<10 points), moderate risk (11-20 points), high risk (21-25 points) and futile risk (≥26 points). The survival within all risk levels was not different between MELD=40 and MELD<40. The survival within moderate-, high- or extreme-risk ALF was not different between ALF and non-ALF.
    CONCLUSIONS: The SMOPS model uses pre-transplant risk factors to stratify post-transplant survival and is superior to current models for Chinese population, and has the potential to contribute to improvements in organ-allocation policies.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号