SHapley Additive exPlanations

Shapley 添加剂 explanations
  • 文章类型: Journal Article
    生物材料研究的最新进展为预测各种材料特性提供了人工智能。然而,基于氨基酸序列预测生物材料力学性能的研究一直缺乏。这项研究率先使用分类模型来预测丝纤维氨基酸序列的极限拉伸强度,采用逻辑回归,具有各种内核的支持向量机,和深度神经网络(DNN)。值得注意的是,该模型在泛化测试中表现出0.83的高精度。该研究引入了一种超越传统实验方法的创新方法来预测生物材料力学特性。认识到传统线性预测模型的局限性,该研究强调了未来的DNN轨迹,可以以高精度巧妙地捕获非线性关系。此外,通过不同预测模型之间的综合性能比较,该研究提供了对预测某些材料的机械性能的特定模型的有效性的见解。总之,这项研究是一项开创性的贡献,为未来的努力奠定基础,并倡导将人工智能方法无缝集成到材料研究中。
    Recent advancements in biomaterial research conduct artificial intelligence for predicting diverse material properties. However, research predicting the mechanical properties of biomaterial based on amino acid sequences have been notably absent. This research pioneers the use of classification models to predict ultimate tensile strength from silk fiber amino acid sequences, employing logistic regression, support vector machines with various kernels, and a deep neural network (DNN). Remarkably, the model demonstrates a high accuracy of 0.83 during the generalization test. The study introduces an innovative approach to predicting biomaterial mechanical properties beyond traditional experimental methods. Recognizing the limitations of conventional linear prediction models, the research emphasizes the future trajectory toward DNNs that can adeptly capture non-linear relationships with high precision. Moreover, through comprehensive performance comparisons among diverse prediction models, the study offers insights into the effectiveness of specific models for predicting the mechanical properties of certain materials. In conclusion, this study serves as a pioneering contribution, laying the groundwork for future endeavors and advocating for the seamless integration of AI methodologies into materials research.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:由于多重耐药生物体(MDROs)引起的医疗保健相关感染,如耐甲氧西林金黄色葡萄球菌(MRSA)和艰难梭菌(CDI),给我们的医疗基础设施带来沉重负担。
    目的:MDROs的筛查是防止传播的重要机制,但却是资源密集型的。这项研究的目的是开发可以使用电子健康记录(EHR)数据预测定植或感染风险的自动化工具,提供有用的信息来帮助感染控制,并指导经验性抗生素覆盖。
    方法:我们回顾性地开发了一个机器学习模型来检测在弗吉尼亚大学医院住院患者样本采集时未分化患者的MRSA定植和感染。我们使用来自患者EHR数据的入院和住院期间信息的临床和非临床特征来构建模型。此外,我们在EHR数据中使用了一类从联系网络派生的特征;这些网络特征可以捕获患者与提供者和其他患者的联系,提高预测MRSA监测试验结果的模型可解释性和准确性。最后,我们探索了不同患者亚群的异质模型,例如,入住重症监护病房或急诊科的人或有特定检测史的人,哪个表现更好。
    结果:我们发现惩罚逻辑回归比其他方法表现更好,当我们使用多项式(二次)变换特征时,该模型的性能根据其接收器操作特征-曲线下面积得分提高了近11%。预测MDRO风险的一些重要特征包括抗生素使用,手术,使用设备,透析,患者的合并症状况,和网络特征。其中,网络功能增加了最大的价值,并将模型的性能提高了至少15%。对于特定患者亚群,具有相同特征转换的惩罚逻辑回归模型也比其他模型表现更好。
    结论:我们的研究表明,使用来自EHR数据的临床和非临床特征,通过机器学习方法可以非常有效地进行MRSA风险预测。网络特征是最具预测性的,并且提供优于现有方法的显著改进。此外,不同患者亚群的异质预测模型提高了模型的性能。
    BACKGROUND: Health care-associated infections due to multidrug-resistant organisms (MDROs), such as methicillin-resistant Staphylococcus aureus (MRSA) and Clostridioides difficile (CDI), place a significant burden on our health care infrastructure.
    OBJECTIVE: Screening for MDROs is an important mechanism for preventing spread but is resource intensive. The objective of this study was to develop automated tools that can predict colonization or infection risk using electronic health record (EHR) data, provide useful information to aid infection control, and guide empiric antibiotic coverage.
    METHODS: We retrospectively developed a machine learning model to detect MRSA colonization and infection in undifferentiated patients at the time of sample collection from hospitalized patients at the University of Virginia Hospital. We used clinical and nonclinical features derived from on-admission and throughout-stay information from the patient\'s EHR data to build the model. In addition, we used a class of features derived from contact networks in EHR data; these network features can capture patients\' contacts with providers and other patients, improving model interpretability and accuracy for predicting the outcome of surveillance tests for MRSA. Finally, we explored heterogeneous models for different patient subpopulations, for example, those admitted to an intensive care unit or emergency department or those with specific testing histories, which perform better.
    RESULTS: We found that the penalized logistic regression performs better than other methods, and this model\'s performance measured in terms of its receiver operating characteristics-area under the curve score improves by nearly 11% when we use polynomial (second-degree) transformation of the features. Some significant features in predicting MDRO risk include antibiotic use, surgery, use of devices, dialysis, patient\'s comorbidity conditions, and network features. Among these, network features add the most value and improve the model\'s performance by at least 15%. The penalized logistic regression model with the same transformation of features also performs better than other models for specific patient subpopulations.
    CONCLUSIONS: Our study shows that MRSA risk prediction can be conducted quite effectively by machine learning methods using clinical and nonclinical features derived from EHR data. Network features are the most predictive and provide significant improvement over prior methods. Furthermore, heterogeneous prediction models for different patient subpopulations enhance the model\'s performance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    早期检测老年人的认知能力下降对于有效干预至关重要。这项研究,马鞍山健康老龄化队列研究的一部分,检查了2288名认知功能正常的参与者。42个潜在的预测因子,包括人口统计,慢性疾病,生活方式因素,和基线认知功能,被选中。数据集分为训练,验证,和测试集(60%,20%,20%,分别)。递归特征消除(RFE)和六种机器学习算法用于模型开发。使用曲线下面积(AUC)评估模型性能,特异性,灵敏度,和准确性。沙普利附加扩张(SHAP)被应用于可解释性,揭示了十大有影响力的特征:基线MMSE,教育,经济地位,社会活动,PSQI,BMI,SBP,DBP,IADL,和年龄。基于朴素贝叶斯(NB)算法的模型在测试集上实现了0.820(95%CI0.773-0.887)的AUC,优于其他算法。该模型可以帮助社区环境中的初级卫生保健人员识别出老年人中三年内患认知障碍风险较高的个体。
    BACKGROUND: The prevalence of cognitive impairment and dementia in the older population is increasing, and thereby, early detection of cognitive decline is essential for effective intervention.
    METHODS: This study included 2,288 participants with normal cognitive function from the Ma\'anshan Healthy Aging Cohort Study. Forty-two potential predictors, including demographic characteristics, chronic diseases, lifestyle factors, anthropometric indices, physical function, and baseline cognitive function, were selected based on clinical importance and previous research. The dataset was partitioned into training, validation, and test sets in a proportion of 60% for training, 20% for validation, and 20% for testing, respectively. Recursive feature elimination was used for feature selection, followed by six machine learning algorithms that were employed for model development. The performance of the models was evaluated using area under the curve (AUC), specificity, sensitivity, and accuracy. Moreover, SHapley Additive exPlanations (SHAP) was conducted to access the interpretability of the final selected model and to gain insights into the impact of features on the prediction outcomes. SHAP force plots were established to vividly show the application of the prediction model at the individual level.
    RESULTS: The final predictive model based on the Naive Bayes algorithm achieved an AUC of 0.820 (95% CI, 0.773-0.887) on the test set, outperforming other algorithms. The top ten influential features in the model included baseline Mini-Mental State Examination (MMSE), education, self-reported economic status, collective or social activities, Pittsburgh sleep quality index (PSQI), body mass index, systolic blood pressure, diastolic blood pressure, instrumental activities of daily living, and age. The model demonstrated the potential to identify individuals at a higher risk of cognitive impairment within 3 years from older adults.
    CONCLUSIONS: The predictive model developed in this study contributes to the early detection of cognitive impairment in older adults by primary healthcare staff in community settings.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:自杀是青少年死亡的第二大原因,并且与自杀集群有关。尽管对这种可预防的死亡原因进行了大量研究,重点主要是单一国家和传统的统计方法。
    目的:本研究旨在使用跨国数据集和机器学习(ML)开发青少年自杀思维的预测模型。
    方法:我们使用韩国青少年风险行为网络调查的数据,对566,875名年龄在13至18岁之间的青少年进行调查,并使用青少年风险行为调查对103,874名青少年进行外部验证,挪威大学国家综合调查对19,574名青少年进行验证。开发了几种基于树的机器学习模型,并对特征重要性和Shapley加性解释值进行分析,以确定青少年自杀思维的危险因素。
    结果:在对来自韩国的基于韩国青年风险行为网络的调查数据进行训练时,以95%的CI,XGBoost模型报告的接受者工作特征(AUROC)曲线下面积为90.06%(95%CI89.97-90.16),与其他型号相比,表现出卓越的性能。对于使用美国青年风险行为调查数据和挪威大学国家综合调查的外部验证,XGBoost模型的AUROC分别为83.09%和81.27%,分别。在所有数据集中,XGBoost始终优于AUROC得分最高的其他模型,并被选为最优模型。就自杀思维的预测因素而言,悲伤和绝望的感觉是最有影响力的,占影响的57.4%,其次是压力状态为19.8%。其次是年龄(5.7%),家庭收入(4%),学业成绩(3.4%),性别(2.1%),和其他人,各贡献不到2%。
    结论:本研究通过整合来自3个国家的不同数据集来使用ML来解决青少年自杀问题。研究结果强调了情绪健康指标在预测青少年自杀思维中的重要作用。具体来说,悲伤和绝望被认为是最重要的预测因素,其次是压力条件和年龄。这些发现强调了青春期早期诊断和预防心理健康问题的迫切需要。
    BACKGROUND: Suicide is the second-leading cause of death among adolescents and is associated with clusters of suicides. Despite numerous studies on this preventable cause of death, the focus has primarily been on single nations and traditional statistical methods.
    OBJECTIVE: This study aims to develop a predictive model for adolescent suicidal thinking using multinational data sets and machine learning (ML).
    METHODS: We used data from the Korea Youth Risk Behavior Web-based Survey with 566,875 adolescents aged between 13 and 18 years and conducted external validation using the Youth Risk Behavior Survey with 103,874 adolescents and Norway\'s University National General Survey with 19,574 adolescents. Several tree-based ML models were developed, and feature importance and Shapley additive explanations values were analyzed to identify risk factors for adolescent suicidal thinking.
    RESULTS: When trained on the Korea Youth Risk Behavior Web-based Survey data from South Korea with a 95% CI, the XGBoost model reported an area under the receiver operating characteristic (AUROC) curve of 90.06% (95% CI 89.97-90.16), displaying superior performance compared to other models. For external validation using the Youth Risk Behavior Survey data from the United States and the University National General Survey from Norway, the XGBoost model achieved AUROCs of 83.09% and 81.27%, respectively. Across all data sets, XGBoost consistently outperformed the other models with the highest AUROC score, and was selected as the optimal model. In terms of predictors of suicidal thinking, feelings of sadness and despair were the most influential, accounting for 57.4% of the impact, followed by stress status at 19.8%. This was followed by age (5.7%), household income (4%), academic achievement (3.4%), sex (2.1%), and others, which contributed less than 2% each.
    CONCLUSIONS: This study used ML by integrating diverse data sets from 3 countries to address adolescent suicide. The findings highlight the important role of emotional health indicators in predicting suicidal thinking among adolescents. Specifically, sadness and despair were identified as the most significant predictors, followed by stressful conditions and age. These findings emphasize the critical need for early diagnosis and prevention of mental health issues during adolescence.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:开发和比较基于三相对比增强CT(CECT)的机器学习模型,以区分良性和恶性肾脏肿瘤。
    方法:总共,427名患者来自两个医疗中心:中心1(用作训练集)和中心2(用作外部验证集)。首先,从皮质髓质期(CP)中单独提取1781个放射学特征,肾图相位(NP),和排泄期(EP)CECT图像,之后,通过最小冗余最大相关性方法选择10个特征。第二,随机森林(RF)模型由单相特征(CP,NP,和EP)以及来自所有三个阶段(TP)的特征组合。第三,在训练集和外部验证集中评估RF模型.最后,模型的内部预测机制由SHapley加法扩张(SHAP)方法解释。
    结果:共纳入了来自中心1的266例肾脏肿瘤患者和来自中心2的161例患者。在训练集中,从CP构建的RF模型的AUC,NP,EP,TP特征分别为0.886、0.912、0.930和0.944。在外部验证集中,模型的AUC分别为0.860,0.821,0.921和0.908.根据SHAP方法,“original_shape_flatness”特征在基于EP特征的RF模型的预测结果中起着最重要的作用。
    结论:四种RF模型可有效区分良性和恶性实体肾肿瘤,基于EP特征的RF模型显示最佳性能。
    BACKGROUND: To develop and compare machine learning models based on triphasic contrast-enhanced CT (CECT) for distinguishing between benign and malignant renal tumors.
    METHODS: In total, 427 patients were enrolled from two medical centers: Center 1 (serving as the training set) and Center 2 (serving as the external validation set). First, 1781 radiomic features were individually extracted from corticomedullary phase (CP), nephrographic phase (NP), and excretory phase (EP) CECT images, after which 10 features were selected by the minimum redundancy maximum relevance method. Second, random forest (RF) models were constructed from single-phase features (CP, NP, and EP) as well as from the combination of features from all three phases (TP). Third, the RF models were assessed in the training and external validation sets. Finally, the internal prediction mechanisms of the models were explained by the SHapley Additive exPlanations (SHAP) approach.
    RESULTS: A total of 266 patients with renal tumors from Center 1 and 161 patients from Center 2 were included. In the training set, the AUCs of the RF models constructed from the CP, NP, EP, and TP features were 0.886, 0.912, 0.930, and 0.944, respectively. In the external validation set, the models achieved AUCs of 0.860, 0.821, 0.921, and 0.908, respectively. The \"original_shape_Flatness\" feature played the most important role in the prediction outcome for the RF model based on EP features according to the SHAP method.
    CONCLUSIONS: The four RF models efficiently differentiated benign from malignant solid renal tumors, with the EP feature-based RF model displaying the best performance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:尽管胃腺癌(GA)相关的眼部转移(OM)很少见,它的发生表明疾病更严重。我们旨在利用机器学习(ML)来分析GA相关OM的风险因素并预测其风险。方法:本研究为回顾性队列研究。收集3532名GA患者的临床数据,并以7:3的比例随机分为训练集和验证集。具有或不具有OM的那些被分类为OM和非OM(NOM)组。进行了单变量和多变量逻辑回归分析以及最小绝对收缩率和选择算子。我们集成了通过特征重要性排名识别的变量,并在将其纳入ML模型之前,使用基于随机森林(RF)算法的正向顺序特征选择进一步完善了选择过程。我们应用了六种ML算法来构建预测GA模型。受试者工作特征(ROC)曲线下的面积表明了模型的预测能力。此外,建立了基于最佳性能模型的网络风险计算器。我们使用Shapley加性解释(SHAP)来识别风险因素并确认黑盒模型的可解释性。我们已经取消了所有患者的详细信息。结果:ML模型,由13个变量组成,使用梯度增强机(GBM)模型实现了最佳预测性能,测试集中曲线下面积(AUC)为0.997。利用SHAP方法,我们确定了GA患者OM的关键因素,包括LDL,CA724,CEA,法新社,CA125,Hb,CA153和Ca2+。此外,我们通过对2例患者病例的分析验证了模型的可靠性,并开发了基于GBM模型的功能性在线网络预测计算器.结论:我们使用ML方法建立了GA相关OM的风险预测模型,并表明GBM在六个ML模型中表现最好。该模型可以识别患有GA相关OM的患者以提供早期和及时的治疗。
    Background: Although gastric adenocarcinoma (GA) related ocular metastasis (OM) is rare, its occurrence indicates a more severe disease. We aimed to utilize machine learning (ML) to analyze the risk factors of GA-related OM and predict its risks. Methods: This is a retrospective cohort study. The clinical data of 3532 GA patients were collected and randomly classified into training and validation sets in a ratio of 7:3. Those with or without OM were classified into OM and non-OM (NOM) groups. Univariate and multivariate logistic regression analyses and least absolute shrinkage and selection operator were conducted. We integrated the variables identified through feature importance ranking and further refined the selection process using forward sequential feature selection based on random forest (RF) algorithm before incorporating them into the ML model. We applied six ML algorithms to construct the predictive GA model. The area under the receiver operating characteristic (ROC) curve indicated the model\'s predictive ability. Also, we established a network risk calculator based on the best performance model. We used Shapley additive interpretation (SHAP) to identify risk factors and to confirm the interpretability of the black box model. We have de-identified all patient details. Results: The ML model, consisting of 13 variables, achieved an optimal predictive performance using the gradient boosting machine (GBM) model, with an impressive area under the curve (AUC) of 0.997 in the test set. Utilizing the SHAP method, we identified crucial factors for OM in GA patients, including LDL, CA724, CEA, AFP, CA125, Hb, CA153, and Ca2+. Additionally, we validated the model\'s reliability through an analysis of two patient cases and developed a functional online web prediction calculator based on the GBM model. Conclusion: We used the ML method to establish a risk prediction model for GA-related OM and showed that GBM performed best among the six ML models. The model may identify patients with GA-related OM to provide early and timely treatment.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    获取并验证可用于AKI监测和管理以改善临床结果的急性肾损伤(AKI)的机器学习(ML)预测模型。
    这项回顾性队列研究在阜外医院进行,包括2017年1月1日至2018年12月31日期间收治的18岁及以上接受心脏手术的患者.随机选择70%的观察结果进行训练,其余30%进行测试。人口统计,合并症,实验室检查参数,和操作细节用于通过逻辑回归和极限梯度增强(Xgboost)构建AKI的预测模型。通过受试者操作特征(AUROC)曲线下的面积在测试队列上评估每个模型的区别性,同时通过校准图进行校准。
    本研究共纳入15880名患者,4845例(30.5%)发生AKI。与逻辑回归相比,Xgboost模型具有更高的判别能力(AUROC,0.849[95%CI,0.837-0.861]vs0.803[95%CI0.790-0.817],P<0.001)在测试数据集中。估计的肾小球滤过率(eGFR)和肌酸在重症监护病房(ICU)的到达是两个最重要的预测参数。SHAP摘要图用于说明归因于Xgboost模型的前15个特征的效果。
    ML模型可以提供临床决策支持,以确定哪些患者应专注于围手术期预防性治疗,以通过预测哪些患者没有风险来先发制人地减少急性肾损伤。
    UNASSIGNED: To derive and validate a machine learning (ML) prediction model of acute kidney injury (AKI) that could be used for AKI surveillance and management to improve clinical outcomes.
    UNASSIGNED: This retrospective cohort study was conducted in Fuwai Hospital, including patients aged 18 years and above undergoing cardiac surgery admitted between January 1, 2017, and December 31, 2018. Seventy percent of the observations were randomly selected for training and the remaining 30% for testing. The demographics, comorbidities, laboratory examination parameters, and operation details were used to construct a prediction model for AKI by logistic regression and eXtreme gradient boosting (Xgboost). The discrimination of each model was assessed on the test cohort by the area under the receiver operator characteristic (AUROC) curve, while calibration was performed by the calibration plot.
    UNASSIGNED: A total of 15,880 patients were enrolled in this study, and 4845 (30.5%) had developed AKI. Xgboost model had the higher discriminative ability compared with logistic regression (AUROC, 0.849 [95% CI, 0.837-0.861] vs 0.803[95% CI 0.790-0.817], P<0.001) in the test dataset. The estimated glomerular filtration (eGFR) and creatine on intensive care unit (ICU) arrival are the two most important prediction parameters. A SHAP summary plot was used to illustrate the effects of the top 15 features attributed to the Xgboost model.
    UNASSIGNED: ML models can provide clinical decision support to determine which patients should focus on perioperative preventive treatment to preemptively reduce acute kidney injury by predicting which patients are not at risk.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    确定肝细胞癌(HCC)患者肝切除术后复发的高风险,有助于及时实施介入治疗。本研究旨在开发一种机器学习(ML)模型来预测肝癌患者肝切除术后的复发风险。
    我们回顾性收集了2013年4月至2017年10月在中山大学附属第三医院接受根治性肝切除术的315例HCC患者,并以7:3的比例随机分为训练集和验证集。根据HCC患者术后复发情况,将患者分为复发组和未复发组,并对两组进行单因素和多因素logistic回归。我们应用了六种机器学习算法来构建预测模型,并通过10倍交叉验证进行了内部验证。Shapley加性解释(SHAP)方法用于解释机器学习模型。我们还建立了一个基于最佳机器学习模型的网络计算器,以个性化评估肝癌患者肝切除术后的复发风险。
    机器学习模型中包含了总共13个变量。多层感知器(MLP)机器学习模型在测试集(AUC=0.680)中具有最佳预测值。SHAP方法显示γ-谷氨酰转肽酶(GGT),纤维蛋白原,中性粒细胞,谷草转氨酶(AST)和总胆红素(TB)是肝癌患者肝切除术后复发风险的前5个重要因素。此外,我们通过分析两名患者进一步证明了模型的可靠性.最后,我们成功构建了基于MLP机器学习模型的在线网络预测计算器。
    MLP是预测肝癌患者肝切除术后复发风险的最佳机器学习模型。该预测模型可以帮助识别肝切除术后高复发风险的HCC患者,以提供早期和个性化的治疗。
    UNASSIGNED: Identifying patients with hepatocellular carcinoma (HCC) at high risk of recurrence after hepatectomy can help to implement timely interventional treatment. This study aimed to develop a machine learning (ML) model to predict the recurrence risk of HCC patients after hepatectomy.
    UNASSIGNED: We retrospectively collected 315 HCC patients who underwent radical hepatectomy at the Third Affiliated Hospital of Sun Yat-sen University from April 2013 to October 2017, and randomly divided them into the training and validation sets at a ratio of 7:3. According to the postoperative recurrence of HCC patients, the patients were divided into recurrence group and non-recurrence group, and univariate and multivariate logistic regression were performed for the two groups. We applied six machine learning algorithms to construct the prediction models and performed internal validation by 10-fold cross-validation. Shapley additive explanations (SHAP) method was applied to interpret the machine learning model. We also built a web calculator based on the best machine learning model to personalize the assessment of the recurrence risk of HCC patients after hepatectomy.
    UNASSIGNED: A total of 13 variables were included in the machine learning models. The multilayer perceptron (MLP) machine learning model was proved to achieve optimal predictive value in test set (AUC = 0.680). The SHAP method displayed that γ-glutamyl transpeptidase (GGT), fibrinogen, neutrophil, aspartate aminotransferase (AST) and total bilirubin (TB) were the top 5 important factors for recurrence risk of HCC patients after hepatectomy. In addition, we further demonstrated the reliability of the model by analyzing two patients. Finally, we successfully constructed an online web prediction calculator based on the MLP machine learning model.
    UNASSIGNED: MLP was an optimal machine learning model for predicting the recurrence risk of HCC patients after hepatectomy. This predictive model can help identify HCC patients at high recurrence risk after hepatectomy to provide early and personalized treatment.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:机器学习(ML)方法在预测结直肠癌(CRC)生存率方面显示出巨大的潜力。然而,到目前为止,引入的ML模型主要集中在二元结果上,并没有考虑这种类型的建模的时间到事件的性质。
    目的:本研究旨在评估ML方法对时间至事件生存数据建模的性能,并开发用于预测CRC特异性生存的透明模型。
    方法:本回顾性队列研究中使用的数据集包含2012年12月28日至2019年12月27日在华西医院新诊断为CRC的患者的信息。四川大学。我们评估了6个代表性ML模型的性能,包括随机生存森林(RSF),梯度增压机(GBM),DeepSurv,DeepHit,神经网络扩展的时间相关Cox(或Cox-Time),和神经多任务逻辑回归(N-MTLR)预测CRC特异性生存率。采用链式方程的多重插补方法来处理变量中的缺失值。多变量分析和临床经验用于选择与CRC生存相关的重要特征。使用时间依赖性一致性指数在重复5次的分层5倍交叉验证中评估模型性能,综合Brier评分,校正曲线,和决策曲线。采用Shapley加法扩展方法计算特征重要性。
    结果:本研究共纳入2157例CRC患者。在6种时间到事件ML模型中,DeepHit模型表现出最佳的辨别能力(时间依赖性一致性指数0.789,95%CI0.779-0.799),RSF模型产生了更好的校准生存估计(综合Brier评分0.096,95%CI0.094-0.099),但这些并不具有统计学意义。此外,RSF,GBM,DeepSurv,考克斯时间,和N-MTLR模型在辨别和校准方面具有与Cox比例风险模型相当的预测准确性。校准曲线显示所有ML模型均表现出良好的5年生存校准。5年CRC特异性生存的决定曲线显示,所有ML模型,尤其是RSF,在临床合理的风险阈值范围内治疗所有患者或不治疗患者的默认策略相比具有更高的净获益.沙普利加性移植方法显示,R0切除,肿瘤淋巴结转移分期,阳性淋巴结数量是影响5年CRC特异性生存率的重要因素。
    结论:本研究显示了应用时间至事件ML预测算法来帮助预测CRC特异性生存率的潜力。RSF,GBM,考克斯时间,和N-MTLR算法可以为Cox比例风险模型提供非参数替代方法来估计CRC患者的生存概率.透明的时间到事件ML模型帮助临床医生更准确地预测这些患者的生存率,并通过启用由可解释的ML模型提供信息的个性化治疗计划来改善患者预后。
    Machine learning (ML) methods have shown great potential in predicting colorectal cancer (CRC) survival. However, the ML models introduced thus far have mainly focused on binary outcomes and have not considered the time-to-event nature of this type of modeling.
    This study aims to evaluate the performance of ML approaches for modeling time-to-event survival data and develop transparent models for predicting CRC-specific survival.
    The data set used in this retrospective cohort study contains information on patients who were newly diagnosed with CRC between December 28, 2012, and December 27, 2019, at West China Hospital, Sichuan University. We assessed the performance of 6 representative ML models, including random survival forest (RSF), gradient boosting machine (GBM), DeepSurv, DeepHit, neural net-extended time-dependent Cox (or Cox-Time), and neural multitask logistic regression (N-MTLR) in predicting CRC-specific survival. Multiple imputation by chained equations method was applied to handle missing values in variables. Multivariable analysis and clinical experience were used to select significant features associated with CRC survival. Model performance was evaluated in stratified 5-fold cross-validation repeated 5 times by using the time-dependent concordance index, integrated Brier score, calibration curves, and decision curves. The SHapley Additive exPlanations method was applied to calculate feature importance.
    A total of 2157 patients with CRC were included in this study. Among the 6 time-to-event ML models, the DeepHit model exhibited the best discriminative ability (time-dependent concordance index 0.789, 95% CI 0.779-0.799) and the RSF model produced better-calibrated survival estimates (integrated Brier score 0.096, 95% CI 0.094-0.099), but these are not statistically significant. Additionally, the RSF, GBM, DeepSurv, Cox-Time, and N-MTLR models have comparable predictive accuracy to the Cox Proportional Hazards model in terms of discrimination and calibration. The calibration curves showed that all the ML models exhibited good 5-year survival calibration. The decision curves for CRC-specific survival at 5 years showed that all the ML models, especially RSF, had higher net benefits than default strategies of treating all or no patients at a range of clinically reasonable risk thresholds. The SHapley Additive exPlanations method revealed that R0 resection, tumor-node-metastasis staging, and the number of positive lymph nodes were important factors for 5-year CRC-specific survival.
    This study showed the potential of applying time-to-event ML predictive algorithms to help predict CRC-specific survival. The RSF, GBM, Cox-Time, and N-MTLR algorithms could provide nonparametric alternatives to the Cox Proportional Hazards model in estimating the survival probability of patients with CRC. The transparent time-to-event ML models help clinicians to more accurately predict the survival rate for these patients and improve patient outcomes by enabling personalized treatment plans that are informed by explainable ML models.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:眼部转移(OM)是原发性肝癌(PLC)的罕见转移部位。目的建立基于机器学习(ML)的PLC患者OM临床预测模型。
    方法:我们回顾性收集了1540名PLC患者的临床数据,并将其按7:3的比例分为训练集和内部测试集。将PLC患者分为OM组和非眼转移(NOM)组,两组间进行单因素logistic回归分析。对于ML模型选择具有单变量逻辑分析p<0.05的变量。我们构建了六个机器学习模型,通过10倍交叉验证进行了内部验证。通过受试者工作特征曲线(ROC)评估每个ML模型的预测性能。我们还基于最佳性能ML模型构建了一个网络计算器,以个性化OM的风险概率。
    结果:为ML模型选择了六个变量。极端梯度增强(XGB)ML模型实现了最优鉴别诊断能力,曲线下面积(AUC)=0.993,准确性=0.992,敏感性=0.998,特异性=0.984。基于这些结果,使用XGBML模型构建了一个在线网络计算器,以帮助临床医生诊断和治疗PLC患者OM的风险概率.最后,Shapley加性解释(SHAP)库用于获得PLC患者OM的六个最重要的危险因素:CA125,ALP,法新社,TG,CA199和CEA。
    结论:我们使用XGB模型建立了PLC患者OM的风险预测模型。预测模型可以帮助识别具有高OM风险的PLC患者,提供早期和个性化的诊断和治疗,降低OM患者的不良预后,提高PLC患者的生活质量。
    Ocular metastasis (OM) is a rare metastatic site of primary liver cancer (PLC). The purpose of this study was to establish a clinical predictive model of OM in PLC patients based on machine learning (ML).
    We retrospectively collected the clinical data of 1540 PLC patients and divided it into a training set and an internal test set in a 7:3 proportion. PLC patients were divided into OM and non-ocular metastasis (NOM) groups, and univariate logistic regression analysis was performed between the two groups. The variables with univariate logistic analysis p < 0.05 were selected for the ML model. We constructed six ML models, which were internally verified by 10-fold cross-validation. The prediction performance of each ML model was evaluated by receiver operating characteristic curves (ROCs). We also constructed a web calculator based on the optimal performance ML model to personalize the risk probability for OM.
    Six variables were selected for the ML model. The extreme gradient boost (XGB) ML model achieved the optimal differential diagnosis ability, with an area under the curve (AUC) = 0.993, accuracy = 0.992, sensitivity = 0.998, and specificity = 0.984. Based on these results, an online web calculator was constructed by using the XGB ML model to help clinicians diagnose and treat the risk probability of OM in PLC patients. Finally, the Shapley additive explanations (SHAP) library was used to obtain the six most important risk factors for OM in PLC patients: CA125, ALP, AFP, TG, CA199, and CEA.
    We used the XGB model to establish a risk prediction model of OM in PLC patients. The predictive model can help identify PLC patients with a high risk of OM, provide early and personalized diagnosis and treatment, reduce the poor prognosis of OM patients, and improve the quality of life of PLC patients.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号