SHAP

SHAP
  • 文章类型: Journal Article
    肺癌是影响人类健康最危险的恶性肿瘤之一。肺腺癌(LUAD)是肺癌最常见的亚型。糖酵解和胆固醇生成途径在癌症的代谢适应中起关键作用。从癌症基因组图谱数据库下载585个LUAD样品的数据集。通过从分子特征数据库v7.5中选择和聚类基因,我们获得了共表达的糖酵解和胆固醇生成基因。我们比较了不同亚型的预后,并确定了亚型之间的差异表达基因。预测结果事件使用机器学习建模,并通过Shapley加性解释分析选择了前9个最重要的预后基因。建立基于多变量Cox分析的风险评分模型。LUAD患者分为四个代谢亚组:糖酵解,静止,和混合。预后最差的是混合亚型。预后模型在测试集中具有很好的预测性能。LUAD患者通过糖酵解和胆固醇生成基因有效分型,并在糖酵解和胆固醇生成富集基因组中被确定为预后最差。该预后模型可以为临床医生预测患者的临床结局提供必要的依据。该模型在训练和测试数据集上具有鲁棒性,并具有良好的预测性能。
    Lung cancer is one of the most dangerous malignant tumors affecting human health. Lung adenocarcinoma (LUAD) is the most common subtype of lung cancer. Both glycolytic and cholesterogenic pathways play critical roles in metabolic adaptation to cancer. A dataset of 585 LUAD samples was downloaded from The Cancer Genome Atlas database. We obtained co-expressed glycolysis and cholesterogenesis genes by selecting and clustering genes from Molecular Signatures Database v7.5. We compared the prognosis of different subtypes and identified differentially expressed genes between subtypes. Predictive outcome events were modeled using machine learning, and the top 9 most important prognostic genes were selected by Shapley additive explanation analysis. A risk score model was built based on multivariate Cox analysis. LUAD patients were categorized into four metabolic subgroups: cholesterogenic, glycolytic, quiescent, and mixed. The worst prognosis was the mixed subtype. The prognostic model had great predictive performance in the test set. Patients with LUAD were effectively typed by glycolytic and cholesterogenic genes and were identified as having the worst prognosis in the glycolytic and cholesterogenic enriched gene groups. The prognostic model can provide an essential basis for clinicians to predict clinical outcomes for patients. The model was robust on the training and test datasets and had a great predictive performance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Published Erratum
    [这更正了文章DOI:10.3389/fmed.2024.1285067。].
    [This corrects the article DOI: 10.3389/fmed.2024.1285067.].
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    这项研究的目的是探索机器学习是否可以用于通过使用从小脑灰质和白质中提取的纹理特征来建立诊断帕金森病(PD)的有效模型,从而识别肉眼无法观察到的细微变化。
    这项研究涉及2010年6月至2023年3月的数据收集期,其中包括来自两个队列的374名受试者。帕金森进展标志物倡议(PPMI)作为训练集,来自24个全球站点的对照组和PD患者(HC:102和PD:102)。我们机构的数据被用作测试集(HC:91和PD:79)。利用机器学习建立基于小脑灰质和白质纹理特征的多模型进行PD诊断。结果通过5倍交叉验证分析进行了评估,计算每个模型的受试者工作特征曲线下面积(AUC)。使用Delong测试比较了每个模型的性能,通过采用Shapley加法解释(SHAP)进一步增强了优化模型的可解释性。
    使用FeAtureExplorer(FAE)软件比较验证数据集中所有管道的AUC。在Kruskal-Wallis(KW)和Lasso(LRLasso)逻辑回归建立的模型中,使用“一标准误差”规则,AUC最高.\'WM_original_glrlm_GrayLevelNonUniformity\'被认为是最稳定和预测功能。
    小脑灰质和白质的纹理特征结合机器学习可能对帕金森病的诊断具有潜在价值,其中白质的异质性可能是更有价值的成像标记。
    UNASSIGNED: The purpose of this study is to explore whether machine learning can be used to establish an effective model for the diagnosis of Parkinson\'s disease (PD) by using texture features extracted from cerebellar gray matter and white matter, so as to identify subtle changes that cannot be observed by the naked eye.
    UNASSIGNED: This study involved a data collection period from June 2010 to March 2023, including 374 subjects from two cohorts. The Parkinson\'s Progression Markers Initiative (PPMI) served as the training set, with control group and PD patients (HC: 102 and PD: 102) from 24 global sites. Our institution\'s data was utilized as the test set (HC: 91 and PD: 79). Machine learning was employed to establish multiple models for PD diagnosis based on texture features of the cerebellum\'s gray and white matter. Results underwent evaluation through 5-fold cross-validation analysis, calculating the area under the receiver operating characteristic curve (AUC) for each model. The performance of each model was compared using the Delong test, and the interpretability of the optimized model was further augmented by employing Shapley additive explanations (SHAP).
    UNASSIGNED: The AUCs for all pipelines in the validation dataset were compared using FeAture Explorer (FAE) software. Among the models established by Kruskal-Wallis (KW) and logistic regression via Lasso (LRLasso), the AUC was highest using the \"one-standard error\" rule. \'WM_original_glrlm_GrayLevelNonUniformity\' was considered the most stable and predictive feature.
    UNASSIGNED: The texture features of cerebellar gray matter and white matter combined with machine learning may have potential value in the diagnosis of Parkinson\'s disease, in which the heterogeneity of white matter may be a more valuable imaging marker.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    三元聚合物太阳能电池(PSC)是目前进一步提高PSC器件性能的最简单和最有效的方法。寻找高性能有机光伏材料,材料结构与器件性能之间的联系在制造前具有重要意义。在这里,首先,建立了文献中报道的874个实验PSC的光伏性能数据库,并探索了分子结构的三种不同指纹表达作为输入特征;结果表明,二维原子对的长指纹可以包含更有效的信息,并提高模型的准确性。通过监督学习,训练了五个机器学习(ML)模型,以构建从二元到三元PSC的光伏性能改善关系的映射。GBDT模型具有最好的预测能力和泛化性。基于该模型筛选了来自非富勒烯受体的18个关键结构特征和影响器件PCE的第三组分,包括带有孤对电子的腈基,卤素原子,一个氧原子,等。有趣的是,JSC或FF基本上增加了增强型设备PCE的结构特征。更重要的是,通过制备高效PSC,进一步验证了ML模型的可靠性。以PM6:BTP-eC9:PY-IT三元PSC为例,该模型的PCE预测(18.03%)与实验结果(17.78%)吻合良好,相对预测误差为1.41%,所有实验结果与预测结果的相对误差均小于5%。这些结果表明,ML是探索PSC光伏性能改善以及加速高效非富勒烯材料设计和应用的有用工具。
    Ternary polymer solar cells (PSCs) are currently the simplest and most efficient way to further improve the device performance in PSCs. To find high-performance organic photovoltaic materials, the established connection between the material structure and device performance before fabrication is of great significance. Herein, firstly, a database of the photovoltaic performance in 874 experimental PSCs reported in the literature is established, and three different fingerprint expressions of a molecular structure are explored as input features; the results show that long fingerprints of 2D atom pairs can contain more effective information and improve the accuracy of the models. Through supervised learning, five machine learning (ML) models were trained to build a mapping of the photovoltaic performance improvement relationship from binary to ternary PSCs. The GBDT model had the best predictive ability and generalization. Eighteen key structural features from a non-fullerene acceptor and the third components that affect the device\'s PCE were screened based on this model, including a nitrile group with lone-pair electron, a halogen atom, an oxygen atom, etc. Interestingly, the structural features for the enhanced device\'s PCE were essentially increased by the Jsc or FF. More importantly, the reliability of the ML model was further verified by preparing the highly efficient PSCs. Taking the PM6:BTP-eC9:PY-IT ternary PSC as an example, the PCE prediction (18.03%) by the model was in good agreement with the experimental results (17.78%), the relative prediction error was 1.41%, and the relative error between all experimental results and predicted results was less than 5%. These results indicate that ML is a useful tool for exploring the photovoltaic performance improvement of PSCs and accelerating the design and application with highly efficient non-fullerene materials.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:由于多重耐药生物体(MDROs)引起的医疗保健相关感染,如耐甲氧西林金黄色葡萄球菌(MRSA)和艰难梭菌(CDI),给我们的医疗基础设施带来沉重负担。
    目的:MDROs的筛查是防止传播的重要机制,但却是资源密集型的。这项研究的目的是开发可以使用电子健康记录(EHR)数据预测定植或感染风险的自动化工具,提供有用的信息来帮助感染控制,并指导经验性抗生素覆盖。
    方法:我们回顾性地开发了一个机器学习模型来检测在弗吉尼亚大学医院住院患者样本采集时未分化患者的MRSA定植和感染。我们使用来自患者EHR数据的入院和住院期间信息的临床和非临床特征来构建模型。此外,我们在EHR数据中使用了一类从联系网络派生的特征;这些网络特征可以捕获患者与提供者和其他患者的联系,提高预测MRSA监测试验结果的模型可解释性和准确性。最后,我们探索了不同患者亚群的异质模型,例如,入住重症监护病房或急诊科的人或有特定检测史的人,哪个表现更好。
    结果:我们发现惩罚逻辑回归比其他方法表现更好,当我们使用多项式(二次)变换特征时,该模型的性能根据其接收器操作特征-曲线下面积得分提高了近11%。预测MDRO风险的一些重要特征包括抗生素使用,手术,使用设备,透析,患者的合并症状况,和网络特征。其中,网络功能增加了最大的价值,并将模型的性能提高了至少15%。对于特定患者亚群,具有相同特征转换的惩罚逻辑回归模型也比其他模型表现更好。
    结论:我们的研究表明,使用来自EHR数据的临床和非临床特征,通过机器学习方法可以非常有效地进行MRSA风险预测。网络特征是最具预测性的,并且提供优于现有方法的显著改进。此外,不同患者亚群的异质预测模型提高了模型的性能。
    BACKGROUND: Health care-associated infections due to multidrug-resistant organisms (MDROs), such as methicillin-resistant Staphylococcus aureus (MRSA) and Clostridioides difficile (CDI), place a significant burden on our health care infrastructure.
    OBJECTIVE: Screening for MDROs is an important mechanism for preventing spread but is resource intensive. The objective of this study was to develop automated tools that can predict colonization or infection risk using electronic health record (EHR) data, provide useful information to aid infection control, and guide empiric antibiotic coverage.
    METHODS: We retrospectively developed a machine learning model to detect MRSA colonization and infection in undifferentiated patients at the time of sample collection from hospitalized patients at the University of Virginia Hospital. We used clinical and nonclinical features derived from on-admission and throughout-stay information from the patient\'s EHR data to build the model. In addition, we used a class of features derived from contact networks in EHR data; these network features can capture patients\' contacts with providers and other patients, improving model interpretability and accuracy for predicting the outcome of surveillance tests for MRSA. Finally, we explored heterogeneous models for different patient subpopulations, for example, those admitted to an intensive care unit or emergency department or those with specific testing histories, which perform better.
    RESULTS: We found that the penalized logistic regression performs better than other methods, and this model\'s performance measured in terms of its receiver operating characteristics-area under the curve score improves by nearly 11% when we use polynomial (second-degree) transformation of the features. Some significant features in predicting MDRO risk include antibiotic use, surgery, use of devices, dialysis, patient\'s comorbidity conditions, and network features. Among these, network features add the most value and improve the model\'s performance by at least 15%. The penalized logistic regression model with the same transformation of features also performs better than other models for specific patient subpopulations.
    CONCLUSIONS: Our study shows that MRSA risk prediction can be conducted quite effectively by machine learning methods using clinical and nonclinical features derived from EHR data. Network features are the most predictive and provide significant improvement over prior methods. Furthermore, heterogeneous prediction models for different patient subpopulations enhance the model\'s performance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:慢性应激在德国人群中非常普遍。已知它对心理健康有不良影响,如倦怠和抑郁。慢性压力的已知长期影响是心血管疾病,糖尿病,和癌症。
    目的:本研究旨在基于德国成人健康访谈和检查调查的全国代表性数据,得出一个可解释的多类机器学习模型,用于预测慢性压力水平和预防慢性压力的因素。这是国家健康监测计划的一部分。
    方法:来自德国成人健康访谈和检查调查研究的数据集,包括人口统计学,临床,分析了5801名参与者的实验室数据.构建了一个多类极限梯度提升(XGBoost)模型,将参与者分为3类,包括低,中间,和高慢性压力水平。使用接收器工作特性曲线下的面积评估模型的性能,精度,召回,特异性,和F1得分。此外,使用Shapley加法扩张来解释预测XGBoost模型并确定保护免受慢性压力的因素。
    结果:多类XGBoost模型显示了宏观平均分数,接收器工作特性曲线下面积为81%,精度为63%,召回52%,特异性为78%,F1得分为54%。低水平慢性压力的最重要特征是男性,良好的整体健康,对生活空间的高度满意,强大的社会支持。
    结论:本研究为德国成年人的慢性应激提供了一个多类可解释的预测模型。可解释的人工智能技术Shapley加法扩张确定了慢性压力的相关保护因素,在制定减少慢性压力的干预措施时需要考虑这一点。
    BACKGROUND: Chronic stress is highly prevalent in the German population. It has known adverse effects on mental health, such as burnout and depression. Known long-term effects of chronic stress are cardiovascular disease, diabetes, and cancer.
    OBJECTIVE: This study aims to derive an interpretable multiclass machine learning model for predicting chronic stress levels and factors protecting against chronic stress based on representative nationwide data from the German Health Interview and Examination Survey for Adults, which is part of the national health monitoring program.
    METHODS: A data set from the German Health Interview and Examination Survey for Adults study including demographic, clinical, and laboratory data from 5801 participants was analyzed. A multiclass eXtreme Gradient Boosting (XGBoost) model was constructed to classify participants into 3 categories including low, middle, and high chronic stress levels. The model\'s performance was evaluated using the area under the receiver operating characteristic curve, precision, recall, specificity, and the F1-score. Additionally, SHapley Additive exPlanations was used to interpret the prediction XGBoost model and to identify factors protecting against chronic stress.
    RESULTS: The multiclass XGBoost model exhibited the macroaverage scores, with an area under the receiver operating characteristic curve of 81%, precision of 63%, recall of 52%, specificity of 78%, and F1-score of 54%. The most important features for low-level chronic stress were male gender, very good general health, high satisfaction with living space, and strong social support.
    CONCLUSIONS: This study presents a multiclass interpretable prediction model for chronic stress in adults in Germany. The explainable artificial intelligence technique SHapley Additive exPlanations identified relevant protective factors for chronic stress, which need to be considered when developing interventions to reduce chronic stress.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在城市地区使用除冰器,在跑道和飞机上引起了人们对其环境影响的担忧。了解融冰机制对于开发环保除冰剂至关重要,但它仍然具有挑战性。这项研究采用机器学习来研究21盐和16有机溶剂作为除冰剂的冰渗透能力(IPC)。使用极端梯度增强(XGBoost)和Shapley添加剂解释(SHAP)分析了其IPC与各种物理性质之间的关系。确定了三个关键的融冰机制:(1)冰点降低,(2)除冰剂与H2O分子之间的相互作用;(3)离子渗入冰晶。SHAP分析揭示了盐和有机溶剂的不同融冰因素和机理,表明两者结合的潜在优势。丙二醇(PG)和甲酸钠的混合物表现出优异的环境影响和IPC。与六种市售除冰剂相比,PG和甲酸钠混合物表现出更高的IPC,为可持续除冰应用提供承诺。这项研究为融冰过程提供了有价值的见解,并提出了一种有效的,结合了有机溶剂和盐的优点的环保除冰器,为更可持续的除冰实践铺平道路。
    The use of deicers in urban areas, on runways and aircrafts has raised concerns about their environmental impact. Understanding the ice-melting mechanism is crucial for developing environmentally friendly deicers, yet it remains challenging. This study employs machine learning to investigate the ice penetration capacity (IPC) of 21 salts and 16 organic solvents as deicers. Relationships between their IPC and various physical properties were analysed using extreme gradient boosting (XGBoost) and Shapley additive explanation (SHAP). Three key ice-melting mechanisms were identified: (1) freezing-point depression, (2) interactions between deicers and H2O molecules and (3) infiltration of ions into ice crystals. SHAP analysis revealed different ice-melting factors and mechanisms for salts and organic solvents, suggesting a potential advantage in combining the two. A mixture of propylene glycol (PG) and sodium formate demonstrated superior environmental impact and IPC. The PG and sodium formate mixture exhibited higher IPC when compared to six commercially available deicers, offering promise for sustainable deicing applications. This study provides valuable insights into the ice-melting process and proposes an effective, environmentally friendly deicer that combines the strengths of organic solvents and salts, paving the way for more sustainable practices in deicing.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:COVID-19病例死亡率(CFRs)存在显著的地理不平等,从全球角度全面了解其国家一级的决定因素是必要的。本研究旨在量化COVID-19CFR的特定国家风险,并提出量身定制的应对策略,包括疫苗接种策略,在156个国家。
    方法:自1月28日起,使用极端梯度增强(XGBoost)确定了COVID-19CFR的跨时间和跨国家变化,包括来自156个国家七个维度的35个因子,2020年1月31日,2022年。使用SHapley加法扩张(SHAP)进一步阐明了驱动CFR的关键因素和每个国家并发风险因素的影响。模拟了疫苗接种率的增加,以说明不同类别国家的CFR降低。
    结果:从2020年1月28日至2022年1月31日,COVID-19总体CFR在各国之间有所不同,范围为每100,000人口68至6373。在COVID-19大流行期间,CFR的决定因素首先从健康状况转变为全民健康覆盖,然后以疫苗接种为主的多因素混合效应。在奥米米周期,根据风险决定因素将国家分为五类。低疫苗接种驱动类(70个国家)主要分布在撒哈拉以南非洲和拉丁美洲,包括大多数低收入国家(95.7%),这些国家有许多并发风险因素。老龄化驱动类(26个国家)主要分布在欧洲高收入国家。高疾病负担类(32个国家)主要分布在亚洲和北美。低GDP驱动的阶层(14个国家)分散在各大洲。模拟疫苗接种率增加5%,导致低疫苗接种驱动类和高疾病负担驱动类的CFR降低31.2%和15.0%,分别,总体风险高的国家的CFR降低幅度更大(SHAP值>0.1),但老龄化驱动的阶层只有3.1%。
    结论:这项研究的证据表明,COVID-19CFR的地理不平等是由关键和并发风险共同决定的,实现降低COVID-19CFR需要的不仅仅是增加疫苗接种覆盖率,而是基于特定国家风险的有针对性的干预策略。
    BACKGROUND: There are significant geographic inequities in COVID-19 case fatality rates (CFRs), and comprehensive understanding its country-level determinants in a global perspective is necessary. This study aims to quantify the country-specific risk of COVID-19 CFR and propose tailored response strategies, including vaccination strategies, in 156 countries.
    METHODS: Cross-temporal and cross-country variations in COVID-19 CFR was identified using extreme gradient boosting (XGBoost) including 35 factors from seven dimensions in 156 countries from 28 January, 2020 to 31 January, 2022. SHapley Additive exPlanations (SHAP) was used to further clarify the clustering of countries by the key factors driving CFR and the effect of concurrent risk factors for each country. Increases in vaccination rates was simulated to illustrate the reduction of CFR in different classes of countries.
    RESULTS: Overall COVID-19 CFRs varied across countries from 28 Jan 2020 to 31 Jan 31 2022, ranging from 68 to 6373 per 100,000 population. During the COVID-19 pandemic, the determinants of CFRs first changed from health conditions to universal health coverage, and then to a multifactorial mixed effect dominated by vaccination. In the Omicron period, countries were divided into five classes according to risk determinants. Low vaccination-driven class (70 countries) mainly distributed in sub-Saharan Africa and Latin America, and include the majority of low-income countries (95.7%) with many concurrent risk factors. Aging-driven class (26 countries) mainly distributed in high-income European countries. High disease burden-driven class (32 countries) mainly distributed in Asia and North America. Low GDP-driven class (14 countries) are scattered across continents. Simulating a 5% increase in vaccination rate resulted in CFR reductions of 31.2% and 15.0% for the low vaccination-driven class and the high disease burden-driven class, respectively, with greater CFR reductions for countries with high overall risk (SHAP value > 0.1), but only 3.1% for the ageing-driven class.
    CONCLUSIONS: Evidence from this study suggests that geographic inequities in COVID-19 CFR is jointly determined by key and concurrent risks, and achieving a decreasing COVID-19 CFR requires more than increasing vaccination coverage, but rather targeted intervention strategies based on country-specific risks.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:在肌少症筛查的实际应用中,需要更快,节省时间,和社区友好的检测方法。这项研究的主要目的是在社区居住的老年人中进行肌肉减少症筛查,并研究是否可以使用机器学习(ML)方法从sEMG信号中提取合理的特征来检测肌肉减少症。次要目的是使用新颖的特征重要性估计方法提供所获得的ML模型的可解释性。
    方法:共招募158名社区老年居民(≥60岁)。经过2019年亚洲工作组(AWGS2019)的诊断标准和数据质量检查,参与者被分为健康组(n=45)和节肌组(n=48).在手握力任务期间,以20%的最大自愿收缩(MVC)和50%的MVC记录了来自六个前臂肌肉的sEMG信号。过滤记录的信号后,提取了九个代表性特征,包括六个时域特征和三个时频域特征。然后,由支持向量机(SVM)集成的投票分类器,随机森林(RF),并实施梯度增强机(GBM)对健康参与者和减少肌肉的参与者进行分类.最后,Shapley加性移植(SHAP)方法用于研究分类过程中的特征重要性。
    结果:在20%和50%的MVC测试中,9个特征中有7个在健康和少肌症参与者之间表现出统计学上的显着差异。使用这些功能,通过5倍交叉验证,投票分类器的灵敏度达到80%,准确率达到73%.这样的性能优于每个SVM,射频,和GBM模型单独。最后,SHAP结果表明,波长(WL)和连续小波变换系数的峰度(CWT_kurtosis)具有最高的特征影响得分。
    结论:这项研究提出了一种使用前臂肌肉的sEMG信号进行基于社区的肌肉减少症筛查的方法。使用具有九个代表性特征的投票分类器,准确度超过70%,灵敏度超过75%,表示中等分类性能。从SHAP模型获得的可解释结果表明,运动单位(MU)激活模式可能是影响肌肉减少症的关键因素。
    BACKGROUND: In the practical application of sarcopenia screening, there is a need for faster, time-saving, and community-friendly detection methods. The primary purpose of this study was to perform sarcopenia screening in community-dwelling older adults and investigate whether surface electromyogram (sEMG) from hand grip could potentially be used to detect sarcopenia using machine learning (ML) methods with reasonable features extracted from sEMG signals. The secondary aim was to provide the interpretability of the obtained ML models using a novel feature importance estimation method.
    METHODS: A total of 158 community-dwelling older residents (≥ 60 years old) were recruited. After screening through the diagnostic criteria of the Asian Working Group for Sarcopenia in 2019 (AWGS 2019) and data quality check, participants were assigned to the healthy group (n = 45) and the sarcopenic group (n = 48). sEMG signals from six forearm muscles were recorded during the hand grip task at 20% maximal voluntary contraction (MVC) and 50% MVC. After filtering recorded signals, nine representative features were extracted, including six time-domain features plus three time-frequency domain features. Then, a voting classifier ensembled by a support vector machine (SVM), a random forest (RF), and a gradient boosting machine (GBM) was implemented to classify healthy versus sarcopenic participants. Finally, the SHapley Additive exPlanations (SHAP) method was utilized to investigate feature importance during classification.
    RESULTS: Seven out of the nine features exhibited statistically significant differences between healthy and sarcopenic participants in both 20% and 50% MVC tests. Using these features, the voting classifier achieved 80% sensitivity and 73% accuracy through a five-fold cross-validation. Such performance was better than each of the SVM, RF, and GBM models alone. Lastly, SHAP results revealed that the wavelength (WL) and the kurtosis of continuous wavelet transform coefficients (CWT_kurtosis) had the highest feature impact scores.
    CONCLUSIONS: This study proposed a method for community-based sarcopenia screening using sEMG signals of forearm muscles. Using a voting classifier with nine representative features, the accuracy exceeds 70% and the sensitivity exceeds 75%, indicating moderate classification performance. Interpretable results obtained from the SHAP model suggest that motor unit (MU) activation mode may be a key factor affecting sarcopenia.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    评估老年高血压卒中患者的健康相关生活质量(HRQoL)状况,为了了解影响它的因素,并为制定卫生干预政策提供依据。
    这项研究使用EQ-5D-3L量表评估了与高血压相关的中风的老年患者的HRQoL。采用各种分析方法来检查影响患者生活质量的因素。单变量分析,Tobit回归,随机森林,应用XGBoost模型分析患者的HRQoL。此外,来解释机器学习的结果,使用SHAP方法。这种方法涉及评估每个特征的重要性,检查每个特征对单个样本预测结果的影响,并确定个体特征对整体预测的影响。
    研究发现,老年高血压卒中患者的健康效用值中位数为0.427,四分位数间范围为0.186至0.745。Tobit回归模型的结果,随机森林,与XGBoost模型进行了比较。模型评估结果表明,机器学习模型和Tobit回归模型的性能差异不大。XGBoost模型的性能相对于随机森林模型略好。强烈影响患者健康效用价值的因素包括BMI,社会活动,吸烟,教育水平,酒精消费,城市/农村住宅,年收入,身体活动水平,晚上的睡眠时间。
    高血压卒中患者与健康相关的生活质量受多种因素的影响。通过修改这些因素并调整生活方式,可以对健康相关的生活质量产生积极影响。保持健康的体重,社会活跃,戒烟,改善生活条件,建议增加体力活动水平和获得足够的睡眠。需要根据具体情况和医疗建议为每个人制定生活方式的改变。
    UNASSIGNED: To evaluate the health-related quality of life(HRQoL)status of elderly patients with hypertensive stroke, to understand the factors influencing it, and to provide a basis for the development of health intervention policies.
    UNASSIGNED: This study used the EQ-5D-3L scale to assess the HRQoL among elderly patients who experienced a stroke related to high blood pressure. Various analytical methods were employed to examine the factors that influenced the patient\'s quality of life. Univariate analysis, Tobit regression, random forest, and XGBoost models were applied to analyze the HRQoL of the patients. Furthermore, to interpret the machine learning results, the SHAP method was utilized. This method involved assessing the importance of each feature, examining the effect of each feature on the prediction result of a single sample, and determining the impact of individual features on the overall prediction.
    UNASSIGNED: The study found that the median health utility value for elderly patients with hypertensive stroke was 0.427, with an interquartile range of 0.186 to 0.745. The results of the Tobit regression model, Random Forest, and XGBoost model were compared. The results of the model evaluation show that the performance of the machine learning model and the Tobit regression model are not very different. The XGBoost model performs slightly better relative to the random forest model. The factors that strongly influenced the health utility value of patients included BMI, social activities, smoking, education level, alcohol consumption, urban/rural residence, annual income, physical activity level, and hours of sleep at night.
    UNASSIGNED: Health-related quality of life in hypertensive stroke patients is influenced by a variety of factors. Health-related quality of life can be positively influenced by modifying these factors and making lifestyle adjustments. Maintaining a healthy weight, being socially active, quitting smoking, improving living conditions, increasing physical activity levels and getting enough sleep are recommended. Lifestyle changes need to be developed for each individual on a case-by-case basis and by medical advice.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号