Shapley Additive Explanations

Shapley 添加剂 explanations
  • 文章类型: Journal Article
    胰腺癌是消化道恶性肿瘤之一,死亡率高。尽管有广泛的可用治疗方法和手术改进,化疗,和放射治疗,诊断为胰腺癌的个体的5年预后仍然较差.仍有研究要做,看看免疫疗法是否可以用于治疗胰腺癌。我们的研究目标是了解胰腺癌的肿瘤微环境,发现了一种有用的生物标志物来评估患者的预后,并调查了其生物学相关性。在本文中,将随机森林等机器学习方法与加权基因共表达网络融合,用于筛选枢纽免疫相关基因(hub-IRGs)。采用LASSO回归模型进行进一步研究。因此,我们有八个中心IRG。基于hub-IRG,我们创建了PAAD的预后风险预测模型,该模型可以准确地进行分层,并为每位患者提供预后风险评分(IRG_Score).在原始数据集和验证数据集中,该模型的5年曲线下面积(AUC)分别为0.9和0.7.并且shapley加性解释(SHAP)从机器学习的角度刻画了预后风险预测影响因素的重要性,以获得最有影响力的某一基因(或临床因素)。五个最重要的因素是TRIM67,CORT,PSPN,SCAMP5,RFXAP,所有这些都是基因。总之,八个枢纽-IRG具有准确的风险预测性能和生物学意义,这在其他癌症中得到了验证。SHAP的结果有助于了解胰腺癌的分子机制。
    Pancreatic cancer is one of digestive tract cancers with high mortality rate. Despite the wide range of available treatments and improvements in surgery, chemotherapy, and radiation therapy, the five-year prognosis for individuals diagnosed pancreatic cancer remains poor. There is still research to be done to see if immunotherapy may be used to treat pancreatic cancer. The goals of our research were to comprehend the tumor microenvironment of pancreatic cancer, found a useful biomarker to assess the prognosis of patients, and investigated its biological relevance. In this paper, machine learning methods such as random forest were fused with weighted gene co-expression networks for screening hub immune-related genes (hub-IRGs). LASSO regression model was used to further work. Thus, we got eight hub-IRGs. Based on hub-IRGs, we created a prognosis risk prediction model for PAAD that can stratify accurately and produce a prognostic risk score (IRG_Score) for each patient. In the raw data set and the validation data set, the five-year area under the curve (AUC) for this model was 0.9 and 0.7, respectively. And shapley additive explanation (SHAP) portrayed the importance of prognostic risk prediction influencing factors from a machine learning perspective to obtain the most influential certain gene (or clinical factor). The five most important factors were TRIM67, CORT, PSPN, SCAMP5, RFXAP, all of which are genes. In summary, the eight hub-IRGs had accurate risk prediction performance and biological significance, which was validated in other cancers. The result of SHAP helped to understand the molecular mechanism of pancreatic cancer.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    全球COVID-19大流行仍在持续,COVID-19年龄调整后病死率(CFRs)的跨国和跨期变化尚未明确。这里,我们旨在确定加强疫苗接种的国家特定影响和其他可能影响全球范围内年龄调整的CFR异质性的特征,并预测提高加强疫苗接种率对未来CFR的益处。
    使用最新可用的数据库,在32个国家/地区确定了CFR的跨时间和跨国家变化,具有多重特征(疫苗接种覆盖率,人口特征,疾病负担,行为风险,环境风险,健康服务和信任)使用极端梯度提升(XGBoost)算法和Shapley加法扩张(SHAP)。之后,确定了影响年龄调整后CFR的特定国家风险特征。通过在每个国家将加强疫苗接种增加1-30%来模拟加强对年龄调整后的CFR的益处。
    从2020年2月4日至2022年1月31日,32个国家的COVID-19年龄调整后的CFR总体范围从每100,000例110例死亡到每100,000例5112例死亡,分别分为年龄调整后的CFR高于粗CFR的国家和年龄调整后的CFR低于粗CFR的国家(n=9和n=23)。从Alpha到Omicron期,加强疫苗接种对年龄调整后的CFR的影响变得更加重要(重要性评分:0.03-0.23)。Omicron周期模型表明,年龄调整后的CFR高于粗CFR的国家的关键风险因素是人均GDP低和加强疫苗接种率低,而年龄调整后CFR高于粗CFR的国家的关键风险因素是高饮食风险和低体力活动。将加强疫苗接种率提高7%将降低所有年龄调整后的CFRs高于粗CFRs的国家的CFRs。
    加强疫苗接种在降低年龄调整后的CFR方面仍然发挥着重要作用,尽管存在多层面的并发风险因素,但精确的联合干预策略和基于特定国家风险的准备工作也至关重要。
    The global COVID-19 pandemic is still ongoing, and cross-country and cross-period variation in COVID-19 age-adjusted case fatality rates (CFRs) has not been clarified. Here, we aimed to identify the country-specific effects of booster vaccination and other features that may affect heterogeneity in age-adjusted CFRs with a worldwide scope, and to predict the benefit of increasing booster vaccination rate on future CFR.
    Cross-temporal and cross-country variations in CFR were identified in 32 countries using the latest available database, with multi-feature (vaccination coverage, demographic characteristics, disease burden, behavioral risks, environmental risks, health services and trust) using Extreme Gradient Boosting (XGBoost) algorithm and SHapley Additive exPlanations (SHAP). After that, country-specific risk features that affect age-adjusted CFRs were identified. The benefit of booster on age-adjusted CFR was simulated by increasing booster vaccination by 1-30% in each country.
    Overall COVID-19 age-adjusted CFRs across 32 countries ranged from 110 deaths per 100,000 cases to 5,112 deaths per 100,000 cases from February 4, 2020 to Jan 31, 2022, which were divided into countries with age-adjusted CFRs higher than the crude CFRs and countries with age-adjusted CFRs lower than the crude CFRs (n = 9 and n = 23) when compared with the crude CFR. The effect of booster vaccination on age-adjusted CFRs becomes more important from Alpha to Omicron period (importance scores: 0.03-0.23). The Omicron period model showed that the key risk factors for countries with higher age-adjusted CFR than crude CFR are low GDP per capita and low booster vaccination rates, while the key risk factors for countries with higher age-adjusted CFR than crude CFR were high dietary risks and low physical activity. Increasing booster vaccination rates by 7% would reduce CFRs in all countries with age-adjusted CFRs higher than the crude CFRs.
    Booster vaccination still plays an important role in reducing age-adjusted CFRs, while there are multidimensional concurrent risk factors and precise joint intervention strategies and preparations based on country-specific risks are also essential.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    系统地了解生态系统服务(ESs)的驱动机制及其之间的关系对于成功的生态系统管理至关重要。然而,驱动因素对ESs与生态系统服务束(ESB)形成之间关系的影响尚不清楚。为了解决这个差距,我们开发了一个建模过程,使用随机森林(RF)对浙江省的ESs和ESB进行建模,中国,在回归和分类模式中,分别,和Shapley加法解释(SHAP)方法来解释潜在的驱动力。我们首先以1×1km的空间分辨率绘制了浙江省7个ESs的空间分布图,然后使用K均值聚类算法获得了4个ESB。将RF模型与SHAP分析相结合,结果表明,每个ES都有关键的驱动因素,关键因子的驱动方向和强度决定了ESs之间的协同和权衡关系。驱动因素影响ESs的关系,从而影响ESB的形成。因此,管理主导驱动因素是提高ESs供应能力的关键。
    A systematic understanding of the driving mechanisms of ecosystem services (ESs) and the relationships among them is critical for successful ecosystem management. However, the impact of driving factors on the relationships between ESs and the formation of ecosystem service bundles (ESBs) remains unclear. To address this gap, we developed a modeling process that used random forest (RF) to model the ESs and ESBs of Zhejiang Province, China, in regression and classification mode, respectively, and the Shapley Additive Explanations (SHAP) method to interpret the underlying driving forces. We first mapped the spatial distribution of seven ESs in Zhejiang Province at a 1 × 1 km spatial resolution and then used the K-means clustering algorithm to obtain four ESBs. Combining the RF models with SHAP analysis, the results showed that each ES had key driving factors, and the relationships of synergy and trade-off between ESs were determined by the driving direction and intensity of the key factors. The driving factors affect the relationships of ESs and consequently affect the formation of ESBs. Thus, managing the dominant drivers is key to improving the supply capacity of ESs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号