Shapley Additive Explanations

Shapley 添加剂 explanations
  • 文章类型: Journal Article
    Radionuclide diffusion will be influenced by numerous factors. Establishing a model that can elucidate the internal correlation between mesoscopic diffusion and the microscopic structure of bentonite can enhance the comprehension of radionuclide diffusion mechanisms. In this study, a light gradient boosting machine (LightGBM) was employed to predict the effective diffusion coefficients of HCrO4-, I-, and CoEDTA2- in bentonite. The model\'s hyperparameters were optimized using the particle swarm optimization (PSO) algorithm. Several correlated physical quantities, such as mesoscopic parameters (total porosity, rock capacity factor, and ion molar conductivity) and microscopic parameters (ionic radius and montmorillonite stacking number) were incorporated to develop a machine learning model that incorporated micro- and meso-scale features. The predictive performance of PSO-LightGBM was verified using diffusion experiments, which investigated the diffusion of HCrO4-, I-, and CoEDTA2- at compacted dry densities of 1200-1800 kg/m3 using a through-diffusion method. Spearman correlation and Shapley additive explanation analyses revealed that the compacted dry density, ionic diffusion coefficient in water, ionic radius, and total porosity were the top-four influencing factors among the 16 input features. Partial dependence plot analysis elucidated the relationship between the effective diffusion coefficient and each input feature. The analysis results were consistent with the experimental findings, demonstrating the reliability of machine learning. Due to the incorporation of multi-scale features, the PSO-LightGBM model demonstrated enhanced predictive accuracy, linking the microstructure of bentonite to radionuclide diffusion, and providing a comprehensive interpretation of the diffusion mechanism.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    随着全球甲状腺结节患病率的增加,这项研究调查了使用蓝牙耳机与甲状腺结节发生率之间的潜在相关性,考虑到这些设备发出的非电离辐射(NIR)的累积效应。在这项研究中,我们使用倾向得分匹配(PSM)和XGBOOST模型分析了来自WenJuanXing平台的600份有效问卷,辅以SHAP分析,评估甲状腺结节的风险。PSM用于平衡基线特征差异,从而减少偏差。然后采用XGBOOST模型来预测风险因素,模型功效由接收器工作特征(ROC)曲线(AUC)下面积测量。SHAP分析有助于量化和解释每个特征对预测结果的影响,确定关键风险因素。最初,来自文娟兴平台的600份有效问卷进行了PSM处理,产生96个案例的匹配数据集用于建模分析。XGBOOST模型的AUC值达到0.95,在区分甲状腺结节风险方面具有较高的准确性。SHAP分析显示,年龄和每日蓝牙耳机使用时间是影响甲状腺结节风险的两个最重要因素。具体来说,延长每天使用蓝牙耳机的持续时间与发生甲状腺结节的风险增加密切相关,如SHAP分析结果所示。我们的研究强调了长时间使用蓝牙耳机与甲状腺结节风险增加之间的显着影响关系,强调在使用现代技术时考虑健康影响的重要性,特别是对于经常使用的蓝牙耳机等设备。通过精确的模型预测和变量重要性分析,我们的研究为制定公共卫生政策和个人卫生习惯选择提供了科学依据,建议在日常生活中应注意蓝牙耳机的使用时间,以降低甲状腺结节的潜在风险。未来的研究应进一步探讨这种关系的生物学机制,并考虑其他潜在的影响因素,以提供更全面的健康指导和预防措施。
    With an increasing prevalence of thyroid nodules globally, this study investigates the potential correlation between the use of Bluetooth headsets and the incidence of thyroid nodules, considering the cumulative effects of non-ionizing radiation (NIR) emitted by these devices. In this study, we analyzed 600 valid questionnaires from the WenJuanXing platform using Propensity Score Matching (PSM) and the XGBOOST model, supplemented by SHAP analysis, to assess the risk of thyroid nodules. PSM was utilized to balance baseline characteristic differences, thereby reducing bias. The XGBOOST model was then employed to predict risk factors, with model efficacy measured by the area under the Receiver Operating Characteristic (ROC) curve (AUC). SHAP analysis helped quantify and explain the impact of each feature on the prediction outcomes, identifying key risk factors. Initially, 600 valid questionnaires from the WenJuanXing platform underwent PSM processing, resulting in a matched dataset of 96 cases for modeling analysis. The AUC value of the XGBOOST model reached 0.95, demonstrating high accuracy in differentiating thyroid nodule risks. SHAP analysis revealed age and daily Bluetooth headset usage duration as the two most significant factors affecting thyroid nodule risk. Specifically, longer daily usage durations of Bluetooth headsets were strongly linked to an increased risk of developing thyroid nodules, as indicated by the SHAP analysis outcomes. Our study highlighted a significant impact relationship between prolonged Bluetooth headset use and increased thyroid nodule risk, emphasizing the importance of considering health impacts in the use of modern technology, especially for devices like Bluetooth headsets that are frequently used daily. Through precise model predictions and variable importance analysis, our research provides a scientific basis for the formulation of public health policies and personal health habit choices, suggesting that attention should be paid to the duration of Bluetooth headset use in daily life to reduce the potential risk of thyroid nodules. Future research should further investigate the biological mechanisms of this relationship and consider additional potential influencing factors to offer more comprehensive health guidance and preventive measures.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    卤化物钙钛矿材料在太阳能电池等各个领域有着广阔的应用前景,LED器件,光电探测器,荧光标记,生物成像,和光催化由于它们的带隙特性。本研究从已发表的文献中收集了实验数据,并利用了出色的预测能力,低过度拟合风险,和集成学习模型的强鲁棒性分析卤化物钙钛矿化合物的带隙。结果证明了集成学习决策树模型的有效性,特别是梯度提升决策树模型,均方根误差为0.090eV,平均绝对误差为0.053eV,决定系数为93.11%。对与通过元素摩尔量归一化计算的比率相关的数据的研究表明,X和B位置的离子对带隙有重大影响。此外,掺杂碘原子可以有效降低本征带隙,而锡原子的s和p轨道的杂化也可以降低带隙。通过预测光伏材料MASn1-xPbxI3的带隙来验证模型的准确性。总之,这项研究强调了机器学习对材料开发的积极影响,特别是在预测卤化物钙钛矿化合物的带隙时,其中集成学习方法显示出显著的优势。
    Halide perovskite materials have broad prospects for applications in various fields such as solar cells, LED devices, photodetectors, fluorescence labeling, bioimaging, and photocatalysis due to their bandgap characteristics. This study compiled experimental data from the published literature and utilized the excellent predictive capabilities, low overfitting risk, and strong robustness of ensemble learning models to analyze the bandgaps of halide perovskite compounds. The results demonstrate the effectiveness of ensemble learning decision tree models, especially the gradient boosting decision tree model, with a root mean square error of 0.090 eV, a mean absolute error of 0.053 eV, and a determination coefficient of 93.11%. Research on data related to ratios calculated through element molar quantity normalization indicates significant influences of ions at the X and B positions on the bandgap. Additionally, doping with iodine atoms can effectively reduce the intrinsic bandgap, while hybridization of the s and p orbitals of tin atoms can also decrease the bandgap. The accuracy of the model is validated by predicting the bandgap of the photovoltaic material MASn1-xPbxI3. In conclusion, this study emphasizes the positive impact of machine learning on material development, especially in predicting the bandgaps of halide perovskite compounds, where ensemble learning methods demonstrate significant advantages.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    这项回顾性研究的目的是建立基于超声(US)-影像组学和临床因素的组合模型,以预测手术前I期宫颈癌(CC)的患者。
    回顾性分析安徽医科大学第一附属医院经阴道超声检查(TVS)发现宫颈病变的209例CC患者,患者分为训练集(n=146)和内部验证集(n=63),以安徽省妇幼保健院和南充市中心医院的52例CC患者作为外部验证集。通过单因素和多因素逻辑回归分析选择临床独立预测因子。从美国图像中提取美国影像组学特征。通过单变量分析选择最重要的特征后,斯皮尔曼相关分析,和最小绝对收缩和选择算子(LASSO)算法,使用六种机器学习(ML)算法来构建影像组学模型。接下来,临床能力,美国放射组学,并将临床US-影像组学联合模型与诊断I期CC进行了比较。最后,Shapley加法解释(SHAP)方法用于解释每个特征的贡献。
    宫颈病变的长径(L)和鳞状细胞癌相关抗原(SCCa)是I期CC的独立临床预测因子。极限梯度提升(Xgboost)模型在六个ML影像组学模型中表现最好,训练中的曲线下面积(AUC)值,内部验证,和外部验证集分别为0.778、0.751和0.751。在最后三个模型中,基于临床特征和rad评分的组合模型显示出良好的判别力,在训练中使用AUC值,内部验证,外部验证集分别为0.837、0.828和0.839。决策曲线分析验证了组合列线图的临床实用性。SHAP算法说明了组合模型中每个特征的贡献。
    我们建立了一个可解释的组合模型来预测I阶段CC。这种非侵入性预测方法可用于I期CC患者的术前识别。
    UNASSIGNED: The purpose of this retrospective study was to establish a combined model based on ultrasound (US)-radiomics and clinical factors to predict patients with stage I cervical cancer (CC) before surgery.
    UNASSIGNED: A total of 209 CC patients who had cervical lesions found by transvaginal sonography (TVS) from the First Affiliated Hospital of Anhui Medical University were retrospectively reviewed, patients were divided into the training set (n = 146) and internal validation set (n = 63), and 52 CC patients from Anhui Provincial Maternity and Child Health Hospital and Nanchong Central Hospital were taken as the external validation set. The clinical independent predictors were selected by univariate and multivariate logistic regression analyses. US-radiomics features were extracted from US images. After selecting the most significant features by univariate analysis, Spearman\'s correlation analysis, and the least absolute shrinkage and selection operator (LASSO) algorithm, six machine learning (ML) algorithms were used to build the radiomics model. Next, the ability of the clinical, US-radiomics, and clinical US-radiomics combined model was compared to diagnose stage I CC. Finally, the Shapley additive explanations (SHAP) method was used to explain the contribution of each feature.
    UNASSIGNED: Long diameter of the cervical lesion (L) and squamous cell carcinoma-associated antigen (SCCa) were independent clinical predictors of stage I CC. The eXtreme Gradient Boosting (Xgboost) model performed the best among the six ML radiomics models, with area under the curve (AUC) values in the training, internal validation, and external validation sets being 0.778, 0.751, and 0.751, respectively. In the final three models, the combined model based on clinical features and rad-score showed good discriminative power, with AUC values in the training, internal validation, and external validation sets being 0.837, 0.828, and 0.839, respectively. The decision curve analysis validated the clinical utility of the combined nomogram. The SHAP algorithm illustrates the contribution of each feature in the combined model.
    UNASSIGNED: We established an interpretable combined model to predict stage I CC. This non-invasive prediction method may be used for the preoperative identification of patients with stage I CC.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    早期检测老年人的认知能力下降对于有效干预至关重要。这项研究,马鞍山健康老龄化队列研究的一部分,检查了2288名认知功能正常的参与者。42个潜在的预测因子,包括人口统计,慢性疾病,生活方式因素,和基线认知功能,被选中。数据集分为训练,验证,和测试集(60%,20%,20%,分别)。递归特征消除(RFE)和六种机器学习算法用于模型开发。使用曲线下面积(AUC)评估模型性能,特异性,灵敏度,和准确性。沙普利附加扩张(SHAP)被应用于可解释性,揭示了十大有影响力的特征:基线MMSE,教育,经济地位,社会活动,PSQI,BMI,SBP,DBP,IADL,和年龄。基于朴素贝叶斯(NB)算法的模型在测试集上实现了0.820(95%CI0.773-0.887)的AUC,优于其他算法。该模型可以帮助社区环境中的初级卫生保健人员识别出老年人中三年内患认知障碍风险较高的个体。
    BACKGROUND: The prevalence of cognitive impairment and dementia in the older population is increasing, and thereby, early detection of cognitive decline is essential for effective intervention.
    METHODS: This study included 2,288 participants with normal cognitive function from the Ma\'anshan Healthy Aging Cohort Study. Forty-two potential predictors, including demographic characteristics, chronic diseases, lifestyle factors, anthropometric indices, physical function, and baseline cognitive function, were selected based on clinical importance and previous research. The dataset was partitioned into training, validation, and test sets in a proportion of 60% for training, 20% for validation, and 20% for testing, respectively. Recursive feature elimination was used for feature selection, followed by six machine learning algorithms that were employed for model development. The performance of the models was evaluated using area under the curve (AUC), specificity, sensitivity, and accuracy. Moreover, SHapley Additive exPlanations (SHAP) was conducted to access the interpretability of the final selected model and to gain insights into the impact of features on the prediction outcomes. SHAP force plots were established to vividly show the application of the prediction model at the individual level.
    RESULTS: The final predictive model based on the Naive Bayes algorithm achieved an AUC of 0.820 (95% CI, 0.773-0.887) on the test set, outperforming other algorithms. The top ten influential features in the model included baseline Mini-Mental State Examination (MMSE), education, self-reported economic status, collective or social activities, Pittsburgh sleep quality index (PSQI), body mass index, systolic blood pressure, diastolic blood pressure, instrumental activities of daily living, and age. The model demonstrated the potential to identify individuals at a higher risk of cognitive impairment within 3 years from older adults.
    CONCLUSIONS: The predictive model developed in this study contributes to the early detection of cognitive impairment in older adults by primary healthcare staff in community settings.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:开发和比较基于三相对比增强CT(CECT)的机器学习模型,以区分良性和恶性肾脏肿瘤。
    方法:总共,427名患者来自两个医疗中心:中心1(用作训练集)和中心2(用作外部验证集)。首先,从皮质髓质期(CP)中单独提取1781个放射学特征,肾图相位(NP),和排泄期(EP)CECT图像,之后,通过最小冗余最大相关性方法选择10个特征。第二,随机森林(RF)模型由单相特征(CP,NP,和EP)以及来自所有三个阶段(TP)的特征组合。第三,在训练集和外部验证集中评估RF模型.最后,模型的内部预测机制由SHapley加法扩张(SHAP)方法解释。
    结果:共纳入了来自中心1的266例肾脏肿瘤患者和来自中心2的161例患者。在训练集中,从CP构建的RF模型的AUC,NP,EP,TP特征分别为0.886、0.912、0.930和0.944。在外部验证集中,模型的AUC分别为0.860,0.821,0.921和0.908.根据SHAP方法,“original_shape_flatness”特征在基于EP特征的RF模型的预测结果中起着最重要的作用。
    结论:四种RF模型可有效区分良性和恶性实体肾肿瘤,基于EP特征的RF模型显示最佳性能。
    BACKGROUND: To develop and compare machine learning models based on triphasic contrast-enhanced CT (CECT) for distinguishing between benign and malignant renal tumors.
    METHODS: In total, 427 patients were enrolled from two medical centers: Center 1 (serving as the training set) and Center 2 (serving as the external validation set). First, 1781 radiomic features were individually extracted from corticomedullary phase (CP), nephrographic phase (NP), and excretory phase (EP) CECT images, after which 10 features were selected by the minimum redundancy maximum relevance method. Second, random forest (RF) models were constructed from single-phase features (CP, NP, and EP) as well as from the combination of features from all three phases (TP). Third, the RF models were assessed in the training and external validation sets. Finally, the internal prediction mechanisms of the models were explained by the SHapley Additive exPlanations (SHAP) approach.
    RESULTS: A total of 266 patients with renal tumors from Center 1 and 161 patients from Center 2 were included. In the training set, the AUCs of the RF models constructed from the CP, NP, EP, and TP features were 0.886, 0.912, 0.930, and 0.944, respectively. In the external validation set, the models achieved AUCs of 0.860, 0.821, 0.921, and 0.908, respectively. The \"original_shape_Flatness\" feature played the most important role in the prediction outcome for the RF model based on EP features according to the SHAP method.
    CONCLUSIONS: The four RF models efficiently differentiated benign from malignant solid renal tumors, with the EP feature-based RF model displaying the best performance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    这项研究的重点是预测弯曲中由纤维增强聚合物(FRP)增强的钢筋混凝土梁中的混凝土覆盖层分离(CCS)。首先,基于线性回归构建机器学习模型,支持向量回归,BP神经网络,决策树,随机森林,和XGBoost算法。其次,根据评估指标确定了最适合的CCS预测模型,并将其与代码和研究人员的模型进行了比较。最后,进行了基于Shapley加法扩张(SHAP)的参数研究,并得出以下结论:XGBoost最适合于CCS和代码的预测,研究人员的模型精度需要提高,并且遭受过度或保守估计的困扰。混凝土对剪切力的贡献和钢筋的屈服强度是CCS最重要的参数,其中CCS开始时的剪切力与混凝土对剪切力的贡献大致成正比,与钢筋的屈服强度大致成反比。
    This study focuses on the prediction of concrete cover separation (CCS) in reinforced concrete beams strengthened by fiber-reinforced polymer (FRP) in flexure. First, machine learning models were constructed based on linear regression, support vector regression, BP neural networks, decision trees, random forests, and XGBoost algorithms. Secondly, the most suitable model for predicting CCS was identified based on the evaluation metrics and compared with the codes and the researcher\'s model. Finally, a parametric study based on SHapley Additive exPlanations (SHAP) was carried out, and the following conclusions were obtained: XGBoost is best-suited for the prediction of CCS and codes, and researchers\' model accuracy needs to be improved and suffers from over or conservative estimation. The contributions of the concrete to the shear force and the yield strength of the reinforcement are the most important parameters for the CCS, where the shear force at the onset of CCS is approximately proportional to the contribution of the concrete to the shear force and approximately inversely proportional to the yield strength of the reinforcement.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:急性肾损伤(AKI)不仅是脑梗死(CI)患者的并发症,而且是严重的威胁。本研究旨在探讨可解释机器学习算法在预测脑梗死患者AKI中的应用。
    方法:该研究包括丽水市中心医院重症监护病房和急诊医学收治的3,920例CI患者,浙江省。九种机器学习技术,包括XGBoost,物流,LightGBM,随机森林(RF),AdaBoost,GaussianNB(GNB),多层感知器(MLP),支持向量机(SVM),和k-最近邻(KNN)分类,用于开发这些患者的AKI预测模型。Shapley加法扩张(SHAP)分析为每位患者提供了视觉解释。最后,使用平均精度(AP)、灵敏度,特异性,准确度,F1得分,精度-召回(PR)曲线,校准图,和决策曲线分析(DCA)。
    结果:XGBoost模型在内部验证集和外部验证集中表现更好,AUC分别为0.940和0.887。模型中最重要的五个变量是,按顺序,肾小球滤过率,低密度脂蛋白,总胆固醇,偏瘫和血清钾。
    结论:本研究证明了可解释的机器学习算法在预测伴有AKI的CI患者中的潜力。
    BACKGROUND: Acute kidney injury (AKI) is not only a complication but also a serious threat to patients with cerebral infarction (CI). This study aimed to explore the application of interpretable machine learning algorithms in predicting AKI in patients with cerebral infarction.
    METHODS: The study included 3920 patients with CI admitted to the Intensive Care Unit and Emergency Medicine of the Central Hospital of Lishui City, Zhejiang Province. Nine machine learning techniques, including XGBoost, logistics, LightGBM, random forest (RF), AdaBoost, GaussianNB (GNB), Multi-Layer Perceptron (MLP), support vector machine (SVM), and k-nearest neighbors (KNN) classification, were used to develop a predictive model for AKI in these patients. SHapley Additive exPlanations (SHAP) analysis provided visual explanations for each patient. Finally, model effectiveness was assessed using metrics such as average precision (AP), sensitivity, specificity, accuracy, F1 score, precision-recall (PR) curve, calibration plot, and decision curve analysis (DCA).
    RESULTS: The XGBoost model performed better in the internal validation set and the external validation set, with an AUC of 0.940 and 0.887, respectively. The five most important variables in the model were, in order, glomerular filtration rate, low-density lipoprotein, total cholesterol, hemiplegia and serum kalium.
    CONCLUSIONS: This study demonstrates the potential of interpretable machine learning algorithms in predicting CI patients with AKI.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目的:本研究旨在建立一个机器学习(ML)模型,通过随机森林(RF)和XGBoost算法预测术后非哺乳期乳腺炎(NLM)的复发概率。它可以提供识别NLM复发风险和指导临床治疗计划的能力。
    方法:本研究以上海中医药大学附属曙光医院2019年7月至2021年12月收治的住院患者为研究对象。住院患者数据随访已完成至2022年12月。在这项研究中选择了十个特征来构建ML模型:年龄,体重指数(BMI),堕胎次数,乳头倒置的存在,乳房肿块的范围,白细胞计数(WBC),中性粒细胞与淋巴细胞比率(NLR),白蛋白-球蛋白比率(AGR)和甘油三酯(TG)以及术中排出的存在。我们使用两种ML方法(RF和XGBoost)来建立模型并预测女性患者的NLM复发风险。将258例患者按75%-25%的比例随机分为训练集和测试集。模型性能是基于准确性进行评估的,Precision,回想一下,F1评分和AUC。Shapley加法解释(SHAP)方法用于解释模型。
    结果:有48例(18.6%)NLM患者在随访期间出现复发。在这项研究中选择了十个特征来构建ML模型。对于RF模型,BMI是最重要的影响因素,对于XGBoost模型是术中出院。十倍交叉验证结果表明,RF模型和XGBoost模型均具有良好的预测性能,但是在我们的研究中,XGBoost模型比RF模型具有更好的性能。我们模型中所有特征的SHAP值的趋势与这些特征的临床表现的趋势一致。在模型中包含这十个特征对于建立实际的复发预测模型是必要的。
    结论:十倍交叉验证和SHAP值的结果表明模型具有预测能力。SHAP值的趋势在我们的模型中提供了辅助验证,并使其具有更多的临床意义。
    OBJECTIVE: This study aims to build a machine learning (ML) model to predict the recurrence probability for postoperative non-lactating mastitis (NLM) by Random Forest (RF) and XGBoost algorithms. It can provide the ability to identify the risk of NLM recurrence and guidance in clinical treatment plan.
    METHODS: This study was conducted on inpatients who were admitted to the Mammary Department of Shuguang Hospital affiliated to Shanghai University of Traditional Chinese Medicine between July 2019 to December 2021. Inpatient data follow-up has been completed until December 2022. Ten features were selected in this study to build the ML model: age, body mass index (BMI), number of abortions, presence of inverted nipples, extent of breast mass, white blood cell count (WBC), neutrophil to lymphocyte ratio (NLR), albumin-globulin ratio (AGR) and triglyceride (TG) and presence of intraoperative discharge. We used two ML approaches (RF and XGBoost) to build models and predict the NLM recurrence risk of female patients. Totally 258 patients were randomly divided into a training set and a test set according to a 75%-25% proportion. The model performance was evaluated based on Accuracy, Precision, Recall, F1-score and AUC. The Shapley Additive Explanations (SHAP) method was used to interpret the model.
    RESULTS: There were 48 (18.6%) NLM patients who experienced recurrence during the follow-up period. Ten features were selected in this study to build the ML model. For the RF model, BMI is the most important influence factor and for the XGBoost model is intraoperative discharge. The results of tenfold cross-validation suggest that both the RF model and the XGBoost model have good predictive performance, but the XGBoost model has a better performance than the RF model in our study. The trends of SHAP values of all features in our models are consistent with the trends of these features\' clinical presentation. The inclusion of these ten features in the model is necessary to build practical prediction models for recurrence.
    CONCLUSIONS: The results of tenfold cross-validation and SHAP values suggest that the models have predictive ability. The trend of SHAP value provides auxiliary validation in our models and makes it have more clinical significance.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    堆肥中发芽指数(GI)的测量是一个费时费力的过程。这项研究采用了四种机器学习(ML)模型,即随机森林(RF),人工神经网络(ANN),支持向量回归(SVR)和决策树(DT),根据关键堆肥参数预测GI。预测结果表明,RF(>0.9)和ANN(>0.9)的决定系数(R2)高于SVR(<0.6)和DT(<0.8),这表明RF和ANN对GI显示出优越的预测性能。沙普利添加剂种植值结果表明,堆肥时间,温度,和pH是影响GI的重要特征。堆肥时间对GI的影响最大。总的来说,RF和ANN被认为是预测堆肥中GI的有效工具。这项研究提供了准确预测堆肥过程中GI的可靠方法,从而实现智能堆肥实践。
    The measurement of germination index (GI) in composting is a time-consuming and laborious process. This study employed four machine learning (ML) models, namely Random Forest (RF), Artificial Neural Network (ANN), Support Vector Regression (SVR), and Decision Tree (DT), to predict GI based on key composting parameters. The prediction results showed that the coefficient of determination (R2) for RF (>0.9) and ANN (>0.9) was higher than SVR (<0.6) and DT (<0.8), suggesting that RF and ANN displayed superior predictive performance for GI. The SHapley additive exPlanations value result indicated that composting time, temperature, and pH were the important features contributing to GI. Composting time was found to have the most significant impact on GI. Overall, RF and ANN were suggested as effective tools for predicting GI in composting. This study offers the reliable approach of accurately predicting GI in composting processes, thereby enabling intelligent composting practices.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号