machine learning model

机器学习模型
  • 文章类型: Journal Article
    含藻水体的化学适度预氧化是一种经济、有前景的控制藻类和外源污染物的策略,然而,它受到缺乏有效的在线评估和快速响应反馈方法的制约。在这里,激发-发射矩阵平行因素分析(EEM-PARAFAC)用于在激发/发射波长为260(360)/450nm的次氯酸钠(NaClO)预氧化后鉴定蓝细菌荧光团,在此基础上定量评估了藻类细胞完整性和细胞内有机物(IOM)释放。建立了荧光光谱数据的机器学习模型,用于使用NaClO预测中度预氧化。适度预氧化的最佳NaClO剂量取决于藻类密度,生长阶段,和水源水基质中的有机物浓度。低剂量的NaClO(<0.5mg/L)导致表面吸附的有机物(S-AOM)的短期解吸,而不会损害藻类细胞的完整性,而高剂量的NaClO(≥0.5mg/L)迅速引起细胞损伤。最佳NaClO用量从0.2-0.3mg/L增加到0.9-1.2mg/L,对应于藻类密度从0.1×10到2.0×10的源水细胞/mL。不同的生长阶段需要不同的NaClO剂量:静止期细胞需要0.3-0.5mg/L,对数期细胞0.6-0.8mg/L,和腐烂的细胞2.0-2.5毫克/升天然有机物和S-AOM的存在随着较高的溶解有机碳(DOC)浓度(1.00mg/LDOC需要0.8-1.0mg/LNaClO,而2.20mg/LDOC需要1.5-2.0mg/L)。与其他预测模型相比,机器学习模型(高斯过程回归-Matern(0.5))表现最好,在训练和测试集中实现1.000和0.976的R2值。最佳预氧化后的混凝有效去除藻类污染物,达到91%,92%,藻类细胞被去除92%,浊度,和叶绿素a,分别,从而证明了适度预氧化的有效性。本研究介绍了一种通过监测水源水质和跟踪预氧化后荧光团动态调整NaClO剂量的新方法,加强适度预氧化技术在含藻水处理中的应用。
    Chemical moderate preoxidation for algae-laden water is an economical and prospective strategy for controlling algae and exogenous pollutants, whereas it is constrained by a lack of effective on-line evaluation and quick-response feedback method. Herein, excitation-emission matrix parallel factor analysis (EEM-PARAFAC) was used to identify cyanobacteria fluorophores after preoxidation of sodium hypochlorite (NaClO) at Excitation/Emission wavelength of 260(360)/450 nm, based on which the algal cell integrity and intracellular organic matter (IOM) release were quantitatively assessed. Machine learning modeling of fluorescence spectral data for prediction of moderate preoxidation using NaClO was established. The optimal NaClO dosage for moderate preoxidation depended on algal density, growth phases, and organic matter concentrations in source water matrices. Low doses of NaClO (<0.5 mg/L) led to short-term desorption of surface-adsorbed organic matter (S-AOM) without compromising algal cell integrity, whereas high doses of NaClO (≥0.5 mg/L) quickly caused cell damage. The optimal NaClO dosage increased from 0.2-0.3 mg/L to 0.9-1.2 mg/L, corresponding to the source water with algal densities from 0.1 × 10⁶ to 2.0 × 10⁶ cells/mL. Different growth stages required varying NaClO doses: stationary phase cells needed 0.3-0.5 mg/L, log phase cells 0.6-0.8 mg/L, and decaying cells 2.0-2.5 mg/L. The presence of natural organic matter and S-AOM increased the NaClO dosage limit with higher dissolved organic carbon (DOC) concentrations (1.00 mg/L DOC required 0.8-1.0 mg/L NaClO, while 2.20 mg/L DOC required 1.5-2.0 mg/L). Compared to other predictive models, the machine learning model (Gaussian process regression-Matern (0.5)) performed best, achieving R2 values of 1.000 and 0.976 in training and testing sets. Optimal preoxidation followed by coagulation effectively removed algal contaminants, achieving 91%, 92%, and 92% removal for algal cells, turbidity, and chlorophyll-a, respectively, thereby demonstrating the effectiveness of moderate preoxidation. This study introduces a novel approach to dynamically adjust NaClO dosage by monitoring source water qualities and tracking post-preoxidation fluorophores, enhancing moderate preoxidation technology application in algae-laden water treatment.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    本研究旨在从生物信息学和实验角度验证核心角化基因(CRGs)与阿尔茨海默病(AD)之间的关联,并建立风险预测模型。为此,从GSE109887分析了78个人类来源的颞叶样本,并通过聚类分析探索了所得CRGs的生物学功能,加权基因共表达网络分析和类似方法,以确定最佳机器模型。此外,使用外部数据集GSE33000和列线图来验证模型.使用SH-SY5Y细胞模型和Sprague-Dawley大鼠动物模型验证CRGs的mRNA和蛋白表达。RT‑qPCR和Westernblotting结果显示,二氢硫磺酰胺脱氢酶的mRNA和蛋白表达量,铁氧还蛋白1、谷氨酰胺酶和丙酮酸脱氢酶E1亚基β降低,二氢硫酰胺支链转酰酶E2在AD中的表达增加,这支持了生物信息学分析结果。CRG表达改变影响某些免疫细胞的聚集和浸润。本研究还证实了AD诊断模型和列线图的准确性和有效性。并验证了五个CRG与AD之间的关联,表明AD患者与健康个体之间存在显着差异。因此,CRGs有望作为AD诊断和预后监测的相关生物标志物。
    The present study aimed to validate the association between core cuproptosis genes (CRGs) and Alzheimer\'s disease (AD) from both bioinformatics and experimental perspectives and also to develop a risk prediction model. To this end, 78 human‑derived temporal back samples were analyzed from GSE109887, and the biological functions of the resulting CRGs were explored by cluster analysis, weighted gene co‑expression network analysis and similar methods to identify the best machine model. Moreover, an external dataset GSE33000 and a nomogram were used to validate the model. The mRNA and protein expression of CRGs were validated using the SH‑SY5Y cell model and the Sprague‑Dawley rat animal model. The RT‑qPCR and western blotting results showed that the mRNA and protein expression content of dihydrolipoamide dehydrogenase, ferredoxin 1, glutaminase and pyruvate dehydrogenase E1 subunit β decreased, and the expression of dihydrolipoamide branched chain transacylase E2 increased in AD, which supported the bioinformatic analysis results. The CRG expression alterations affected the aggregation and infiltration of certain immune cells. The present study also confirmed the accuracy and validity of AD diagnostic models and nomograms, and validated the association between five CRGs and AD, indicating a significant difference between patients with AD and healthy individuals. Therefore, CRGs are expected to serve as relevant biomarkers for the diagnosis and prognostic monitoring of AD.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    化学预处理是提高木质纤维素废物(LW)累积甲烷产率(CMY)的常用方法,但其效果受多种因素影响。准确估计预处理LW的甲烷产量仍然是一个挑战。这里,基于254个LW样本,使用两个自动ML平台(基于树的管道优化工具和神经网络智能)构建了机器学习(ML)模型来预测预处理原料的甲烷生产性能。此外,预处理条件的相互作用效应,原料性质,通过模型可解释性分析,研究了消化条件对预处理LW产甲烷的影响。最优ML模型在验证集上表现良好,和消化时间,预处理剂,发现木质素含量(LC)是影响预处理LW甲烷产量的关键因素。如果原始LW中的LC低于15%,使用NaOH可以达到最大CMY,KOH,KOH和碱性过氧化氢(AHP),浓度为3.8%,4.4%,和4.5%,分别。另一方面,如果LC高于15%,只有超过4%的高浓度层次分析法才能显著提高甲烷产量。本研究为优化预处理工艺提供了有价值的指导,比较不同的化学预处理方法,并规范大型沼气厂的运行。
    Chemical pretreatment is a common method to enhance the cumulative methane yield (CMY) of lignocellulosic waste (LW) but its effectiveness is subject to various factors, and accurate estimation of methane production of pretreated LW remains a challenge. Here, based on 254 LW samples, a machine learning (ML) model to predict the methane production performance of pretreated feedstock was constructed using two automated ML platforms (tree-based pipeline optimization tool and neural network intelligence). Furthermore, the interactive effects of pretreatment conditions, feedstock properties, and digestion conditions on methane production of pretreated LW were studied through model interpretability analysis. The optimal ML model performed well on the validation set, and the digestion time, pretreatment agent, and lignin content (LC) were found to be key factors affecting the methane production of pretreated LW. If the LC in the raw LW was lower than 15%, the maximum CMY might be achieved using the NaOH, KOH, and alkaline hydrogen peroxide (AHP) with concentrations of 3.8%, 4.4%, and 4.5%, respectively. On the other hand, if LC was higher than 15%, only high concentrations of AHP exceeding 4% could significantly increase methane production. This study provides valuable guidance for optimizing pretreatment process, comparing different chemical pretreatment approaches, and regulating the operation of large-scale biogas plants.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    高性能混凝土(HPC)抗压强度与其组分之间存在复杂的高维非线性映射关系,对抗压强度的准确预测有很大影响。在本文中,结合BP神经网络(BPNN)的高效稳健软件计算策略,提出了支持向量机(SVM)和遗传算法(GA)用于HPC的抗压强度预测。从以前的文献中提取了8个特征,构建了包含454组数据的抗压强度数据库。对模型进行了训练和测试,以及4个机器学习(ML)模型的性能,即BPNN,SVM,GA-BPNN和GA-SVM,比较。结果表明,耦合模型优于单一模型。此外,由于GA-SVM具有较好的泛化能力和理论基础,其收敛速度和预测精度均优于GA-BPNN。然后利用灰色关联分析(GRA)和Shapley分析验证了GA-SVM模型的可解释性,结果表明,水胶比对抗压强度的影响最大。最后,多输入变量的组合来评估抗压强度,补充了本研究,并再次验证了水胶比的显著影响,为后续研究提供参考价值。
    There is a complex high-dimensional nonlinear mapping relationship between the compressive strength of High-Performance Concrete (HPC) and its components, which has great influence on the accurate prediction of compressive strength. In this paper, an efficient robust software calculation strategy combining BP Neural Network (BPNN), Support Vector Machine (SVM) and Genetic Algorithm (GA) is proposed for the prediction of compressive strength of HPC. 8 features were extracted from the previous literature, and a compressive strength database containing 454 sets of data was constructed. The model was trained and tested, and the performance of 4 Machine Learning (ML) models, namely BPNN, SVM, GA-BPNN and GA-SVM, was compared. The results show that the coupled model is superior to the single model. Moreover, because GA-SVM has better generalization ability and theoretical basis, its convergence speed and prediction accuracy are better than GA-BPNN. Then Grey Relational Analysis (GRA) and Shapley analysis were used to verify the interpretability of the GA-SVM model, which showed that the water-binder ratio had the most significant influence on the compressive strength. Finally, the combination of multiple input variables to evaluate the compressive strength supplemented this research, and again verified the significant influence of water-binder ratio, providing reference value for subsequent research.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Letter
    暂无摘要。
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    阿尔茨海默病(AD)的发病率在全球范围内呈上升趋势,然而,由于与之相关的复杂病理生理机制,其治疗和预测仍具有挑战性。因此,本研究的目的是分析和表征铁凋亡相关基因(FEGs)在AD发病机理中的分子机制,以及构建预后模型。这些发现将为未来AD的诊断和治疗提供新的见解。首先,获得了来自基因表达综合数据库的AD数据集GSE33000和来自FerrDB的FEGs。接下来,无监督聚类分析用于获得与AD最相关的FEGs。随后,对FEGs进行富集分析以探索生物学功能。随后,通过CIBERSORT阐明了这些基因在免疫微环境中的作用。然后,通过比较不同机器学习模型的性能选择最优机器学习。为了验证预测效率,使用列线图对模型进行了验证,校正曲线,决策曲线分析和外部数据集。此外,使用逆转录定量PCR和Westernblot分析验证不同组间FEGs的表达.在AD中,FEGs表达的改变影响某些免疫细胞的聚集和浸润。这表明AD的发生与免疫浸润密切相关。最后,选择了最合适的机器学习模型,建立AD诊断模型和列线图。本研究提供了新的见解,可以增强对FEGs在AD中作用的分子机制的理解。此外,本研究提供了可能有助于AD诊断的生物标志物.
    The incidence of Alzheimer\'s disease (AD) is rising globally, yet its treatment and prediction of this condition remain challenging due to the complex pathophysiological mechanisms associated with it. Consequently, the objective of the present study was to analyze and characterize the molecular mechanisms underlying ferroptosis‑related genes (FEGs) in the pathogenesis of AD, as well as to construct a prognostic model. The findings will provide new insights for the future diagnosis and treatment of AD. First, the AD dataset GSE33000 from the Gene Expression Omnibus database and the FEGs from FerrDB were obtained. Next, unsupervised cluster analysis was used to obtain the FEGs that were most relevant to AD. Subsequently, enrichment analyses were performed on the FEGs to explore biological functions. Subsequently, the role of these genes in the immune microenvironment was elucidated through CIBERSORT. Then, the optimal machine learning was selected by comparing the performance of different machine learning models. To validate the prediction efficiency, the models were validated using nomograms, calibration curves, decision curve analysis and external datasets. Furthermore, the expression of FEGs between different groups was verified using reverse transcription quantitative PCR and western blot analysis. In AD, alterations in the expression of FEGs affect the aggregation and infiltration of certain immune cells. This indicated that the occurrence of AD is strongly associated with immune infiltration. Finally, the most appropriate machine learning models were selected, and AD diagnostic models and nomograms were built. The present study provided novel insights that enhance understanding with regard to the molecular mechanism of action of FEGs in AD. Moreover, the present study provided biomarkers that may facilitate the diagnosis of AD.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在国际上倡导低碳健康的生活方式,环境PM2.5污染给希望从事户外运动和采取积极低碳通勤的城市居民带来了困境。在这项研究中,设计并提出了城市空气健康导航系统(UAHNS),以通过推荐PM2.5暴露最少的路线并基于拓扑数字地图动态发布早期风险警告来帮助用户,应用程序编程接口(API),极限梯度提升(XGBoost)模型,和两步空间插值。在武汉市对UAHNS的功能和应用进行了测试。结果表明,与经过训练的随机森林(RF)相比,LightGBM,Adaboost模型,等。,XGBoost模型表现更好,根据国家空气和气象监测站的数据,R2超过0.90,RMSE约为15.74μg/m3。Further,采用两步空间插值模型,以300m*300m的空间分辨率动态生成污染分布。然后,在武汉随机选择的通勤路线和时间下进行了暴露比较,显示较低PM2.5暴露的推荐途径有效地帮助。骑乘和步行的路线差异率约为14.9%和16.9%,分别。最后,UAHNS平台在武汉整体实现,由实时PM2.5查询组成,任何地点的一小时PM2.5预测功能,城市地图上的健康导航,和个性化的健康信息查询。
    Under international advocacy for a low-carbon and healthy lifestyle, ambient PM2.5 pollution poses a dilemma for urban residents who wish to engage in outdoor exercise and adopt active low-carbon commuting. In this study, an Urban Air Health Navigation System (UAHNS) was designed and proposed to assist users by recommending routes with the least PM2.5 exposure and dynamically issuing early risk warnings based on topologized digital maps, an application programming interface (API), an eXtreme Gradient Boosting (XGBoost) model, and two-step spatial interpolation. A test of the UAHNS\'s functions and applications was carried out in Wuhan city. The results showed that, compared with trained random forest (RF), LightGBM, Adaboost models, etc., the XGBoost model performed better, with an R2 exceeding 0.90 and an RMSE of approximately 15.74 μg/m3, based on data from national air and meteorological monitoring stations. Further, the two-step spatial interpolation model was adopted to dynamically generate pollution distribution at a spatial resolution of 300 m*300 m. Then, an exposure comparison was performed under randomly selected commuting routes and times in Wuhan, showing the recommended routes for lower PM2.5 exposure made effectively help. And the route difference ratios of about 14.9 % and 16.9 % for riding and walking, respectively. Finally, the UAHNS platform was integrally realized for Wuhan, consisting of a real-time PM2.5 query, a one-hour PM2.5 prediction function at any location, health navigation on city map, and a personalized health information query.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    恶性梗阻性黄疸患者ERCP植入后胆管炎的风险仍然未知。建立基于人工智能方法的模型来更准确地预测胆管炎的风险,根据患者支架植入术后患者的临床资料。这项回顾性研究包括218例接受ERCP手术的MOJ患者。共收集27个临床变量作为输入变量。7个模型(包括单变量分析和6个机器学习模型)被训练和测试用于分类预测。通过AUROC测量模型性能。RFT模型表现出出色的性能,精度高达0.86,AUROC高达0.87。RF和SHAP中的特征选择相似,和最佳变量子集的选择产生了一个高的性能与AUROC高达0.89。我们开发了一种混合机器学习模型,比传统的LR预测模型具有更好的预测性能,以及其他基于简单临床数据的胆管炎机器学习模型。该模型可以帮助医生进行临床诊断,采取合理的治疗方案,提高患者的生存率。
    The risk of cholangitis after ERCP implantation in malignant obstructive jaundice patients remains unknown. To develop models based on artificial intelligence methods to predict cholangitis risk more accurately, according to patients after stent implantation in patients\' MOJ clinical data. This retrospective study included 218 patients with MOJ undergoing ERCP surgery. A total of 27 clinical variables were collected as input variables. Seven models (including univariate analysis and six machine learning models) were trained and tested for classified prediction. The model\' performance was measured by AUROC. The RFT model demonstrated excellent performances with accuracies up to 0.86 and AUROC up to 0.87. Feature selection in RF and SHAP was similar, and the choice of the best variable subset produced a high performance with an AUROC up to 0.89. We have developed a hybrid machine learning model with better predictive performance than traditional LR prediction models, as well as other machine learning models for cholangitis based on simple clinical data. The model can assist doctors in clinical diagnosis, adopt reasonable treatment plans, and improve the survival rate of patients.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    生物炭的应用已成为修复被潜在有毒金属(类)(PTM)污染的土壤的一种有前途且可持续的解决方案,然而,其减少作物PTM积累的潜力仍有待充分阐明。在我们的研究中,基于276篇研究文章进行了层次荟萃分析,以量化生物炭应用对作物生长和PTM积累的影响。同时,开发了一种机器学习方法来识别主要的贡献特征。我们的发现表明,生物炭的应用显着促进了作物的生长,减少了作物组织中的PTM浓度,谷物呈下降趋势(36.1%,33.6至38.6%)>枝条(31.1%,29.3至32.8%)>根(27.5%,25.7至29.2%)。此外,发现生物炭修饰可增强其在PTM污染土壤中的修复潜力。观察到生物炭为减少作物对PTM的吸收提供了有利条件。主要通过降低可用PTM浓度和改善整体土壤质量。采用机器学习技术,我们确定了生物炭的特性,例如表面积和C含量是降低土壤-作物系统中PTM生物有效性的关键因素。此外,我们的研究表明,生物炭的应用可以降低与PTM在作物谷物中的存在相关的概率健康风险,从而有助于保护人类健康。这些发现强调了生物炭在修复受PTM污染的土地中的重要作用,并为提高作物安全生产提供了指导方针。
    Biochar application emerges as a promising and sustainable solution for the remediation of soils contaminated with potentially toxic metal (loid)s (PTMs), yet its potential to reduce PTM accumulation in crops remains to be fully elucidated. In our study, a hierarchical meta-analysis based on 276 research articles was conducted to quantify the effects of biochar application on crop growth and PTM accumulation. Meanwhile, a machine learning approach was developed to identify the major contributing features. Our findings revealed that biochar application significantly enhanced crop growth, and reduced PTM concentrations in crop tissues, showing a decrease trend of grains (36.1%, 33.6-38.6%) > shoots (31.1%, 29.3-32.8%) > roots (27.5%, 25.7-29.2%). Furthermore, biochar modifications were found to amplify its remediation potential in PTM-contaminated soils. Biochar application was observed to provide favorable conditions for reducing PTM uptake by crops, primarily through decreasing available PTM concentrations and improving overall soil quality. Employing machine learning techniques, we identified biochar properties, such as surface area and C content as a key factor in decreasing PTM bioavailability in soil-crop systems. Furthermore, our study indicated that biochar application could reduce probabilistic health risks associated with of the presence of PTMs in crop grains, thereby contributing to human health protection. These findings highlighted the essential role of biochar in remediating PTM-contaminated lands and offered guidelines for enhancing safe crop production.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    本研究旨在探索特定的生化指标,并构建2型糖尿病(T2D)患者糖尿病肾病(DKD)的风险预测模型。
    这项研究包括234名T2D患者,其中166人患有DKD,2021年1月至2022年7月在吉林大学第一医院就诊。临床特征,比如年龄,性别,和典型的血液学参数,被收集并用于建模。五种机器学习算法[极限梯度提升(XGBoost),梯度增压机(GBM),支持向量机(SVM)逻辑回归(LR),和随机森林(RF)]用于识别关键的临床和病理特征,并建立DKD的风险预测模型。此外,从吉林大学第三医院收集70例患者(nT2D=20,nDKD=50)的临床数据进行外部验证.
    RF算法在预测发展到DKD方面表现最佳,确定五个主要指标:估计肾小球滤过率(eGFR),糖化白蛋白(GA),尿酸,HbA1c,锌(Zn)。预测模型显示出足够的预测准确性,在内部验证集和外部验证集中,曲线下面积(AUC)值为0.960(95%CI:0.936-0.984)和0.9326(95%CI:0.8747-0.9885),分别。RF模型的诊断效能(AUC=0.960)显著高于RF模型中筛选的具有最高特征重要性的五个特征中的每一个。
    使用RF算法构建的在线DKD风险预测模型是基于其在内部验证中的强大性能而选择的。
    UNASSIGNED: This study aimed to explore specific biochemical indicators and construct a risk prediction model for diabetic kidney disease (DKD) in patients with type 2 diabetes (T2D).
    UNASSIGNED: This study included 234 T2D patients, of whom 166 had DKD, at the First Hospital of Jilin University from January 2021 to July 2022. Clinical characteristics, such as age, gender, and typical hematological parameters, were collected and used for modeling. Five machine learning algorithms [Extreme Gradient Boosting (XGBoost), Gradient Boosting Machine (GBM), Support Vector Machine (SVM), Logistic Regression (LR), and Random Forest (RF)] were used to identify critical clinical and pathological features and to build a risk prediction model for DKD. Additionally, clinical data from 70 patients (nT2D = 20, nDKD = 50) were collected for external validation from the Third Hospital of Jilin University.
    UNASSIGNED: The RF algorithm demonstrated the best performance in predicting progression to DKD, identifying five major indicators: estimated glomerular filtration rate (eGFR), glycated albumin (GA), Uric acid, HbA1c, and Zinc (Zn). The prediction model showed sufficient predictive accuracy with area under the curve (AUC) values of 0.960 (95% CI: 0.936-0.984) and 0.9326 (95% CI: 0.8747-0.9885) in the internal validation set and external validation set, respectively. The diagnostic efficacy of the RF model (AUC = 0.960) was significantly higher than each of the five features screened with the highest feature importance in the RF model.
    UNASSIGNED: The online DKD risk prediction model constructed using the RF algorithm was selected based on its strong performance in the internal validation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号