Machine learning techniques

机器学习技术
  • 文章类型: Journal Article
    目的:本研究的目的是评估机器学习方法在子宫内膜异位症(EM)诊断中的应用。
    方法:本研究共纳入106例EM患者和203例非EM患者(如单纯囊肿和单纯子宫肌瘤),均于2017年1月至2022年9月期间入住北京儿童医院顺义妇女儿童医院。所有参与者均无合并症,并通过术后病理证实其诊断。在EM和非EM组之间进行比较分析。评估了基线数据,包括白细胞计数,中性粒细胞与淋巴细胞比率(NLR),血小板与淋巴细胞比率,淋巴细胞与单核细胞的比率,平均血小板体积,血红蛋白,糖类抗原125(CA125),糖抗原199,凝血参数,和其他血清学指标。使用人工智能算法开发了最佳预测模型来确定EM的存在。目的是为EM的临床诊断和治疗提供新的见解。
    结果:与决策树相比,随机森林算法表现出优越的性能,LogitBoost,人工神经网络,天真贝叶斯,支持向量机,和机器学习方法中的线性回归。当应用随机森林算法时,将CA125与NLR组合产生比单独使用CA125更好的EM预测。CA125结合NLR预测EM的准确率为78.16%,灵敏度为86.21%,曲线下面积(AUC)为0.85(P<0.05)。相比之下,单独使用CA125的EM预测准确率为75.8%,灵敏度为79.3%,AUC为0.82(P<0.05)。
    结论:血清CA125联合NLR对EM的诊断价值高于单独血清CA125。这一发现表明NLR可以与血清CA125一起作为新的补充生物标志物用于EM的诊断。
    OBJECTIVE: The aim of this study is to assess the use of machine learning methodologies in the diagnosis of endometriosis (EM).
    METHODS: This study included a total of 106 patients with EM and 203 patients with non-EM conditions (like simple cysts and simple uterine fibroids), all admitted to the Shunyi Women\'s and Children\'s Hospital of Beijing Children\'s Hospital between January 2017 and September 2022. All participants were free of comorbidities and their diagnoses were confirmed via postoperative pathology. Comparative analysis was conducted between the EM and non-EM groups. Baseline data were assessed, including white blood cell count, neutrophil-to-lymphocyte ratio (NLR), platelet-to-lymphocyte ratio, lymphocyte-to-monocyte ratio, mean platelet volume, hemoglobin, carbohydrate antigen 125 (CA125), carbohydrate antigen 199, coagulation parameters, and other serologic indicators. An optimal predictive model was developed using an artificial intelligence algorithm to determine the presence of EM. The objective is to provide new insights for the clinical diagnosis and treatment of EM.
    RESULTS: The random forest algorithm demonstrated superior performance when compared to decision trees, LogitBoost, artificial neural networks, naïve Bayes, support vector machines, and linear regression in machine learning methods. Combining CA125 with the NLR yielded a better prediction of EM than using CA125 alone when applying the random forest algorithm. The accuracy of predicting EM with CA125 combined with NLR was 78.16%, with a sensitivity of 86.21% and an area under the curve (AUC) of 0.85 (P < 0.05). In contrast, using CA125 alone resulted in an EM prediction accuracy of 75.8%, with a sensitivity of 79.3% and an AUC of 0.82 (P < 0.05).
    CONCLUSIONS: The diagnostic value of serum CA125 combined with the NLR for EM is higher than that of serum CA125 alone. This finding indicates that NLR could serve as a new supplementary biomarker along with serum CA125 in the diagnosis of EM.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    本研究阐明了用于风能预测的动态混合模型的制定和验证,特别强调其短期和长期预测准确性的能力。该模型基于对过去风能发电的时间序列数据的同化,并采用了三种机器学习算法:人工神经网络(ANN),支持向量机(SVM)和K-最近邻居(K-NN)。经验数据,从一台2兆瓦并网风力涡轮机中收获,作为培训和验证阶段的基础。设计了一种比较评估方法,以仔细检查各种度量标准中每个组成算法的性能。这个评估框架有助于识别个体算法的局限性,随后通过在混合模型中实施动态切换机制来缓解。此创新功能使模型能够根据历史绩效数据自适应地选择最有效的预测技术。混合模型在这两个方面都表现出了优异的预测精度,在一天中每隔15分钟进行短期能源预测,在广泛的,长期的。它记录的归一化平均绝对误差(NMAE)为5.54%,明显低于其他测试模型中观察到的5.65%-9.22%的NMAE范围,并且明显优于文献中发现的平均NMAE,从6.73%到10.07%。这种多功能性使其对电网运营商和风电场管理具有不可估量的价值,协助运营和战略规划。该研究的发现不仅有助于可再生能源预测的现有知识体系,而且还表明混合模型在各种其他预测分析领域的更广泛适用性。
    This study elucidates the formulation and validation of a dynamic hybrid model for wind energy forecasting, with a particular emphasis on its capability for both short-term and long-term predictive accuracy. The model is predicated on the assimilation of time-series data from past wind energy generation and employs a triad of machine learning algorithms: Artificial Neural Network (ANN), Support Vector Machine (SVM), and K-Nearest Neighbors (K-NN). Empirical data, harvested from a 2 MW grid-connected wind turbine, served as the basis for the training and validation phases. A comparative evaluation methodology was devised to scrutinize the performance of each constituent algorithm across a diverse array of metrics. This evaluation framework facilitated the identification of individual algorithmic limitations, which were subsequently mitigated through the implementation of a dynamic switching mechanism within the hybrid model. This innovative feature enables the model to adaptively select the most efficacious forecasting technique based on historical performance data. The hybrid model demonstrated superior forecasting accuracy in both, short-term energy forecasts at 15-min intervals over a day, and in broad, long-term. It recorded a Normalized Mean Absolute Error (NMAE) of 5.54 %, which is notably lower than the NMAE range of 5.65 %-9.22 % observed in other tested models, and significantly better than the average NMAE found in the literature, which spans from 6.73 % to 10.07 %. Such versatility renders it invaluable for grid operators and wind farm management, aiding in both operational and strategic planning. The study\'s findings not only contribute to the existing body of knowledge in renewable energy forecasting but also suggest the hybrid model\'s broader applicability in various other predictive analytics domains.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的:准确预测生存预后有助于指导临床决策。本研究的目的是开发一种使用机器学习技术来预测老年心房颤动(AF)患者复合血栓栓塞事件(CTE)发生的模型。这些事件包括新诊断的脑缺血事件,心血管事件,肺栓塞,下肢动脉栓塞.
    方法:本回顾性研究纳入2010年1月至2022年6月解放军总医院收治的6,079例老年房颤住院患者(≥75岁)。随机森林插补用于处理丢失的数据。在描述性统计部分,根据CTE的发生将患者分为两组,并对两组之间的差异进行了分类变量的卡方检验和连续变量的秩和检验。在机器学习部分,以7:3的比例将患者随机分为训练数据集(n=4,225)和验证数据集(n=1,824).四种机器学习模型(逻辑回归,决策树,随机森林,XGBoost)在训练数据集上进行了训练,并在验证数据集上进行了验证。
    结果:复合血栓栓塞事件的发生率为19.53%。最小绝对收缩和选择算子(LASSO)方法,使用5倍交叉验证,将其应用于训练数据集,并确定了总共18个与CTE发生显著相关的特征。随机森林模型在曲线下面积方面优于其他模型(ACC:0.9144,SEN:0.7725,SPE:0.9489,AUC:0.927,95%CI:0.9105-0.9443)。基于临床决策曲线的随机森林模型也显示出良好的临床有效性。Shapley加法移植(SHAP)显示,与模型相关的前五个特征是缺血性卒中病史,高甘油三酯(TG),高总胆固醇(TC),血浆D-二聚体高,年龄。
    结论:本研究提出了一个准确的模型来对CTE高危患者进行分层。随机森林模型具有良好的性能。缺血性卒中病史,年龄,高TG,高TC和高血浆D-二聚体可能与CTE相关。
    OBJECTIVE: Accurate prediction of survival prognosis is helpful to guide clinical decision-making. The aim of this study was to develop a model using machine learning techniques to predict the occurrence of composite thromboembolic events (CTEs) in elderly patients with atrial fibrillation(AF). These events encompass newly diagnosed cerebral ischemia events, cardiovascular events, pulmonary embolism, and lower extremity arterial embolism.
    METHODS: This retrospective study included 6,079 elderly hospitalized patients (≥ 75 years old) with AF admitted to the People\'s Liberation Army General Hospital in China from January 2010 to June 2022. Random forest imputation was used for handling missing data. In the descriptive statistics section, patients were divided into two groups based on the occurrence of CTEs, and differences between the two groups were analyzed using chi-square tests for categorical variables and rank-sum tests for continuous variables. In the machine learning section, the patients were randomly divided into a training dataset (n = 4,225) and a validation dataset (n = 1,824) in a 7:3 ratio. Four machine learning models (logistic regression, decision tree, random forest, XGBoost) were trained on the training dataset and validated on the validation dataset.
    RESULTS: The incidence of composite thromboembolic events was 19.53%. The Least Absolute Shrinkage and Selection Operator (LASSO) method, using 5-fold cross-validation, was applied to the training dataset and identified a total of 18 features that exhibited a significant association with the occurrence of CTEs. The random forest model outperformed other models in terms of area under the curve (ACC: 0.9144, SEN: 0.7725, SPE: 0.9489, AUC: 0.927, 95% CI: 0.9105-0.9443). The random forest model also showed good clinical validity based on the clinical decision curve. The Shapley Additive exPlanations (SHAP) showed that the top five features associated with the model were history of ischemic stroke, high triglyceride (TG), high total cholesterol (TC), high plasma D-dimer, age.
    CONCLUSIONS: This study proposes an accurate model to stratify patients with a high risk of CTEs. The random forest model has good performance. History of ischemic stroke, age, high TG, high TC and high plasma D-Dimer may be correlated with CTEs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Editorial
    人工智能(AI)在过去几年中迅速发展,特别是在医学上用于改进诊断。在临床细胞遗传学中,AI对于分析染色体异常和提高精度至关重要。然而,现有软件缺乏向有经验的用户学习的能力。人工智能整合延伸到基因组数据分析,个性化医学和研究,但伦理问题出现了。在这篇文章中,我们讨论了全自动化细胞遗传学测试解释的挑战,并关注其重要性和益处。
    Artificial intelligence (AI) has rapidly advanced in the past years, particularly in medicine for improved diagnostics. In clinical cytogenetics, AI is becoming crucial for analyzing chromosomal abnormalities and improving precision. However, existing software lack learning capabilities from experienced users. AI integration extends to genomic data analysis, personalized medicine and research, but ethical concerns arise. In this article, we discuss the challenges of the full automation in cytogenetic test interpretation and focus on its importance and benefits.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    预测和改善直肠癌对第二原发癌(SPCs)的反应仍然是临床研究的活跃和挑战性领域。确定SPC的预测风险因素将有助于指导更个性化的治疗策略。在这项研究中,我们建议将经验数据用作支持以患者为导向的决策的证据.所提出的模型由两个主要部分组成:用于提取和分类的管道以及临床风险评估。该研究包括4402个患者数据集,包括395名SPC患者,从三个医疗中心的三个癌症登记数据库中收集;根据文献综述和与临床专家的讨论,10个预测变量被认为是SPC的危险因素。根据重要性对患者进行分类的拟议提取和分类管道是诊断时的年龄,化疗,吸烟行为,联合舞台组,和性,正如以前的研究所证明的那样。C5方法具有最高的预测AUC(84.88%)。此外,所提出的模型与分类管道相关联,该管道显示出80.85%的可接受测试精度,召回率79.97%,特异性为88.12%,精度为85.79%,F1得分为79.88%。我们的结果表明,化疗是直肠癌幸存者SPCs最重要的预后危险因素。此外,我们的临床风险评估决策树阐明了评估这些风险因素组合的有效性的可能性.该模型可能为未来直肠癌幸存者的个性化治疗提供必要的评估和纵向变化。
    Predicting and improving the response of rectal cancer to second primary cancers (SPCs) remains an active and challenging field of clinical research. Identifying predictive risk factors for SPCs will help guide more personalized treatment strategies. In this study, we propose that experience data be used as evidence to support patient-oriented decision-making. The proposed model consists of two main components: a pipeline for extraction and classification and a clinical risk assessment. The study includes 4402 patient datasets, including 395 SPC patients, collected from three cancer registry databases at three medical centers; based on literature reviews and discussion with clinical experts, 10 predictive variables were considered risk factors for SPCs. The proposed extraction and classification pipelines that classified patients according to importance were age at diagnosis, chemotherapy, smoking behavior, combined stage group, and sex, as has been proven in previous studies. The C5 method had the highest predicted AUC (84.88%). In addition, the proposed model was associated with a classification pipeline that showed an acceptable testing accuracy of 80.85%, a recall of 79.97%, a specificity of 88.12%, a precision of 85.79%, and an F1 score of 79.88%. Our results indicate that chemotherapy is the most important prognostic risk factor for SPCs in rectal cancer survivors. Furthermore, our decision tree for clinical risk assessment illuminates the possibility of assessing the effectiveness of a combination of these risk factors. This proposed model may provide an essential evaluation and longitudinal change for personalized treatment of rectal cancer survivors in the future.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    非酒精性脂肪性肝病(NAFLD)的特征在于肝脏中过量脂肪的积累。如果在早期阶段未得到诊断和治疗,NAFLD可以发展为更严重的疾病,如炎症,肝纤维化,肝硬化,甚至肝衰竭.在这项研究中,机器学习技术被用来使用负担得起的和可访问的实验室测试数据来预测NAFLD,而常规技术肝脂肪变性指数(HSI)进行了计算比较。六种算法(随机森林,K近邻,Logistic回归,支持向量机,极端梯度增强,决策树),还有一个合奏模型,用于数据集分析。目的是开发一种具有成本效益的工具,以实现早期诊断,从而更好地管理病情。使用编辑最近邻居的合成少数群体过采样技术(SMOTEENN)解决了数据不平衡的问题。各种评估指标,包括F1得分,精度,准确度,召回,混淆矩阵,平均绝对误差(MAE),接收机工作特性(ROC),和曲线下面积(AUC)用于评估每种技术对疾病预测的适用性。使用国家健康和营养调查(NHANES)数据集的实验结果表明,与我们使用的机器学习技术和HSI相比,集成模型实现了最高的准确性(0.99)和AUC(1.00)。这些发现表明,集成模型具有作为医疗保健专业人员预测NAFLD的有益工具的潜力,利用可访问且具有成本效益的实验室测试数据。
    Non-Alcoholic Fatty Liver Disease (NAFLD) is characterized by the accumulation of excess fat in the liver. If left undiagnosed and untreated during the early stages, NAFLD can progress to more severe conditions such as inflammation, liver fibrosis, cirrhosis, and even liver failure. In this study, machine learning techniques were employed to predict NAFLD using affordable and accessible laboratory test data, while the conventional technique hepatic steatosis index (HSI)was calculated for comparison. Six algorithms (random forest, K-nearest Neighbors, Logistic Regression, Support Vector Machine, extreme gradient boosting, decision tree), along with an ensemble model, were utilized for dataset analysis. The objective was to develop a cost-effective tool for enabling early diagnosis, leading to better management of the condition. The issue of imbalanced data was addressed using the Synthetic Minority Oversampling Technique Edited Nearest Neighbors (SMOTEENN). Various evaluation metrics including the F1 score, precision, accuracy, recall, confusion matrix, the mean absolute error (MAE), receiver operating characteristics (ROC), and area under the curve (AUC) were employed to assess the suitability of each technique for disease prediction. Experimental results using the National Health and Nutrition Examination Survey (NHANES) dataset demonstrated that the ensemble model achieved the highest accuracy (0.99) and AUC (1.00) compared to the machine learning techniques that we used and HSI. These findings indicate that the ensemble model holds potential as a beneficial tool for healthcare professionals to predict NAFLD, leveraging accessible and cost-effective laboratory test data.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:由于战争,创伤后应激障碍(PTSD)的发病率目前正在增加,恐怖主义,和大流行性疾病的情况。因此,PTSD的准确检测对患者的治疗至关重要,为此,本研究旨在对PTSD患者与健康对照者进行分类.
    方法:使用19名PTSD和24名健康对照男性受试者的静息状态功能MRI(rs-fMRI)扫描,使用组水平独立成分分析(ICA)和t检验来识别大多数受影响的大脑区域的激活模式。将受创伤后应激障碍影响的受试者与健康对照的六种机器学习技术进行分类,包括随机森林,天真的贝叶斯,支持向量机,决策树,K-最近邻,线性判别分析,和深度学习三维3D-CNN的数据进行了比较。
    结果:分析了最常见的11个创伤暴露区域和健康大脑的rs-fMRI扫描,以观察其激活水平。杏仁核和脑岛区域被确定为PTSD受试者大脑中感兴趣区域中最激活的区域。此外,机器学习技术已应用于从ICA提取的组件,但模型提供低分类精度。ICA分量也被馈送到3D-CNN模型中,用5倍交叉验证方法训练。3D-CNN模型表现出很高的准确性,如98.12%,98.25%,和98.00%的平均训练,验证,和测试数据集,分别。
    结论:研究结果表明,3D-CNN是一种超越其他六种技术的方法,它有助于准确识别PTSD患者。
    BACKGROUND: The incidence rate of Posttraumatic stress disorder (PTSD) is currently increasing due to wars, terrorism, and pandemic disease situations. Therefore, accurate detection of PTSD is crucial for the treatment of the patients, for this purpose, the present study aims to classify individuals with PTSD versus healthy control.
    METHODS: The resting-state functional MRI (rs-fMRI) scans of 19 PTSD and 24 healthy control male subjects have been used to identify the activation pattern in most affected brain regions using group-level independent component analysis (ICA) and t-test. To classify PTSD-affected subjects from healthy control six machine learning techniques including random forest, Naive Bayes, support vector machine, decision tree, K-nearest neighbor, linear discriminant analysis, and deep learning three-dimensional 3D-CNN have been performed on the data and compared.
    RESULTS: The rs-fMRI scans of the most commonly investigated 11 regions of trauma-exposed and healthy brains are analyzed to observe their level of activation. Amygdala and insula regions are determined as the most activated regions from the regions-of-interest in the brain of PTSD subjects. In addition, machine learning techniques have been applied to the components extracted from ICA but the models provided low classification accuracy. The ICA components are also fed into the 3D-CNN model, which is trained with a 5-fold cross-validation method. The 3D-CNN model demonstrated high accuracies, such as 98.12%, 98.25 %, and 98.00 % on average with training, validation, and testing datasets, respectively.
    CONCLUSIONS: The findings indicate that 3D-CNN is a surpassing method than the other six considered techniques and it helps to recognize PTSD patients accurately.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    肽激素作为基因组编码的信号转导分子,在多细胞生物体中发挥重要作用,他们的失调会导致各种健康问题。在这项研究中,我们提出了一种高精度预测激素肽的方法。用于训练的数据集,测试,并评估我们的模型由1174个激素和1174个非激素肽序列组成。最初,我们利用BLAST和MERCI软件开发了基于相似性的方法。尽管这些基于相似性的方法提供了很高的正确预测概率,他们有局限性,例如没有命中或预测有限的序列。为了克服这些限制,我们进一步开发了基于机器和深度学习的模型。我们的基于逻辑回归的模型在独立/验证数据集上实现了0.93的最大AUROC,准确度为86%。为了利用基于相似性和基于机器学习的模型的强大功能,我们开发了一种集成方法,在验证集上,AUROC为0.96,准确率为89.79%,Matthews相关系数(MCC)为0.8.为了便于研究人员预测和设计激素肽,我们开发了一个名为HOPPred的基于网络的服务器。该服务器提供了一个独特的功能,可以识别激素肽中的激素相关基序。可以在以下位置访问服务器:https://web。iitd.edu.in/raghava/hoppred/.
    Peptide hormones serve as genome-encoded signal transduction molecules that play essential roles in multicellular organisms, and their dysregulation can lead to various health problems. In this study, we propose a method for predicting hormonal peptides with high accuracy. The dataset used for training, testing, and evaluating our models consisted of 1174 hormonal and 1174 non-hormonal peptide sequences. Initially, we developed similarity-based methods utilizing BLAST and MERCI software. Although these similarity-based methods provided a high probability of correct prediction, they had limitations, such as no hits or prediction of limited sequences. To overcome these limitations, we further developed machine and deep learning-based models. Our logistic regression-based model achieved a maximum AUROC of 0.93 with an accuracy of 86% on an independent/validation dataset. To harness the power of similarity-based and machine learning-based models, we developed an ensemble method that achieved an AUROC of 0.96 with an accuracy of 89.79% and a Matthews correlation coefficient (MCC) of 0.8 on the validation set. To facilitate researchers in predicting and designing hormone peptides, we developed a web-based server called HOPPred. This server offers a unique feature that allows the identification of hormone-associated motifs within hormone peptides. The server can be accessed at: https://webs.iiitd.edu.in/raghava/hoppred/.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Systematic Review
    背景:人类情绪识别(HER)在过去几年中一直是研究的热门领域。尽管到目前为止取得了巨大的进展,在自闭症中使用她的注意力相对较少。众所周知,自闭症患者在日常社交交流和情绪反应的原型解释方面面临问题,最常见的是通过面部表情来表达。这对常规HER系统的应用提出了重大的实际挑战,通常是由神经典型的人开发的。
    目的:本研究回顾了在自闭症中使用HER系统的文献,特别是在传感技术和机器学习方法方面,以确定现有的障碍和未来可能的方向。
    方法:我们根据2020PRISMA指南对2011年1月至2023年6月之间发表的文章进行了系统回顾。通过搜索WebofScience和Scopus数据库确定了手稿。当与情感识别有关时,包括手稿,使用传感器和机器学习技术,涉及自闭症儿童,年轻,或成人。
    结果:搜索产生346篇文章。共有65份出版物符合资格标准,并被纳入审查。
    结论:研究主要使用面部表情技术作为情感识别方法。因此,摄像机是研究中使用最广泛的设备,尽管最近观察到生理传感器的使用有增长趋势。幸福,悲伤,愤怒,恐惧,厌恶,惊喜是最常见的。经典的监督机器学习技术主要是以无监督方法或最近的深度学习模型为代价的。研究集中在广义上的自闭症,但有限的努力已经针对更具体的频谱障碍。隐私或安全问题很少得到解决,如果是这样,在一个相当不够详细的水平。
    BACKGROUND: Human Emotion Recognition (HER) has been a popular field of study in the past years. Despite the great progresses made so far, relatively little attention has been paid to the use of HER in autism. People with autism are known to face problems with daily social communication and the prototypical interpretation of emotional responses, which are most frequently exerted via facial expressions. This poses significant practical challenges to the application of regular HER systems, which are normally developed for and by neurotypical people.
    OBJECTIVE: This study reviews the literature on the use of HER systems in autism, particularly with respect to sensing technologies and machine learning methods, as to identify existing barriers and possible future directions.
    METHODS: We conducted a systematic review of articles published between January 2011 and June 2023 according to the 2020 PRISMA guidelines. Manuscripts were identified through searching Web of Science and Scopus databases. Manuscripts were included when related to emotion recognition, used sensors and machine learning techniques, and involved children with autism, young, or adults.
    RESULTS: The search yielded 346 articles. A total of 65 publications met the eligibility criteria and were included in the review.
    CONCLUSIONS: Studies predominantly used facial expression techniques as the emotion recognition method. Consequently, video cameras were the most widely used devices across studies, although a growing trend in the use of physiological sensors was observed lately. Happiness, sadness, anger, fear, disgust, and surprise were most frequently addressed. Classical supervised machine learning techniques were primarily used at the expense of unsupervised approaches or more recent deep learning models. Studies focused on autism in a broad sense but limited efforts have been directed towards more specific disorders of the spectrum. Privacy or security issues were seldom addressed, and if so, at a rather insufficient level of detail.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    单轴压缩下岩石的强度,通常称为单轴抗压强度(UCS),在各种地质力学应用中起着至关重要的作用,如设计基础,采矿项目,岩石上的斜坡,隧道施工,和岩石表征。然而,在一些岩石中,取样和准备可能变得具有挑战性,这使得很难直接确定岩石的UCS。因此,间接方法被广泛用于估计UCS。本研究提出了两种机器学习模型,简单线性回归和逐步回归,在Python中实现,以计算Charnockite岩石的UCS。该模型考虑超声波脉冲速度(UPV),施密特锤子反弹数(N),巴西抗拉强度(BTS),和点负荷指数(PLI)作为预测Charnockite样本UCS的因素。三个回归指标,包括回归系数(R2),均方根误差(RMSE),和平均绝对误差(MAE),用于评估和比较模型的性能。结果表明,这两个模型都有很高的预测能力。值得注意的是,逐步模型实现了0.99的测试R2和0.988的训练R2,用于预测Charnockite强度,使其成为最精确的模型。对影响因素的分析表明,UPV在预测Charnoccite的UCS中起着重要作用。
    The strength of rock under uniaxial compression, commonly known as Uniaxial Compressive Strength (UCS), plays a crucial role in various geomechanical applications such as designing foundations, mining projects, slopes in rocks, tunnel construction, and rock characterization. However, sampling and preparation can become challenging in some rocks, making it difficult to determine the UCS of the rocks directly. Therefore, indirect approaches are widely used for estimating UCS. This study presents two Machine Learning Models, Simple Linear Regression and Step-wise Regression, implemented in Python to calculate the UCS of Charnockite rocks. The models consider Ultrasonic Pulse Velocity (UPV), Schmidt Hammer Rebound Number (N), Brazilian Tensile Strength (BTS), and Point Load Index (PLI) as factors for forecasting the UCS of Charnockite samples. Three regression metrics, including Coefficient of Regression (R2), Root Mean Square Error (RMSE), and Mean Absolute Error (MAE), were used to evaluate and compare the performance of the models. The results indicate a high predictive capability of both models. Notably, the Step-wise model achieved a testing R2 of 0.99 and a training R2 of 0.988 for predicting Charnockite strength, making it the most accurate model. The analysis of the influential factors indicates that UPV plays a significant role in predicting the UCS of Charnockite.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号