machine learning model

机器学习模型
  • 文章类型: Journal Article
    阿尔茨海默病是一种神经退行性疾病,其特征是脑细胞进行性变性,导致认知能力下降和记忆丧失。它是痴呆症的最常见原因,影响全球数百万人。虽然目前没有治愈阿尔茨海默病的方法,早期发现和治疗有助于减缓症状进展,提高生活质量。这项研究提出了一种诊断工具,用于使用基于特征的机器学习对轻度认知障碍和阿尔茨海默病进行分类,该机器学习应用于光学相干断层血管造影图像(OCT-A)。从OCT-A图像中提取几个特征,包括五个部门的容器密度,中央凹无血管区的区域,视网膜厚度,和基于范围过滤的OCT-A图像的直方图的新颖特征。为了确保多样化人口的有效性,收集了我们研究的大型本地数据库。我们研究的有希望的结果,92.17,%的最佳准确度将为早期发现阿尔茨海默病提供有效的诊断工具。
    Alzheimer\'s disease is a type of neurodegenerative disorder that is characterized by the progressive degeneration of brain cells, leading to cognitive decline and memory loss. It is the most common cause of dementia and affects millions of people worldwide. While there is currently no cure for Alzheimer\'s disease, early detection and treatment can help to slow the progression of symptoms and improve quality of life. This research presents a diagnostic tool for classifying mild cognitive impairment and Alzheimer\'s diseases using feature-based machine learning applied to optical coherence tomographic angiography images (OCT-A). Several features are extracted from the OCT-A image, including vessel density in five sectors, the area of the foveal avascular zone, retinal thickness, and novel features based on the histogram of the range-filtered OCT-A image. To ensure effectiveness for a diverse population, a large local database for our study was collected. The promising results of our study, with the best accuracy of 92.17,% will provide an efficient diagnostic tool for early detection of Alzheimer\'s disease.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    口腔鳞状细胞癌的组织学分级影响预后。在本研究中,我们进行了影像组学分析,从18F-FDGPET图像数据中提取特征,从功能创建机器学习模型,并验证了口腔鳞状细胞癌组织学分级预测的准确性。研究对象为191例患者,术前进行18F-FDG-PET检查,术后确认组织病理学分级,它们的肿瘤大小足以进行影像组学分析。这些患者被分成70%/30%的比例,用作训练数据和测试数据,分别。我们从每位患者的PET图像中提取了2993个影像组学特征。逻辑回归(LR),支持向量机(SVM)随机森林(RF),朴素贝叶斯(NB),并创建了K最近邻(KNN)机器学习模型。从受试者工作特征曲线获得的预测口腔鳞状细胞癌组织学分级的曲线下面积分别为LR的0.72、0.71、0.84、0.74和0.73,SVM,射频,NB,和KNN,分别。我们证实,PET影像组学分析可用于术前预测口腔鳞状细胞癌的组织学分级。
    The histological grade of oral squamous cell carcinoma affects the prognosis. In the present study, we performed a radiomics analysis to extract features from 18F-FDG PET image data, created machine learning models from the features, and verified the accuracy of the prediction of the histological grade of oral squamous cell carcinoma. The subjects were 191 patients in whom an 18F-FDG-PET examination was performed preoperatively and a histopathological grade was confirmed after surgery, and their tumor sizes were sufficient for a radiomics analysis. These patients were split in a 70%/30% ratio for use as training data and testing data, respectively. We extracted 2993 radiomics features from the PET images of each patient. Logistic Regression (LR), Support Vector Machine (SVM), Random Forest (RF), Naïve Bayes (NB), and K-Nearest Neighbor (KNN) machine learning models were created. The areas under the curve obtained from receiver operating characteristic curves for the prediction of the histological grade of oral squamous cell carcinoma were 0.72, 0.71, 0.84, 0.74, and 0.73 for LR, SVM, RF, NB, and KNN, respectively. We confirmed that a PET radiomics analysis is useful for the preoperative prediction of the histological grade of oral squamous cell carcinoma.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    高性能混凝土(HPC)抗压强度与其组分之间存在复杂的高维非线性映射关系,对抗压强度的准确预测有很大影响。在本文中,结合BP神经网络(BPNN)的高效稳健软件计算策略,提出了支持向量机(SVM)和遗传算法(GA)用于HPC的抗压强度预测。从以前的文献中提取了8个特征,构建了包含454组数据的抗压强度数据库。对模型进行了训练和测试,以及4个机器学习(ML)模型的性能,即BPNN,SVM,GA-BPNN和GA-SVM,比较。结果表明,耦合模型优于单一模型。此外,由于GA-SVM具有较好的泛化能力和理论基础,其收敛速度和预测精度均优于GA-BPNN。然后利用灰色关联分析(GRA)和Shapley分析验证了GA-SVM模型的可解释性,结果表明,水胶比对抗压强度的影响最大。最后,多输入变量的组合来评估抗压强度,补充了本研究,并再次验证了水胶比的显著影响,为后续研究提供参考价值。
    There is a complex high-dimensional nonlinear mapping relationship between the compressive strength of High-Performance Concrete (HPC) and its components, which has great influence on the accurate prediction of compressive strength. In this paper, an efficient robust software calculation strategy combining BP Neural Network (BPNN), Support Vector Machine (SVM) and Genetic Algorithm (GA) is proposed for the prediction of compressive strength of HPC. 8 features were extracted from the previous literature, and a compressive strength database containing 454 sets of data was constructed. The model was trained and tested, and the performance of 4 Machine Learning (ML) models, namely BPNN, SVM, GA-BPNN and GA-SVM, was compared. The results show that the coupled model is superior to the single model. Moreover, because GA-SVM has better generalization ability and theoretical basis, its convergence speed and prediction accuracy are better than GA-BPNN. Then Grey Relational Analysis (GRA) and Shapley analysis were used to verify the interpretability of the GA-SVM model, which showed that the water-binder ratio had the most significant influence on the compressive strength. Finally, the combination of multiple input variables to evaluate the compressive strength supplemented this research, and again verified the significant influence of water-binder ratio, providing reference value for subsequent research.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:术前评估很重要,我们的研究探索了机器学习方法在麻醉风险分类和评估各种因素贡献中的应用。为了在模型训练期间最小化混杂变量的影响,我们使用了生理状态和年龄相似的同质组,他们接受了类似的盆腔器官相关手术,但不涉及恶性肿瘤.
    目的:2017年1月1日至2021年12月31日期间进行妊娠或妇科手术的育龄妇女(年龄=20-50岁)的数据来自国立台湾大学医院综合医学数据库。
    方法:我们首先进行了探索性分析并选择了关键特征。然后,我们进行了数据预处理,以获取与术前检查相关的特征。为了进一步提高预测性能,我们采用对数似然比算法生成合并症模式。最后,我们将处理后的特征输入到光梯度增强机(LightGBM)模型中进行训练和后续预测。
    结果:共纳入10,892例患者。在这个数据集中,9893名患者被归类为低麻醉风险(美国麻醉医师协会身体状况评分1-2),999例患者被归类为麻醉风险高(美国麻醉医师协会身体状况评分>2)。LightGBM模型的接收器工作特性曲线下的面积为90.25。
    结论:通过结合合并症信息和临床实验室数据,我们基于LightGBM模型的方法为麻醉风险分类提供了更准确的预测.
    背景:本研究已在国立台湾大学医院研究伦理委员会注册,试验编号为202204010RINB。
    BACKGROUND: Preoperative evaluation is important, and this study explored the application of machine learning methods for anesthetic risk classification and the evaluation of the contributions of various factors. To minimize the effects of confounding variables during model training, we used a homogenous group with similar physiological states and ages undergoing similar pelvic organ-related procedures not involving malignancies.
    OBJECTIVE: Data on women of reproductive age (age 20-50 years) who underwent gestational or gynecological surgery between January 1, 2017, and December 31, 2021, were obtained from the National Taiwan University Hospital Integrated Medical Database.
    METHODS: We first performed an exploratory analysis and selected key features. We then performed data preprocessing to acquire relevant features related to preoperative examination. To further enhance predictive performance, we used the log-likelihood ratio algorithm to generate comorbidity patterns. Finally, we input the processed features into the light gradient boosting machine (LightGBM) model for training and subsequent prediction.
    RESULTS: A total of 10,892 patients were included. Within this data set, 9893 patients were classified as having low anesthetic risk (American Society of Anesthesiologists physical status score of 1-2), and 999 patients were classified as having high anesthetic risk (American Society of Anesthesiologists physical status score of >2). The area under the receiver operating characteristic curve of the proposed model was 0.6831.
    CONCLUSIONS: By combining comorbidity information and clinical laboratory data, our methodology based on the LightGBM model provides more accurate predictions for anesthetic risk classification.
    BACKGROUND: Research Ethics Committee of the National Taiwan University Hospital 202204010RINB; https://www.ntuh.gov.tw/RECO/Index.action.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:及时有效地识别患有抑郁症(DS)的个体对于提供及时治疗至关重要。机器学习模型在这一领域表现出了希望;然而,研究往往不足以证明使用这些模型的实际好处,并且无法提供切实的实际应用。
    目的:本研究旨在建立一种新的方法来识别可能表现出DS的个体,通过概率测度以更可解释的方式识别最有影响力的特征,并提出可用于实际应用的工具。
    方法:该研究使用了3个数据集:PROACTIVE,2013年巴西国家健康调查(PesquisaNacionaldeSaúde[PNS])和PNS2019,包括社会人口统计学和健康相关特征。使用贝叶斯网络进行特征选择。然后使用选定的特征来训练机器学习模型以预测DS,在9项患者健康问卷中,评分≥10。与随机方法相比,该研究还分析了不同敏感性率对减少筛选访谈的影响。
    结果:该方法允许用户在灵敏度之间进行明智的权衡,特异性,减少面试次数。在通过最大化Youden指数确定的阈值0.444、0.412和0.472下,模型的灵敏度为0.717、0.741和0.718,特异性为0.644、0.737和0.766,分别为PNS2013和PNS2019。这3个数据集的接收器工作特性曲线下面积分别为0.736、0.801和0.809,分别。对于PROACTIVE数据集,最具影响力的特征是姿势平衡,呼吸急促,以及老年人的感觉。在PNS2013数据集中,特点是能够进行日常活动,胸痛,睡眠问题,和慢性背部问题。PNS2019数据集与PNS2013数据集共享3个最具影响力的特征。然而,不同的是用言语虐待代替了慢性背部问题。重要的是要注意,PNS数据集中包含的特征与PROACTIVE数据集中的特征不同。实证分析表明,使用所提出的模型可导致筛选访谈减少52%,同时保持0.80的敏感性。
    结论:这项研究开发了一种新的方法来识别患有DS的个体,展示了使用贝叶斯网络识别最重要特征的实用性。此外,这种方法有可能大大减少筛选访谈的数量,同时保持高度的敏感性,从而促进改善DS患者的早期识别和干预策略。
    BACKGROUND: Identifying individuals with depressive symptomatology (DS) promptly and effectively is of paramount importance for providing timely treatment. Machine learning models have shown promise in this area; however, studies often fall short in demonstrating the practical benefits of using these models and fail to provide tangible real-world applications.
    OBJECTIVE: This study aims to establish a novel methodology for identifying individuals likely to exhibit DS, identify the most influential features in a more explainable way via probabilistic measures, and propose tools that can be used in real-world applications.
    METHODS: The study used 3 data sets: PROACTIVE, the Brazilian National Health Survey (Pesquisa Nacional de Saúde [PNS]) 2013, and PNS 2019, comprising sociodemographic and health-related features. A Bayesian network was used for feature selection. Selected features were then used to train machine learning models to predict DS, operationalized as a score of ≥10 on the 9-item Patient Health Questionnaire. The study also analyzed the impact of varying sensitivity rates on the reduction of screening interviews compared to a random approach.
    RESULTS: The methodology allows the users to make an informed trade-off among sensitivity, specificity, and a reduction in the number of interviews. At the thresholds of 0.444, 0.412, and 0.472, determined by maximizing the Youden index, the models achieved sensitivities of 0.717, 0.741, and 0.718, and specificities of 0.644, 0.737, and 0.766 for PROACTIVE, PNS 2013, and PNS 2019, respectively. The area under the receiver operating characteristic curve was 0.736, 0.801, and 0.809 for these 3 data sets, respectively. For the PROACTIVE data set, the most influential features identified were postural balance, shortness of breath, and how old people feel they are. In the PNS 2013 data set, the features were the ability to do usual activities, chest pain, sleep problems, and chronic back problems. The PNS 2019 data set shared 3 of the most influential features with the PNS 2013 data set. However, the difference was the replacement of chronic back problems with verbal abuse. It is important to note that the features contained in the PNS data sets differ from those found in the PROACTIVE data set. An empirical analysis demonstrated that using the proposed model led to a potential reduction in screening interviews of up to 52% while maintaining a sensitivity of 0.80.
    CONCLUSIONS: This study developed a novel methodology for identifying individuals with DS, demonstrating the utility of using Bayesian networks to identify the most significant features. Moreover, this approach has the potential to substantially reduce the number of screening interviews while maintaining high sensitivity, thereby facilitating improved early identification and intervention strategies for individuals experiencing DS.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    阿尔茨海默病(AD)的发病率在全球范围内呈上升趋势,然而,由于与之相关的复杂病理生理机制,其治疗和预测仍具有挑战性。因此,本研究的目的是分析和表征铁凋亡相关基因(FEGs)在AD发病机理中的分子机制,以及构建预后模型。这些发现将为未来AD的诊断和治疗提供新的见解。首先,获得了来自基因表达综合数据库的AD数据集GSE33000和来自FerrDB的FEGs。接下来,无监督聚类分析用于获得与AD最相关的FEGs。随后,对FEGs进行富集分析以探索生物学功能。随后,通过CIBERSORT阐明了这些基因在免疫微环境中的作用。然后,通过比较不同机器学习模型的性能选择最优机器学习。为了验证预测效率,使用列线图对模型进行了验证,校正曲线,决策曲线分析和外部数据集。此外,使用逆转录定量PCR和Westernblot分析验证不同组间FEGs的表达.在AD中,FEGs表达的改变影响某些免疫细胞的聚集和浸润。这表明AD的发生与免疫浸润密切相关。最后,选择了最合适的机器学习模型,建立AD诊断模型和列线图。本研究提供了新的见解,可以增强对FEGs在AD中作用的分子机制的理解。此外,本研究提供了可能有助于AD诊断的生物标志物.
    The incidence of Alzheimer\'s disease (AD) is rising globally, yet its treatment and prediction of this condition remain challenging due to the complex pathophysiological mechanisms associated with it. Consequently, the objective of the present study was to analyze and characterize the molecular mechanisms underlying ferroptosis‑related genes (FEGs) in the pathogenesis of AD, as well as to construct a prognostic model. The findings will provide new insights for the future diagnosis and treatment of AD. First, the AD dataset GSE33000 from the Gene Expression Omnibus database and the FEGs from FerrDB were obtained. Next, unsupervised cluster analysis was used to obtain the FEGs that were most relevant to AD. Subsequently, enrichment analyses were performed on the FEGs to explore biological functions. Subsequently, the role of these genes in the immune microenvironment was elucidated through CIBERSORT. Then, the optimal machine learning was selected by comparing the performance of different machine learning models. To validate the prediction efficiency, the models were validated using nomograms, calibration curves, decision curve analysis and external datasets. Furthermore, the expression of FEGs between different groups was verified using reverse transcription quantitative PCR and western blot analysis. In AD, alterations in the expression of FEGs affect the aggregation and infiltration of certain immune cells. This indicated that the occurrence of AD is strongly associated with immune infiltration. Finally, the most appropriate machine learning models were selected, and AD diagnostic models and nomograms were built. The present study provided novel insights that enhance understanding with regard to the molecular mechanism of action of FEGs in AD. Moreover, the present study provided biomarkers that may facilitate the diagnosis of AD.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    传粉者收集的花粉可以用作觅食行为的标记,也可以指示每种环境中存在的植物物种。花粉摄入对传粉者的健康和生存至关重要。在觅食活动中,一些传粉者,比如蜜蜂,操纵收集的花粉与唾液分泌物和花蜜(花粉)混合,改变花粉化学特征。已经开发了不同的工具来鉴定花粉的植物起源,基于显微镜,光谱学,或分子标记。然而,到目前为止,花粉从未被调查过。在我们的工作中,春季采集了5个不同气候地区的花粉。使用基于显微镜的技术鉴定了花粉,然后用MALDI-MS分析测试了四种不同的化学提取溶液和两种物理破坏方法以实现MALDI-MS有效方案。在用乙酸或三氟乙酸萃取后,使用超声破碎方法获得最佳性能。因此,我们提出了一种新的快速可靠的方法,用于使用MALDI-MS鉴定球花粉的植物起源。这种新方法为从植物生物多样性到生态系统营养相互作用的广泛环境研究打开了大门。
    Pollen collected by pollinators can be used as a marker of the foraging behavior as well as indicate the botanical species present in each environment. Pollen intake is essential for pollinators\' health and survival. During the foraging activity, some pollinators, such as honeybees, manipulate the collected pollen mixing it with salivary secretions and nectar (corbicular pollen) changing the pollen chemical profile. Different tools have been developed for the identification of the botanical origin of pollen, based on microscopy, spectrometry, or molecular markers. However, up to date, corbicular pollen has never been investigated. In our work, corbicular pollen from 5 regions with different climate conditions was collected during spring. Pollens were identified with microscopy-based techniques, and then analyzed in MALDI-MS. Four different chemical extraction solutions and two physical disruption methods were tested to achieve a MALDI-MS effective protocol. The best performance was obtained using a sonication disruption method after extraction with acetic acid or trifluoroacetic acid. Therefore, we propose a new rapid and reliable methodology for the identification of the botanical origin of the corbicular pollens using MALDI-MS. This new approach opens to a wide range of environmental studies spanning from plant biodiversity to ecosystem trophic interactions.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    恶性梗阻性黄疸患者ERCP植入后胆管炎的风险仍然未知。建立基于人工智能方法的模型来更准确地预测胆管炎的风险,根据患者支架植入术后患者的临床资料。这项回顾性研究包括218例接受ERCP手术的MOJ患者。共收集27个临床变量作为输入变量。7个模型(包括单变量分析和6个机器学习模型)被训练和测试用于分类预测。通过AUROC测量模型性能。RFT模型表现出出色的性能,精度高达0.86,AUROC高达0.87。RF和SHAP中的特征选择相似,和最佳变量子集的选择产生了一个高的性能与AUROC高达0.89。我们开发了一种混合机器学习模型,比传统的LR预测模型具有更好的预测性能,以及其他基于简单临床数据的胆管炎机器学习模型。该模型可以帮助医生进行临床诊断,采取合理的治疗方案,提高患者的生存率。
    The risk of cholangitis after ERCP implantation in malignant obstructive jaundice patients remains unknown. To develop models based on artificial intelligence methods to predict cholangitis risk more accurately, according to patients after stent implantation in patients\' MOJ clinical data. This retrospective study included 218 patients with MOJ undergoing ERCP surgery. A total of 27 clinical variables were collected as input variables. Seven models (including univariate analysis and six machine learning models) were trained and tested for classified prediction. The model\' performance was measured by AUROC. The RFT model demonstrated excellent performances with accuracies up to 0.86 and AUROC up to 0.87. Feature selection in RF and SHAP was similar, and the choice of the best variable subset produced a high performance with an AUROC up to 0.89. We have developed a hybrid machine learning model with better predictive performance than traditional LR prediction models, as well as other machine learning models for cholangitis based on simple clinical data. The model can assist doctors in clinical diagnosis, adopt reasonable treatment plans, and improve the survival rate of patients.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    参考蒸散量(ET0)是有效灌溉计划和地下水保护的重要参数。针对可用气象参数的特定组合,为ET0估计设计了不同的机器学习模型。然而,到目前为止,还没有提出可以处理可用气象参数的多种组合来估计ET0的模型。本文为此提出了一种改进的混合准模糊人工神经网络(ANN)模型(EvatCrop)的新颖体系结构。与其他三种流行的模型相比,EvatCrop产生了更好的结果,决策树,人工神经网络,和自适应神经模糊推理系统,与研究地点和输入参数的组合无关。对于真实的案例研究,它被应用于北孟加拉特莱农业气候区的地下水压力区,印度,从2000年到2014年,利用国家环境预测中心提供的每日气象数据进行培训和测试。将模型的精度与标准Penman-Monteith模型(FAO56PM)进行比较。经验结果表明,在不同的数据受限情况下,模型性能显着变化。当完整的输入参数集可用时,EvatCrop得出了最佳的决定系数值(R2=0.988),一致度(d=0.997),均方根误差(RMSE=0.183),均方根相对误差(RMSRE=0.034)。
    Reference evapotranspiration (ET0 ) is a significant parameter for efficient irrigation scheduling and groundwater conservation. Different machine learning models have been designed for ET0 estimation for specific combinations of available meteorological parameters. However, no single model has been suggested so far that can handle diverse combinations of available meteorological parameters for the estimation of ET0. This article suggests a novel architecture of an improved hybrid quasi-fuzzy artificial neural network (ANN) model (EvatCrop) for this purpose. EvatCrop yielded superior results when compared with the other three popular models, decision trees, artificial neural networks, and adaptive neuro-fuzzy inference systems, irrespective of study locations and the combinations of input parameters. For real-field case studies, it was applied in the groundwater-stressed area of the Terai agro-climatic region of North Bengal, India, and trained and tested with the daily meteorological data available from the National Centres for Environmental Prediction from 2000 to 2014. The precision of the model was compared with the standard Penman-Monteith model (FAO56PM). Empirical results depicted that the model performances remarkably varied under different data-limited situations. When the complete set of input parameters was available, EvatCrop resulted in the best values of coefficient of determination (R2 = 0.988), degree of agreement (d = 0.997), root mean square error (RMSE = 0.183), and root mean square relative error (RMSRE = 0.034).
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    本研究旨在探索特定的生化指标,并构建2型糖尿病(T2D)患者糖尿病肾病(DKD)的风险预测模型。
    这项研究包括234名T2D患者,其中166人患有DKD,2021年1月至2022年7月在吉林大学第一医院就诊。临床特征,比如年龄,性别,和典型的血液学参数,被收集并用于建模。五种机器学习算法[极限梯度提升(XGBoost),梯度增压机(GBM),支持向量机(SVM)逻辑回归(LR),和随机森林(RF)]用于识别关键的临床和病理特征,并建立DKD的风险预测模型。此外,从吉林大学第三医院收集70例患者(nT2D=20,nDKD=50)的临床数据进行外部验证.
    RF算法在预测发展到DKD方面表现最佳,确定五个主要指标:估计肾小球滤过率(eGFR),糖化白蛋白(GA),尿酸,HbA1c,锌(Zn)。预测模型显示出足够的预测准确性,在内部验证集和外部验证集中,曲线下面积(AUC)值为0.960(95%CI:0.936-0.984)和0.9326(95%CI:0.8747-0.9885),分别。RF模型的诊断效能(AUC=0.960)显著高于RF模型中筛选的具有最高特征重要性的五个特征中的每一个。
    使用RF算法构建的在线DKD风险预测模型是基于其在内部验证中的强大性能而选择的。
    UNASSIGNED: This study aimed to explore specific biochemical indicators and construct a risk prediction model for diabetic kidney disease (DKD) in patients with type 2 diabetes (T2D).
    UNASSIGNED: This study included 234 T2D patients, of whom 166 had DKD, at the First Hospital of Jilin University from January 2021 to July 2022. Clinical characteristics, such as age, gender, and typical hematological parameters, were collected and used for modeling. Five machine learning algorithms [Extreme Gradient Boosting (XGBoost), Gradient Boosting Machine (GBM), Support Vector Machine (SVM), Logistic Regression (LR), and Random Forest (RF)] were used to identify critical clinical and pathological features and to build a risk prediction model for DKD. Additionally, clinical data from 70 patients (nT2D = 20, nDKD = 50) were collected for external validation from the Third Hospital of Jilin University.
    UNASSIGNED: The RF algorithm demonstrated the best performance in predicting progression to DKD, identifying five major indicators: estimated glomerular filtration rate (eGFR), glycated albumin (GA), Uric acid, HbA1c, and Zinc (Zn). The prediction model showed sufficient predictive accuracy with area under the curve (AUC) values of 0.960 (95% CI: 0.936-0.984) and 0.9326 (95% CI: 0.8747-0.9885) in the internal validation set and external validation set, respectively. The diagnostic efficacy of the RF model (AUC = 0.960) was significantly higher than each of the five features screened with the highest feature importance in the RF model.
    UNASSIGNED: The online DKD risk prediction model constructed using the RF algorithm was selected based on its strong performance in the internal validation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号