ensemble

合奏
  • 文章类型: Journal Article
    背景:COVID-19(PASC)的后遗症,也被称为长科维德,是急性COVID-19后一系列长期症状的广泛分组。这些症状可能发生在一系列生物系统中,导致在确定PASC的危险因素和该疾病的病因方面面临挑战。对预测未来PASC的特征的理解是有价值的,因为这可以为识别高风险个体和未来的预防工作提供信息。然而,目前有关PASC危险因素的知识有限。
    目的:使用来自国家COVID队列合作组织的55,257名患者(其中1名PASC患者与4名匹配对照)的样本,作为美国国立卫生研究院长期COVID计算挑战的一部分,我们试图从一组经筛选的临床知情协变量中预测PASC诊断的个体风险.国家COVID队列合作组织包括来自美国84个地点的2200多万患者的电子健康记录。
    方法:我们预测了个体PASC状态,给定协变量信息,使用SuperLearner(一种集成机器学习算法,也称为堆叠)来学习梯度提升和随机森林算法的最优组合,以最大化接收器算子曲线下的面积。我们基于3个级别评估了变量重要性(Shapley值):个体特征,时间窗口,和临床领域。我们使用一组随机选择的研究地点从外部验证了这些发现。
    结果:我们能够准确预测个体PASC诊断(曲线下面积0.874)。观察期长度的个体特征,急性COVID-19和病毒性下呼吸道感染期间卫生保健相互作用的数量对随后的PASC诊断最具预测性.暂时,我们发现基线特征是未来PASC诊断的最具预测性的,与之前的特征相比,during,或急性COVID-19后。我们发现医疗保健使用的临床领域,人口统计学或人体测量学,和呼吸因素是PASC诊断的最具预测性的因素。
    结论:这里概述的方法提供了一个开放源代码,使用超级学习者使用电子健康记录数据预测PASC状态的应用示例,可以在各种设置中复制。在个体预测因子和临床领域,我们一致发现,与医疗保健使用相关的因素是PASC诊断的最强预测因子.这表明,任何使用PASC诊断作为主要结果的观察性研究都必须严格考虑异质医疗保健的使用。我们的研究结果支持以下假设:临床医生可能能够在急性COVID-19诊断之前准确评估患者的PASC风险,这可以改善早期干预和预防性护理。我们的发现还强调了呼吸特征在PASC风险评估中的重要性。
    RR2-10.1101/2023.07.27.23293272。
    BACKGROUND: Postacute sequelae of COVID-19 (PASC), also known as long COVID, is a broad grouping of a range of long-term symptoms following acute COVID-19. These symptoms can occur across a range of biological systems, leading to challenges in determining risk factors for PASC and the causal etiology of this disorder. An understanding of characteristics that are predictive of future PASC is valuable, as this can inform the identification of high-risk individuals and future preventative efforts. However, current knowledge regarding PASC risk factors is limited.
    OBJECTIVE: Using a sample of 55,257 patients (at a ratio of 1 patient with PASC to 4 matched controls) from the National COVID Cohort Collaborative, as part of the National Institutes of Health Long COVID Computational Challenge, we sought to predict individual risk of PASC diagnosis from a curated set of clinically informed covariates. The National COVID Cohort Collaborative includes electronic health records for more than 22 million patients from 84 sites across the United States.
    METHODS: We predicted individual PASC status, given covariate information, using Super Learner (an ensemble machine learning algorithm also known as stacking) to learn the optimal combination of gradient boosting and random forest algorithms to maximize the area under the receiver operator curve. We evaluated variable importance (Shapley values) based on 3 levels: individual features, temporal windows, and clinical domains. We externally validated these findings using a holdout set of randomly selected study sites.
    RESULTS: We were able to predict individual PASC diagnoses accurately (area under the curve 0.874). The individual features of the length of observation period, number of health care interactions during acute COVID-19, and viral lower respiratory infection were the most predictive of subsequent PASC diagnosis. Temporally, we found that baseline characteristics were the most predictive of future PASC diagnosis, compared with characteristics immediately before, during, or after acute COVID-19. We found that the clinical domains of health care use, demographics or anthropometry, and respiratory factors were the most predictive of PASC diagnosis.
    CONCLUSIONS: The methods outlined here provide an open-source, applied example of using Super Learner to predict PASC status using electronic health record data, which can be replicated across a variety of settings. Across individual predictors and clinical domains, we consistently found that factors related to health care use were the strongest predictors of PASC diagnosis. This indicates that any observational studies using PASC diagnosis as a primary outcome must rigorously account for heterogeneous health care use. Our temporal findings support the hypothesis that clinicians may be able to accurately assess the risk of PASC in patients before acute COVID-19 diagnosis, which could improve early interventions and preventive care. Our findings also highlight the importance of respiratory characteristics in PASC risk assessment.
    UNASSIGNED: RR2-10.1101/2023.07.27.23293272.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    突触连接定义了在特定功能任务期间参与相关活动的神经元组。这些神经元的协同群形成集合,参与的作战单位,例如,感官知觉,运动协调和记忆(然后称为全写)。传统上,集合形成被认为是通过长期增强(LTP)作为可塑性机制来加强突触连接而发生的。这种突触记忆理论源于Hebb制定的学习规则,与许多实验观察结果一致。这里,我们提议,作为替代,神经元的内在兴奋性和可塑性构成了第二个,非突触机制,这对合奏的初始形成可能很重要。的确,在行为学习之后,在多个大脑区域广泛观察到增强的神经兴奋性。在皮质结构和杏仁核中,兴奋性变化通常被报告为短暂的,即使它们可以持续几十分钟到几天。也许正是出于这个原因,它们传统上被认为是调制的,仅通过促进LTP诱导来支持集合形成,没有进一步参与记忆功能(记忆分配假设)。我们在这里建议-基于两条线的证据-除了调节LTP分配,增强的兴奋性在学习中起着更根本的作用。首先,增强的兴奋性构成了活跃合奏的标志,由于它,在没有突触可塑性的情况下,亚阈值突触连接变为超阈值(冰山模型)。第二,增强的兴奋性促进树突状电位向体细胞的传播,并允许增强EPSP振幅(LTP)与尖峰输出的耦合(从而增强整体参与)。这个许可门模型描述了永久增加兴奋性的需求,这似乎与它作为一种短暂机制的传统考虑相矛盾。我们建议通过低阈值的内在可塑性诱导,可以对兴奋性进行更长的修改。这表明兴奋性可能会在短时间间隔内进行开/关调节。与此一致,在小脑浦肯野细胞中,兴奋性持续几天到几周,这表明在某些电路中,该现象的持续时间首先不是限制因素。在我们的模型中,突触可塑性定义了神经元通过嵌入的连接网络接收的信息内容。然而,细胞自主兴奋性的可塑性可以动态调节单个神经元的集合参与以及集合的整体活动状态。
    Synaptic connectivity defines groups of neurons that engage in correlated activity during specific functional tasks. These co-active groups of neurons form ensembles, the operational units involved in, for example, sensory perception, motor coordination and memory (then called an engram). Traditionally, ensemble formation has been thought to occur via strengthening of synaptic connections via long-term potentiation (LTP) as a plasticity mechanism. This synaptic theory of memory arises from the learning rules formulated by Hebb and is consistent with many experimental observations. Here, we propose, as an alternative, that the intrinsic excitability of neurons and its plasticity constitute a second, non-synaptic mechanism that could be important for the initial formation of ensembles. Indeed, enhanced neural excitability is widely observed in multiple brain areas subsequent to behavioral learning. In cortical structures and the amygdala, excitability changes are often reported as transient, even though they can last tens of minutes to a few days. Perhaps it is for this reason that they have been traditionally considered as modulatory, merely supporting ensemble formation by facilitating LTP induction, without further involvement in memory function (memory allocation hypothesis). We here suggest-based on two lines of evidence-that beyond modulating LTP allocation, enhanced excitability plays a more fundamental role in learning. First, enhanced excitability constitutes a signature of active ensembles and, due to it, subthreshold synaptic connections become suprathreshold in the absence of synaptic plasticity (iceberg model). Second, enhanced excitability promotes the propagation of dendritic potentials toward the soma and allows for enhanced coupling of EPSP amplitude (LTP) to the spike output (and thus ensemble participation). This permissive gate model describes a need for permanently increased excitability, which seems at odds with its traditional consideration as a short-lived mechanism. We propose that longer modifications in excitability are made possible by a low threshold for intrinsic plasticity induction, suggesting that excitability might be on/off-modulated at short intervals. Consistent with this, in cerebellar Purkinje cells, excitability lasts days to weeks, which shows that in some circuits the duration of the phenomenon is not a limiting factor in the first place. In our model, synaptic plasticity defines the information content received by neurons through the connectivity network that they are embedded in. However, the plasticity of cell-autonomous excitability could dynamically regulate the ensemble participation of individual neurons as well as the overall activity state of an ensemble.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    极端天气事件,比如那些与风和降水有关的,导致每年数十亿欧元的损失。虽然在次大陆尺度上已经发现了由于全球变暖导致的极端降水的变化,它们复杂的特点使它们在更大的区域范围内成为评估的挑战。由于对全球变暖的动态响应变化显示出高度的不确定性,极端风提出了更大的挑战。这种情况因局部尺度与地形的相互作用而变得复杂,城市,陆海对比,等。此处提供的数据集试图解决这些挑战,并提供可以对极端风和降水(最多五天降水)进行可靠评估的信息。我们通过利用高分辨率(12公里)EURO-CORDEX模拟的大型集成(52名成员)来实现这一目标。数据集将是有价值的,不仅是科学界,但也包括公众中的从业者(例如,市政规划师,政府机构)和私营部门(例如,保险公司和再保险公司)。
    Extreme weather events, such as those associated with winds and precipitation, result in billions of euros in damages annually. While changes in extreme precipitation due to global warming have already been detected at sub-continental scales, their complex characteristics make them a challenges to asses at more regional scales. Extreme winds present an even greater challenge as the varying dynamical response to global warming exhibits high levels of uncertainty. This situation is complicated by local scale interactions with orography, cities, land-sea contrasts, etc. The dataset presented here attempts to address these challenges and provide information that will allow robust assessment of extreme winds and precipitation (maximum five day precipitation). We achieve this by leveraging a large ensemble (52 members) of high resolution (12 km) EURO-CORDEX simulations. The dataset will be of value, not only to the scientific community, but also practitioners in the public (e.g., municipal planners, government agencies) and private sectors (e.g., insurers and reinsurers).
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    分析基因组序列在理解生物多样性和分类竹子物种中起着至关重要的作用。用于基因组序列分析的现有方法受到诸如复杂性、精度低,以及需要不断重新配置以响应不断发展的基因组数据集。
    这项研究通过引入一种新颖的基于双启发式特征选择的集成分类模型(DHFS-ECM)来解决这些限制,以从基因组序列中精确识别竹子物种。
    所提出的DHFS-ECM方法采用遗传算法来执行双启发式特征选择。这个过程最大化了类间方差,导致选择信息N-gram特征集。随后,类内方差水平用于创建最佳训练集和验证集,确保全面覆盖特定类别的功能。然后通过集成分类层处理选定的特征,结合多个分层模型进行特定物种分类。
    与最新方法的比较分析表明,DHFS-ECM在准确性方面取得了显着提高(9.5%),精度(5.9%),召回(8.5%),和AUC表现(4.5%)。重要的是,由于双重启发式遗传算法模型促进的连续学习,该模型即使在物种类别数量增加的情况下也能保持其性能。
    DHFS-ECM提供了几个关键优势,包括高效的特征提取,降低模型复杂性,增强的可解释性,并通过集成分类层增加了鲁棒性和准确性。这些属性使DHFS-ECM成为实时临床应用的有前途的工具,并对基因组序列分析领域做出了有价值的贡献。
    UNASSIGNED: Analyzing genomic sequences plays a crucial role in understanding biological diversity and classifying Bamboo species. Existing methods for genomic sequence analysis suffer from limitations such as complexity, low accuracy, and the need for constant reconfiguration in response to evolving genomic datasets.
    UNASSIGNED: This study addresses these limitations by introducing a novel Dual Heuristic Feature Selection-based Ensemble Classification Model (DHFS-ECM) for the precise identification of Bamboo species from genomic sequences.
    UNASSIGNED: The proposed DHFS-ECM method employs a Genetic Algorithm to perform dual heuristic feature selection. This process maximizes inter-class variance, leading to the selection of informative N-gram feature sets. Subsequently, intra-class variance levels are used to create optimal training and validation sets, ensuring comprehensive coverage of class-specific features. The selected features are then processed through an ensemble classification layer, combining multiple stratification models for species-specific categorization.
    UNASSIGNED: Comparative analysis with state-of-the-art methods demonstrate that DHFS-ECM achieves remarkable improvements in accuracy (9.5%), precision (5.9%), recall (8.5%), and AUC performance (4.5%). Importantly, the model maintains its performance even with an increased number of species classes due to the continuous learning facilitated by the Dual Heuristic Genetic Algorithm Model.
    UNASSIGNED: DHFS-ECM offers several key advantages, including efficient feature extraction, reduced model complexity, enhanced interpretability, and increased robustness and accuracy through the ensemble classification layer. These attributes make DHFS-ECM a promising tool for real-time clinical applications and a valuable contribution to the field of genomic sequence analysis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在2022-2023年前所未有的水痘流行期间,近实时的短期预测疫情的轨迹是至关重要的干预实施和指导政策。然而,随着案件数量大幅下降,评估模型性能对于推进疫情预测领域至关重要。使用来自疾病控制和预防中心和我们的世界数据团队的实验室确认的水痘病例数据,我们生成了巴西的回顾性连续每周预测,加拿大,法国,德国,西班牙,联合王国,美国和全球范围内使用自回归综合移动平均(ARIMA)模型,广义加法模型,简单线性回归,Facebook的先知模式,以及子流行病波和n子流行病建模框架。我们使用平均均方误差评估预测性能,平均绝对误差,加权区间分数,95%预测区间覆盖率,技能分数和温克勒分数。总的来说,在大多数地点和预测范围内,n-sub流行病建模框架胜过其他模型,未加权的合奏模型表现最频繁。相对于所有性能指标的ARIMA模型(大于10%),n-sub流行病和空间波框架在平均预测性能上有了显着提高。调查结果进一步支持用于短期预测新出现和重新出现的传染病流行的次流行框架。
    During the 2022-2023 unprecedented mpox epidemic, near real-time short-term forecasts of the epidemic\'s trajectory were essential in intervention implementation and guiding policy. However, as case levels have significantly decreased, evaluating model performance is vital to advancing the field of epidemic forecasting. Using laboratory-confirmed mpox case data from the Centers for Disease Control and Prevention and Our World in Data teams, we generated retrospective sequential weekly forecasts for Brazil, Canada, France, Germany, Spain, the United Kingdom, the United States and at the global scale using an auto-regressive integrated moving average (ARIMA) model, generalized additive model, simple linear regression, Facebook\'s Prophet model, as well as the sub-epidemic wave and n-sub-epidemic modelling frameworks. We assessed forecast performance using average mean squared error, mean absolute error, weighted interval scores, 95% prediction interval coverage, skill scores and Winkler scores. Overall, the n-sub-epidemic modelling framework outcompeted other models across most locations and forecasting horizons, with the unweighted ensemble model performing best most frequently. The n-sub-epidemic and spatial-wave frameworks considerably improved in average forecasting performance relative to the ARIMA model (greater than 10%) for all performance metrics. Findings further support sub-epidemic frameworks for short-term forecasting epidemics of emerging and re-emerging infectious diseases.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    尽管在基于深度学习的表面缺陷检测方面取得了许多成功,该行业在进行包装缺陷检查方面仍然面临挑战,这些检查包括成分清单等关键信息。特别是,虽然以前的成就主要集中在高质量图像中的缺陷检查,他们不考虑在低质量图像(如包含图像模糊的图像)中进行缺陷检查。为了解决这个问题,我们提出了一种高尚的推理技术,称为时间质量集成(TQE),它结合了时间和质量权重。时间加权通过考虑与观察图像相关的定时来向输入图像分配权重。质量权重优先考虑高质量图像,以确保推理过程强调清晰可靠的输入图像。这两个权重提高了低质量图像推断过程的准确性和可靠性。此外,为了通过实验评估TQE的一般适用性,我们采用了广泛使用的卷积神经网络(CNN),如ResNet-34、EfficientNet、ECAEfficientNet,GoogLeNet,和ShuffleNetV2作为骨干网络。总之,考虑到至少包括一个低质量图像的情况,TQE的F1得分比使用单个CNN模型高约17.64%至22.41%,比平均投票集合高约1.86%至2.06%。
    Despite achieving numerous successes with surface defect inspection based on deep learning, the industry still faces challenges in conducting packaging defect inspections that include critical information such as ingredient lists. In particular, while previous achievements primarily focus on defect inspection in high-quality images, they do not consider defect inspection in low-quality images such as those containing image blur. To address this issue, we proposed a noble inference technique named temporal-quality ensemble (TQE), which combines temporal and quality weights. Temporal weighting assigns weights to input images by considering the timing in relation to the observed image. Quality weight prioritizes high-quality images to ensure the inference process emphasizes clear and reliable input images. These two weights improve both the accuracy and reliability of the inference process of low-quality images. In addition, to experimentally evaluate the general applicability of TQE, we adopt widely used convolutional neural networks (CNNs) such as ResNet-34, EfficientNet, ECAEfficientNet, GoogLeNet, and ShuffleNetV2 as the backbone network. In conclusion, considering cases where at least one low-quality image is included, TQE has an F1-score approximately 17.64% to 22.41% higher than using single CNN models and about 1.86% to 2.06% higher than an average voting ensemble.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    亚季节性降雨预测技能对于支持应对极端水文气象的准备至关重要。我们评估一个过程知情的评估,根据预测模型成员代表潜在降雨预测因子的能力对预测模型成员进行子样本,可以改善中美洲下个月的月度降雨量预测,使用哥斯达黎加和危地马拉作为测试用例。我们通过对C3S多模型集合中的五个动态预测模型中的130个成员进行二次采样,基于它们对(a)纬向风向和(b)太平洋和大西洋海表温度(SST)的表示,来生成约束集合均值。在初始化时。我们的结果表明,在多个月和位置中,均方误差技能增加了0.4,并提高了极端降雨的检出率。该方法可转移到由缓慢变化的过程驱动的其他区域。过程信息抽样是成功的,因为它可以识别出风/SST误差增加时无法代表整个降雨分布的成员。
    Subseasonal rainfall forecast skill is critical to support preparedness for hydrometeorological extremes. We assess how a process-informed evaluation, which subsamples forecasting model members based on their ability to represent potential predictors of rainfall, can improve monthly rainfall forecasts within Central America in the following month, using Costa Rica and Guatemala as test cases. We generate a constrained ensemble mean by subsampling 130 members from five dynamic forecasting models in the C3S multimodel ensemble based on their representation of both (a) zonal wind direction and (b) Pacific and Atlantic sea surface temperatures (SSTs), at the time of initialization. Our results show in multiple months and locations increased mean squared error skill by 0.4 and improved detection rates of rainfall extremes. This method is transferrable to other regions driven by slowly-changing processes. Process-informed subsampling is successful because it identifies members that fail to represent the entire rainfall distribution when wind/SST error increases.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    智能技术和预测模型在监狱中的融合为改革囚犯行为监测提供了一个令人兴奋的机会,以便及早发现痛苦的迹象并有效减轻自杀风险。虽然机器学习算法已被广泛用于预测自杀行为,经常被忽视的一个关键方面是这些模型的互操作性。在自杀预测的模型解释上所做的大多数工作通常将其自身限制为减少特征,并仅突出重要的贡献特征。为了解决这个研究空白,我们使用锚解释来创建基于简单规则的人类可读的语句,which,根据我们的知识,以前从未用于自杀预测模型。我们还克服了锚解释的局限性,在高维数据集上创建弱规则,首先在SHapley加法扩张(SHAP)的帮助下减少数据特征。我们通过对XGBoost和随机森林的最终集成模型的锚解释进一步减少了数据特征。我们的结果表明,与最先进的模型相比,有了显著的改进,准确度和精密度分别为98.6%和98.9%,分别。最佳自杀意念模型的F1得分似乎为96.7%。
    The convergence of smart technologies and predictive modelling in prisons presents an exciting opportunity to revolutionize the monitoring of inmate behaviour, allowing for the early detection of signs of distress and the effective mitigation of suicide risks. While machine learning algorithms have been extensively employed in predicting suicidal behaviour, a critical aspect that has often been overlooked is the interoperability of these models. Most of the work done on model interpretations for suicide predictions often limits itself to feature reduction and highlighting important contributing features only. To address this research gap, we used Anchor explanations for creating human-readable statements based on simple rules, which, to our knowledge, have never been used before for suicide prediction models. We also overcome the limitation of anchor explanations, which create weak rules on high-dimensionality datasets, by first reducing data features with the help of SHapley Additive exPlanations (SHAP). We further reduce data features through anchor interpretations for the final ensemble model of XGBoost and random forest. Our results indicate significant improvement when compared with state-of-the-art models, having an accuracy and precision of 98.6% and 98.9%, respectively. The F1-score for the best suicide ideation model appeared to be 96.7%.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    前扣带皮质(ACC)活动对于需要随着时间的推移整合多种体验的能力的手术很重要,比如规则学习,认知灵活性,工作记忆,和长期记忆回忆。为了阐明这一点,我们分析了大鼠在一小时的疗程中重复相同行为时的神经元活动,以研究活动随时间的变化。当大鼠在三个不同的响应端口(n=5)执行无决策操作任务时,我们记录了神经元集合。神经元状态空间分析显示,行为的每次重复都是不同的,最近的行为比时间间隔更远的行为更相似。ACC活动主要是缓慢的,神经状态空间的低维表示与行为节奏一致的逐渐变化。时间进展,或漂移,在每个会议的最高主要组成部分上都很明显,并且是由经验的积累而不是内部时钟驱动的。值得注意的是,这些信号在受试者之间是一致的,允许我们根据不同动物的数据训练的模型准确预测试验次数。我们观察到,在延长的持续时间(数十分钟)内,非连续的斜坡点火率驱动了低维集合表示。40%的ACC神经元放电在一系列试验长度上倾斜,并且持续时间较短的倾斜神经元的组合产生了追踪较长持续时间的集合。这些发现为ACC提供了有价值的见解,在合奏级别,通过反映长期经验的积累来传达时间信息。
    Anterior cingulate cortex (ACC) activity is important for operations that require the ability to integrate multiple experiences over time, such as rule learning, cognitive flexibility, working memory, and long-term memory recall. To shed light on this, we analyzed neuronal activity while rats repeated the same behaviors during hour-long sessions to investigate how activity changed over time. We recorded neuronal ensembles as rats performed a decision-free operant task with varying reward likelihoods at three different response ports (n = 5). Neuronal state space analysis revealed that each repetition of a behavior was distinct, with more recent behaviors more similar than those further apart in time. ACC activity was dominated by a slow, gradual change in low-dimensional representations of neural state space aligning with the pace of behavior. Temporal progression, or drift, was apparent on the top principal component for every session and was driven by the accumulation of experiences and not an internal clock. Notably, these signals were consistent across subjects, allowing us to accurately predict trial numbers based on a model trained on data from a different animal. We observed that non-continuous ramping firing rates over extended durations (tens of minutes) drove the low-dimensional ensemble representations. 40% of ACC neurons\' firing ramped over a range of trial lengths and combinations of shorter duration ramping neurons created ensembles that tracked longer durations. These findings provide valuable insights into how the ACC, at an ensemble level, conveys temporal information by reflecting the accumulation of experiences over extended periods.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    乳腺癌仍然是一个紧迫的全球健康问题,需要进行准确的诊断才能进行有效的干预。深度学习模型(AlexNet、ResNet-50,VGG16,GoogLeNet)显示出显着的微钙化鉴定(>90%)。然而,不同的架构和方法带来了挑战。我们提出了一个集成模型,合并独特的视角,提高精度,了解乳腺癌干预的关键因素。评估支持GoogleNet和ResNet-50,推动他们选择组合功能,确保提高精度,和临床环境中微钙化检测的可靠性。
    本研究提出了一个全面的乳房X线照片预处理框架,使用优化的深度学习集成方法。所提出的框架从使用Otsu分割和形态学操作的伪影去除开始。后续步骤包括调整图像大小,自适应中值滤波,以及通过ResNet-50模型的迁移学习开发深度卷积神经网络(D-CNN)。超参数优化,和集成优化(AlexNet,GoogLeNet,VGG16,ResNet-50)用于识别微钙化的局部区域。严格的评估方案验证了单个模型的有效性,最终在集成模型中展示出较高的预测准确性。
    根据我们的分析,提出的集成模型在微钙化分类中表现出卓越的性能。模型的平均置信度得分证明了这一点,这表明在区分这些关键特性方面具有高度的可靠性和确定性。所提出的模型在微钙化的分类中表现出值得注意的平均置信水平为0.9305。优于替代模型,并为模型的可靠性提供了实质性的见解。集成模型对正常病例分类的平均置信度为0.8859,这增强了模型的一致性和可靠性预测。此外,集成模型在准确性方面获得了非常高的性能,精度,召回,F1分数,和曲线下面积(AUC)。
    所提出的模型对数据集进行了彻底的整合,并专注于班级内的平均置信度等级,从而提高了乳腺癌的临床诊断准确性和有效性。这项研究引入了一种新颖的方法,利用集成模型和严格的评估标准,大大提高了乳腺癌诊断的准确性和可靠性。特别是在微钙化的检测中。
    UNASSIGNED: Breast cancer remains a pressing global health concern, necessitating accurate diagnostics for effective interventions. Deep learning models (AlexNet, ResNet-50, VGG16, GoogLeNet) show remarkable microcalcification identification (>90%). However, distinct architectures and methodologies pose challenges. We propose an ensemble model, merging unique perspectives, enhancing precision, and understanding critical factors for breast cancer intervention. Evaluation favors GoogleNet and ResNet-50, driving their selection for combined functionalities, ensuring improved precision, and dependability in microcalcification detection in clinical settings.
    UNASSIGNED: This study presents a comprehensive mammogram preprocessing framework using an optimized deep learning ensemble approach. The proposed framework begins with artifact removal using Otsu Segmentation and morphological operation. Subsequent steps include image resizing, adaptive median filtering, and deep convolutional neural network (D-CNN) development via transfer learning with ResNet-50 model. Hyperparameters are optimized, and ensemble optimization (AlexNet, GoogLeNet, VGG16, ResNet-50) are constructed to identify the localized area of microcalcification. Rigorous evaluation protocol validates the efficacy of individual models, culminating in the ensemble model demonstrating superior predictive accuracy.
    UNASSIGNED: Based on our analysis, the proposed ensemble model exhibited exceptional performance in the classification of microcalcifications. This was evidenced by the model\'s average confidence score, which indicated a high degree of dependability and certainty in differentiating these critical characteristics. The proposed model demonstrated a noteworthy average confidence level of 0.9305 in the classification of microcalcification, outperforming alternative models and providing substantial insights into the dependability of the model. The average confidence of the ensemble model in classifying normal cases was 0.8859, which strengthened the model\'s consistent and dependable predictions. In addition, the ensemble models attained remarkably high performances in terms of accuracy, precision, recall, F1-score, and area under the curve (AUC).
    UNASSIGNED: The proposed model\'s thorough dataset integration and focus on average confidence ratings within classes improve clinical diagnosis accuracy and effectiveness for breast cancer. This study introduces a novel methodology that takes advantage of an ensemble model and rigorous evaluation standards to substantially improve the accuracy and dependability of breast cancer diagnostics, specifically in the detection of microcalcifications.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号