Convolutional neural network (CNN)

卷积神经网络 (CNN)
  • 文章类型: Journal Article
    砂质白云岩是一种分布广泛的岩石。砂质白云岩的单轴抗压强度(UCS)是土木工程应用中的一个重要指标。岩土工程,地下工程。直接测量UCS的成本很高,耗时,在某些情况下甚至不可行。为了解决这个问题,建立了基于卷积神经网络(CNN)和回归分析(RA)的间接测量方法。新方法是直接和有效的UCS预测,并具有重大的实际意义。为了评估新方法的性能,收集了158个不同沙化等级的白云石样品,用于在云南省云南中部引水(CYWD)工程玉溪段及其附近进行UCS测试。中国西南部。根据RA结果建立了两个相关系数高的回归方程,来预测沙质白云岩的UCS。此外,通过施密特锤回弹试验确定了沙质白云岩的最小厚度。结果表明,CNN在预测沙质白云岩UCS的精度方面优于RA。此外,CNN可以有效地处理测试结果中的不确定性,使其成为预测沙质白云岩UCS的最有效工具之一。
    Sandy Dolomite is a kind of widely distributed rock. The uniaxial compressive strength (UCS) of Sandy Dolomite is an important metric in the application in civil engineering, geotechnical engineering, and underground engineering. Direct measurement of UCS is costly, time-consuming, and even infeasible in some cases. To address this problem, we establish an indirect measuring method based on the convolutional neural network (CNN) and regression analysis (RA). The new method is straightforward and effective for UCS prediction, and has significant practical implications. To evaluate the performance of the new method, 158 dolomite samples of different sandification grades are collected for testing their UCS along and near the Yuxi section of the Central Yunnan Water Diversion (CYWD) Project in Yunnan Province, Southwest of China. Two regression equations with high correlation coefficients are established according to the RA results, to predict the UCS of Sandy Dolomites. Moreover, the minimum thickness of Sandy Dolomite was determined by the Schmidt hammer rebound test. Results show that CNN outperforms RA in terms of prediction the precision of Sandy Dolomite UCS. In addition, CNN can effectively deal with uncertainty in test results, making it one of the most effective tools for predicting the UCS of Sandy Dolomite.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    2020年,我们中心使用卷积神经网络(CNN)建立了Tanner-Whitehouse3(TW3)人工智能(AI)系统,它建立在9059张射线照片上。然而,系统,我们的研究基于此,缺乏比较的黄金标准,并且没有在不同的工作环境中进行彻底的评估。
    为了进一步验证AI系统在临床骨龄评估(BAA)中的适用性,并增强BAA的准确性和同质性,进行了前瞻性多中心验证.这项研究利用了744例患者的左手X光片,从1岁到20岁不等,378个男孩和366个女孩。这些X光片是在2020年8月至12月期间从9家不同的儿童医院获得的。BAA使用TW3AI系统进行,并由经验丰富的审稿人进行审查。1年内骨龄准确度,均方根误差(RMSE),和平均绝对误差(MAE)进行统计计算,以评估准确性。进行Kappa检验和Bland-Altman(B-A)图测量诊断一致性。
    系统表现出高水平的性能,产生与审稿人密切相关的结果。它实现了0.52年的RMSE和94.55%的半径精度,尺骨,短骨系列。在评估腕骨系列时,该系统实现了0.85年的RMSE和80.38%的精度。总的来说,该系统显示了令人满意的精度和RMSE,特别是7岁以上的患者。该系统在评估1-6岁患者的腕骨年龄方面表现出色。Kappa检验和B-A图都证明了系统和审稿人之间的一致性,尽管该模型在不断区分特定骨骼方面遇到了挑战,比如capetate.此外,该系统的性能在不同性别和年龄组被证明是可以接受的,以及射线照相仪器。
    在这个多中心验证中,该系统展示了其提高健康分娩效率和一致性的潜力,最终改善患者预后并降低医疗成本。
    UNASSIGNED: In 2020, our center established a Tanner-Whitehouse 3 (TW3) artificial intelligence (AI) system using a convolutional neural network (CNN), which was built upon 9059 radiographs. However, the system, upon which our study is based, lacked a gold standard for comparison and had not undergone thorough evaluation in different working environments.
    UNASSIGNED: To further verify the applicability of the AI system in clinical bone age assessment (BAA) and to enhance the accuracy and homogeneity of BAA, a prospective multi-center validation was conducted. This study utilized 744 left-hand radiographs of patients, ranging from 1 to 20 years of age, with 378 boys and 366 girls. These radiographs were obtained from nine different children\'s hospitals between August and December 2020. The BAAs were performed using the TW3 AI system and were also reviewed by experienced reviewers. Bone age accuracy within 1 year, root mean square error (RMSE), and mean absolute error (MAE) were statistically calculated to evaluate the accuracy. Kappa test and Bland-Altman (B-A) plot were conducted to measure the diagnostic consistency.
    UNASSIGNED: The system exhibited a high level of performance, producing results that closely aligned with those of the reviewers. It achieved a RMSE of 0.52 years and an accuracy of 94.55% for the radius, ulna, and short bones series. When assessing the carpal series of bones, the system achieved a RMSE of 0.85 years and an accuracy of 80.38%. Overall, the system displayed satisfactory accuracy and RMSE, particularly in patients over 7 years old. The system excelled in evaluating the carpal bone age of patients aged 1-6. Both the Kappa test and B-A plot demonstrated substantial consistency between the system and the reviewers, although the model encountered challenges in consistently distinguishing specific bones, such as the capitate. Furthermore, the system\'s performance proved acceptable across different genders and age groups, as well as radiography instruments.
    UNASSIGNED: In this multi-center validation, the system showcased its potential to enhance the efficiency and consistency of healthy delivery, ultimately resulting in improved patient outcomes and reduced healthcare costs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的开发一种完全自动化的设备和序列无关的卷积神经网络(CNN),用于可靠和高通量的异构标记,非结构化MRI数据。材料与方法回顾,多中心脑MRI数据(2179例胶质母细胞瘤患者,8544次考试,来自249家医院和29种扫描仪类型的63327序列)用于开发基于ResNet-18架构的网络,以区分9种MRI序列类型,包括T1加权,对比后T1加权,T2加权,流体衰减反转恢复,磁化率加权,表观扩散系数,扩散加权(低和高b值),和梯度召回回波T2*加权和动态磁化率对比相关图像。来自每个序列的二维中段图像被分配给训练或验证(大约80%)和测试(大约20%)使用分层分割,以确保跨机构的平衡组,病人,和MRI序列类型。对每种序列类型的预测精度进行了量化,模型性能的亚组比较采用χ2检验。结果在测试集上,CNN(ResNet-18)集成模型在所有序列类型中的总体准确度为97.9%(95%CI:97.6,98.1),从敏感性加权图像的84.2%(95%CI:81.8,86.6)到T2加权图像的99.8%(95%CI:99.7,99.9)。与ResNet-50相比,ResNet-18模型的精度明显更好,尽管其结构更简单(97.9%vs97.1%;P≤.001)。ResNet-18模型的准确性不受任何序列类型的二维中段图像上肿瘤的存在与不存在的影响(P>.05)。结论发达的CNN(www.github.com/neuroAI-HD/HD-SEQ-ID)可靠地区分多中心和大规模人群神经影像学数据中的九种类型的MRI序列,并可能提高速度,准确度,临床和研究神经放射工作流程的效率。关键词:磁共振成像,神经网络,CNS,脑/脑干,计算机应用-一般(信息学),卷积神经网络(CNN)深度学习算法,机器学习算法补充材料可用于本文。©RSNA,2023年。
    Purpose To develop a fully automated device- and sequence-independent convolutional neural network (CNN) for reliable and high-throughput labeling of heterogeneous, unstructured MRI data. Materials and Methods Retrospective, multicentric brain MRI data (2179 patients with glioblastoma, 8544 examinations, 63 327 sequences) from 249 hospitals and 29 scanner types were used to develop a network based on ResNet-18 architecture to differentiate nine MRI sequence types, including T1-weighted, postcontrast T1-weighted, T2-weighted, fluid-attenuated inversion recovery, susceptibility-weighted, apparent diffusion coefficient, diffusion-weighted (low and high b value), and gradient-recalled echo T2*-weighted and dynamic susceptibility contrast-related images. The two-dimensional-midsection images from each sequence were allocated to training or validation (approximately 80%) and testing (approximately 20%) using a stratified split to ensure balanced groups across institutions, patients, and MRI sequence types. The prediction accuracy was quantified for each sequence type, and subgroup comparison of model performance was performed using χ2 tests. Results On the test set, the overall accuracy of the CNN (ResNet-18) ensemble model among all sequence types was 97.9% (95% CI: 97.6, 98.1), ranging from 84.2% for susceptibility-weighted images (95% CI: 81.8, 86.6) to 99.8% for T2-weighted images (95% CI: 99.7, 99.9). The ResNet-18 model achieved significantly better accuracy compared with ResNet-50 despite its simpler architecture (97.9% vs 97.1%; P ≤ .001). The accuracy of the ResNet-18 model was not affected by the presence versus absence of tumor on the two-dimensional-midsection images for any sequence type (P > .05). Conclusion The developed CNN (www.github.com/neuroAI-HD/HD-SEQ-ID) reliably differentiates nine types of MRI sequences within multicenter and large-scale population neuroimaging data and may enhance the speed, accuracy, and efficiency of clinical and research neuroradiologic workflows. Keywords: MR-Imaging, Neural Networks, CNS, Brain/Brain Stem, Computer Applications-General (Informatics), Convolutional Neural Network (CNN), Deep Learning Algorithms, Machine Learning Algorithms Supplemental material is available for this article. © RSNA, 2023.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    尽管衡量工人的生产率至关重要,测量每个工人的生产率是具有挑战性的,因为他们分散在不同的建筑工地。本文提出了一种基于惯性测量单元(IMU)和活动分类的生产率测量框架。利用两种深度学习算法和三种传感器组合来识别和分析该框架在砌体工作中的可行性。使用所提出的方法,使用具有多个传感器的卷积神经网络模型,可以以96.70%的最大准确度进行工人活动分类,使用带有单个传感器的长短期记忆(LSTM)模型,最低精度为72.11%。生产率的测量精度高达96.47%。这项研究的主要贡献是提出了一种对详细活动进行分类的方法,并探索了用于测量工人生产率的IMU传感器数量的影响。
    Although measuring worker productivity is crucial, the measurement of the productivity of each worker is challenging due to their dispersion across various construction jobsites. This paper presents a framework for measuring productivity based on an inertial measurement unit (IMU) and activity classification. Two deep learning algorithms and three sensor combinations were utilized to identify and analyze the feasibility of the framework in masonry work. Using the proposed method, worker activity classification could be performed with a maximum accuracy of 96.70% using the convolutional neural network model with multiple sensors, and a minimum accuracy of 72.11% using the long short-term memory (LSTM) model with a single sensor. Productivity could be measured with an accuracy of up to 96.47%. The main contributions of this study are the proposal of a method for classifying detailed activities and an exploration of the effect of the number of IMU sensors used in measuring worker productivity.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在本文中,提出了一种结构健康监测(SHM)系统,以提供自动预警,以早期检测损坏及其在复合管道中的位置。该研究考虑了具有嵌入式光纤布拉格光栅(FBG)传感系统的玄武岩纤维增强聚合物(BFRP)管道,并首先讨论了结合FBG传感器以准确检测管道中损坏信息的缺点和挑战。这项研究的新颖性和主要重点是,然而,提出的方法依赖于设计集成的传感诊断SHM系统,该系统能够通过实施基于人工智能(AI)的算法,结合深度学习和其他有效的机器学习方法,在早期阶段检测复合管道中的损坏。使用增强型卷积神经网络(ECNN),无需重新训练模型。所提出的架构通过用于推理的k-最近邻(k-NN)算法来替换softmax层。有限元模型是通过损伤测试下的管道测量结果开发和校准的。然后,该模型用于评估管道在内部压力载荷下和由于爆破引起的压力变化下的应变分布模式。并找到轴向和周向不同位置处的应变关系。还开发了一种使用分布式应变模式的管道损伤机制预测算法。ECNN的设计和训练是为了识别管道劣化的状况,因此可以检测到损坏的开始。来自当前方法的应变结果与文献中可用的实验结果显示出极好的一致性。ECNN数据与FBG传感器数据的平均误差为0.093%,从而证实了该方法的可靠性和准确性。提出的ECNN以93.33%的精度(P%)实现高性能,91.18%的回归率(R%)和90.54%的F1评分(F%)。
    In this paper, a structural health monitoring (SHM) system is proposed to provide automatic early warning for detecting damage and its location in composite pipelines at an early stage. The study considers a basalt fiber reinforced polymer (BFRP) pipeline with an embedded Fiber Bragg grating (FBG) sensory system and first discusses the shortcomings and challenges with incorporating FBG sensors for accurate detection of damage information in pipelines. The novelty and the main focus of this study is, however, a proposed approach that relies on designing an integrated sensing-diagnostic SHM system that has the capability to detect damage in composite pipelines at an early stage via implementation of an artificial intelligence (AI)-based algorithm combining deep learning and other efficient machine learning methods using an Enhanced Convolutional Neural Network (ECNN) without retraining the model. The proposed architecture replaces the softmax layer by a k-Nearest Neighbor (k-NN) algorithm for inference. Finite element models are developed and calibrated by the results of pipe measurements under damage tests. The models are then used to assess the patterns of the strain distributions of the pipeline under internal pressure loading and under pressure changes due to bursts, and to find the relationship of strains at different locations axially and circumferentially. A prediction algorithm for pipe damage mechanisms using distributed strain patterns is also developed. The ECNN is designed and trained to identify the condition of pipe deterioration so the initiation of damage can be detected. The strain results from the current method and the available experimental results in the literature show excellent agreement. The average error between the ECNN data and FBG sensor data is 0.093%, thus confirming the reliability and accuracy of the proposed method. The proposed ECNN achieves high performance with 93.33% accuracy (P%), 91.18% regression rate (R%) and a 90.54% F1-score (F%).
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    图像识别和神经影像学越来越多地用于了解阿尔茨海默病(AD)的进展。然而,单光子发射计算机断层扫描(SPECT)的图像数据有限。医学图像分析需要大量的,标记的训练数据集。因此,研究集中在克服这个问题上。在这项研究中,比较了五种卷积神经网络(CNN)模型(MobileNetV2和NASNetMobile(轻量级模型);VGG16,InceptionV3和ResNet(较重权重模型))在医学图像上的检测性能,以建立流行病学研究的分类模型。收集了99名受试者的脑部扫描图像数据,和4711图像被使用。人口统计学数据使用卡方检验和单向方差分析与Bonferroni的事后检验进行比较。使用准确性和损失函数来评估CNN模型的性能。临床痴呆等级(CDR)为2的受试者的认知能力筛查工具和微型精神状态考试成绩明显低于CDR为1或0.5的受试者。这项研究分析了各种CNN模型对医学图像的分类性能,并证明了迁移学习在识别轻度认知障碍方面的有效性。轻度AD,和基于SPECT图像的中度AD评分。
    Image recognition and neuroimaging are increasingly being used to understand the progression of Alzheimer\'s disease (AD). However, image data from single-photon emission computed tomography (SPECT) are limited. Medical image analysis requires large, labeled training datasets. Therefore, studies have focused on overcoming this problem. In this study, the detection performance of five convolutional neural network (CNN) models (MobileNet V2 and NASNetMobile (lightweight models); VGG16, Inception V3, and ResNet (heavier weight models)) on medical images was compared to establish a classification model for epidemiological research. Brain scan image data were collected from 99 subjects, and 4711 images were used. Demographic data were compared using the chi-squared test and one-way analysis of variance with Bonferroni\'s post hoc test. Accuracy and loss functions were used to evaluate the performance of CNN models. The cognitive abilities screening instrument and mini mental state exam scores of subjects with a clinical dementia rating (CDR) of 2 were considerably lower than those of subjects with a CDR of 1 or 0.5. This study analyzed the classification performance of various CNN models for medical images and proved the effectiveness of transfer learning in identifying the mild cognitive impairment, mild AD, and moderate AD scoring based on SPECT images.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    深度学习对象检测模型已成功应用于开发计算机辅助诊断系统,以在结肠镜检查期间辅助息肉检测。这里,我们证明有必要包括阴性样本(I)减少息肉发现阶段的假阳性,通过包括具有可能混淆检测模型的伪像的图像(例如,医疗器械,水射流,粪便,血,相机过度靠近结肠壁,模糊的图像,等。)通常不包括在模型开发数据集中,和(Ii)正确估计模型的更现实的性能。通过重新训练我们先前开发的基于YOLOv3的检测模型,其数据集包括15%的其他非息肉图像,并带有各种伪影,我们能够在内部测试数据集中总体上提高其F1性能(从平均F1为0.869到0.893),现在包括这种类型的图像,以及包括非息肉图像的四个公共数据集(平均F1为0.695至0.722)。
    Deep learning object-detection models are being successfully applied to develop computer-aided diagnosis systems for aiding polyp detection during colonoscopies. Here, we evidence the need to include negative samples for both (i) reducing false positives during the polyp-finding phase, by including images with artifacts that may confuse the detection models (e.g., medical instruments, water jets, feces, blood, excessive proximity of the camera to the colon wall, blurred images, etc.) that are usually not included in model development datasets, and (ii) correctly estimating a more realistic performance of the models. By retraining our previously developed YOLOv3-based detection model with a dataset that includes 15% of additional not-polyp images with a variety of artifacts, we were able to generally improve its F1 performance in our internal test datasets (from an average F1 of 0.869 to 0.893), which now include such type of images, as well as in four public datasets that include not-polyp images (from an average F1 of 0.695 to 0.722).
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    未经证实:内镜下可见的胃肿瘤性病变(GNL),包括早期胃癌和上皮内瘤,应准确诊断并及时治疗。然而,GNL的高漏诊率有助于胃癌进展的潜在风险.这项研究的目的是开发一种基于深度学习的计算机辅助诊断(CAD)系统,用于在带窄带成像(ME-NBI)的放大内窥镜下对疑似浅表病变的患者进行GNL的诊断和分割。
    未经授权:对两个中心的GNL患者的ME-NBI图像进行回顾性分析。在这些图像上开发并训练了两个卷积神经网络(CNN)模块。CNN1被训练来诊断GNL,CNN2被训练用于分割。使用另一个内部测试集和来自另一个中心的外部测试集来评估诊断和分割性能。
    UNASSIGNED:CNN1显示出具有准确性的诊断性能,灵敏度,特异性,阳性预测值(PPV)和阴性预测值(NPV)为90.8%,92.5%,89.0%,89.4%和92.2%,分别,内部测试集中的曲线下面积(AUC)为0.928。在CNN1协助下,所有内镜医师的准确率均高于独立诊断.CNN2和地面实况之间的平均交点(IOU)为0.5837,精度很高,召回率和骰子系数分别为0.776、0.983和0.867。
    UNASSIGNED:此CAD系统可用作辅助工具来诊断和分割GNL,协助内镜医师更准确地诊断GNLs并描绘其范围,以提高病变活检的阳性率并确保内镜切除的完整性。
    UNASSIGNED: Endoscopically visible gastric neoplastic lesions (GNLs), including early gastric cancer and intraepithelial neoplasia, should be accurately diagnosed and promptly treated. However, a high rate of missed diagnosis of GNLs contributes to the potential risk of the progression of gastric cancer. The aim of this study was to develop a deep learning-based computer-aided diagnosis (CAD) system for the diagnosis and segmentation of GNLs under magnifying endoscopy with narrow-band imaging (ME-NBI) in patients with suspected superficial lesions.
    UNASSIGNED: ME-NBI images of patients with GNLs in two centers were retrospectively analysed. Two convolutional neural network (CNN) modules were developed and trained on these images. CNN1 was trained to diagnose GNLs, and CNN2 was trained for segmentation. An additional internal test set and an external test set from another center were used to evaluate the diagnosis and segmentation performance.
    UNASSIGNED: CNN1 showed a diagnostic performance with an accuracy, sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) of 90.8%, 92.5%, 89.0%, 89.4% and 92.2%, respectively, and an area under the curve (AUC) of 0.928 in the internal test set. With CNN1 assistance, all endoscopists had a higher accuracy than for an independent diagnosis. The average intersection over union (IOU) between CNN2 and the ground truth was 0.5837, with a precision, recall and the Dice coefficient of 0.776, 0.983 and 0.867, respectively.
    UNASSIGNED: This CAD system can be used as an auxiliary tool to diagnose and segment GNLs, assisting endoscopists in more accurately diagnosing GNLs and delineating their extent to improve the positive rate of lesion biopsy and ensure the integrity of endoscopic resection.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    自闭症谱系障碍(ASD)是一种以认知缺陷为特征的神经系统疾病,身体活动,和社交技能。没有特定的药物来治疗这种疾病;只有早期干预才能改善大脑功能。因为没有医学测试来识别ASD,诊断可能具有挑战性。为了确定诊断,医生考虑孩子的行为和发育历史。人脸可以用作生物标志物,因为它是大脑的潜在反射之一,因此可以用作早期诊断的简单而方便的工具。本研究使用几种基于深度卷积神经网络(CNN)的迁移学习方法来使用面部图像检测自闭症儿童。进行了实证研究,以选择最佳优化器和超参数集以使用CNN模型获得更好的预测精度。在使用优化的设置进行训练和验证之后,修改后的Xception模型通过在测试集上实现95%的准确度,展示了最佳性能,而VGG19、ResNet50V2、MobileNetV2和EfficientNetB0达到86.5%,94%,92%,85.8%,准确度,分别。我们的初步计算结果表明,我们的迁移学习方法优于现有方法。我们修改后的模型可用于帮助医生和从业人员验证其初始筛查以检测患有ASD疾病的儿童。
    Autism spectrum disorder (ASD) is a neurological illness characterized by deficits in cognition, physical activities, and social skills. There is no specific medication to treat this illness; only early intervention can improve brain functionality. Since there is no medical test to identify ASD, a diagnosis might be challenging. In order to determine a diagnosis, doctors consider the child\'s behavior and developmental history. The human face can be used as a biomarker as it is one of the potential reflections of the brain and thus can be used as a simple and handy tool for early diagnosis. This study uses several deep convolutional neural network (CNN)-based transfer learning approaches to detect autistic children using the facial image. An empirical study is conducted to select the best optimizer and set of hyperparameters to achieve better prediction accuracy using the CNN model. After training and validating with the optimized setting, the modified Xception model demonstrates the best performance by achieving an accuracy of 95% on the test set, whereas the VGG19, ResNet50V2, MobileNetV2, and EfficientNetB0 achieved 86.5%, 94%, 92%, and 85.8%, accuracy, respectively. Our preliminary computational results demonstrate that our transfer learning approaches outperformed existing methods. Our modified model can be employed to assist doctors and practitioners in validating their initial screening to detect children with ASD disease.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    UNASSIGNED:使用胸部X光片评估细粒度注释克服基于深度学习(DL)的诊断中的快捷学习的能力。
    UNASSIGNED:使用射线照片级注释(疾病存在:是或否)和细粒度病变级注释(病变边界框)开发了两个DL模型,分别命名为CheXNet和CheXDet。回顾性收集了从2005年1月至2019年9月获得的34501张胸片,并对心脏肥大进行了注释。胸腔积液,质量,结节,肺炎,气胸,结核病,骨折,主动脉钙化.在测试集(n=2922)上比较了模型的内部分类性能和病变定位性能;在美国国立卫生研究院(NIH)Google(n=4376)和PadChest(n=24536)数据集上比较了外部分类性能;在NIHChestX-ray14数据集(n=880)上比较了外部病变定位性能。还将模型与放射科医师在内部测试集的子集上的表现进行了比较(n=496)。使用受试者工作特征(ROC)曲线分析评估性能。
    未经评估:如果有足够的训练数据,两种模型的表现与放射科医师相似。CheXDet在外部分类方面取得了显著改进,例如在NIHGoogle上对骨折进行分类(CheXDetROC曲线下面积[AUC],0.67;CheXNetAUC,0.51;P<.001)和PadChest(CheXDetAUC,0.78;CheXNetAUC,0.55;P<.001)。对于所有数据集上的大多数异常,CheXDet实现了比CheXNet更高的病变检测性能,例如在内部装置上检测气胸(CheXDet成刀替代自由反应ROC[JAFROC]品质因数[FOM],0.87;CheXNetJAFROCFOM,0.13;P<.001)和NIHChestX-ray14(CheXDetJAFROCFOM,0.55;CheXNetJAFROCFOM,0.04;P<.001)。
    UNASSIGNED:细粒度注释克服了快捷学习,使DL模型能够识别正确的病变模式,提高模型的泛化性。关键词:计算机辅助诊断,常规射线照相术,卷积神经网络(CNN)深度学习算法,机器学习算法,本地化补充材料可用于本文©RSNA,2022年。
    UNASSIGNED: To evaluate the ability of fine-grained annotations to overcome shortcut learning in deep learning (DL)-based diagnosis using chest radiographs.
    UNASSIGNED: Two DL models were developed using radiograph-level annotations (disease present: yes or no) and fine-grained lesion-level annotations (lesion bounding boxes), respectively named CheXNet and CheXDet. A total of 34 501 chest radiographs obtained from January 2005 to September 2019 were retrospectively collected and annotated regarding cardiomegaly, pleural effusion, mass, nodule, pneumonia, pneumothorax, tuberculosis, fracture, and aortic calcification. The internal classification performance and lesion localization performance of the models were compared on a testing set (n = 2922); external classification performance was compared on National Institutes of Health (NIH) Google (n = 4376) and PadChest (n = 24 536) datasets; and external lesion localization performance was compared on the NIH ChestX-ray14 dataset (n = 880). The models were also compared with radiologist performance on a subset of the internal testing set (n = 496). Performance was evaluated using receiver operating characteristic (ROC) curve analysis.
    UNASSIGNED: Given sufficient training data, both models performed similarly to radiologists. CheXDet achieved significant improvement for external classification, such as classifying fracture on NIH Google (CheXDet area under the ROC curve [AUC], 0.67; CheXNet AUC, 0.51; P < .001) and PadChest (CheXDet AUC, 0.78; CheXNet AUC, 0.55; P < .001). CheXDet achieved higher lesion detection performance than CheXNet for most abnormalities on all datasets, such as detecting pneumothorax on the internal set (CheXDet jackknife alternative free-response ROC [JAFROC] figure of merit [FOM], 0.87; CheXNet JAFROC FOM, 0.13; P < .001) and NIH ChestX-ray14 (CheXDet JAFROC FOM, 0.55; CheXNet JAFROC FOM, 0.04; P < .001).
    UNASSIGNED: Fine-grained annotations overcame shortcut learning and enabled DL models to identify correct lesion patterns, improving the generalizability of the models.Keywords: Computer-aided Diagnosis, Conventional Radiography, Convolutional Neural Network (CNN), Deep Learning Algorithms, Machine Learning Algorithms, Localization Supplemental material is available for this article © RSNA, 2022.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号