Long-tailed learning

  • 文章类型: Journal Article
    许多现实世界的图像识别问题,如诊断医学成像检查,是“长尾”-有一些常见的发现,其次是许多相对罕见的情况。在胸部X线摄影中,诊断既是一个长尾问题,也是一个多标签问题,因为患者通常同时出现多个发现。尽管研究人员已经开始研究医学图像识别中的长尾学习问题,很少有人研究长尾造成的标签不平衡和标签共现的相互作用,多标签疾病分类。为了与研究界就这一新兴主题进行交流,我们进行了公开的挑战,CXR-LT,在长尾,胸部X光片(CXRs)的多标签胸部疾病分类。我们公开发布了超过350,000个CXR的大规模基准测试数据集,每个标记有长尾分布后的26个临床发现中的至少一个。我们综合了性能最好的解决方案的共同主题,为长尾提供实用建议,多标签医学图像分类。最后,我们利用这些见解提出了一条涉及视觉-语言基础模型的前进道路,用于少量和零剂量疾病分类.
    Many real-world image recognition problems, such as diagnostic medical imaging exams, are \"long-tailed\" - there are a few common findings followed by many more relatively rare conditions. In chest radiography, diagnosis is both a long-tailed and multi-label problem, as patients often present with multiple findings simultaneously. While researchers have begun to study the problem of long-tailed learning in medical image recognition, few have studied the interaction of label imbalance and label co-occurrence posed by long-tailed, multi-label disease classification. To engage with the research community on this emerging topic, we conducted an open challenge, CXR-LT, on long-tailed, multi-label thorax disease classification from chest X-rays (CXRs). We publicly release a large-scale benchmark dataset of over 350,000 CXRs, each labeled with at least one of 26 clinical findings following a long-tailed distribution. We synthesize common themes of top-performing solutions, providing practical recommendations for long-tailed, multi-label medical image classification. Finally, we use these insights to propose a path forward involving vision-language foundation models for few- and zero-shot disease classification.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:在过去的十年中,长尾学习已成为深度学习在医学中应用的热门研究热点。然而,没有科学计量学报告对这一科学领域提供了系统的概述。我们利用文献计量技术来识别和分析长尾学习在医学深度学习应用中的文献,并调查研究趋势。核心作者,和核心期刊。我们扩展了对医学领域长尾学习研究的主要组成部分和主要方法的理解。
    方法:WebofScience被用来收集直到2023年12月出版的所有关于医学长尾学习的文章。评估了所有检索到的标题和摘要的适用性。对于文献计量分析,提取了所有数值数据。CiteSpace用于基于关键字创建集群和视觉知识图。
    结果:共579篇文章符合评价标准。在过去的十年里,年度出版物数量和引用频率均显示出显着增长,遵循幂律和指数趋势,分别。这一领域值得注意的贡献者包括HusanbirSinghPannu,FadiThabtah,还有TalhaMahboobAlam,在IEEEACCESS等领先期刊上,生物学和医学计算机,IEEE医学成像事务,计算机医学图像和图形已成为传播该领域研究的关键平台。医学领域长尾学习研究的核心包含六个主要主题:不平衡数据的深度学习,模型优化,图像分析中的神经网络,健康记录中的数据不平衡,CNN在诊断和风险评估中,和疾病机制中的遗传信息。
    结论:本研究通过文献计量分析和可视化知识图总结了将长尾学习应用于医学深度学习的最新进展。它解释了新趋势,来源,核心作者,期刊,和研究热点。尽管这一领域在医学深度学习研究中显示出巨大的前景,我们的研究结果将为未来的研究和临床实践提供有价值的见解.
    BACKGROUND: In the last decade, long-tail learning has become a popular research focus in deep learning applications in medicine. However, no scientometric reports have provided a systematic overview of this scientific field. We utilized bibliometric techniques to identify and analyze the literature on long-tailed learning in deep learning applications in medicine and investigate research trends, core authors, and core journals. We expanded our understanding of the primary components and principal methodologies of long-tail learning research in the medical field.
    METHODS: Web of Science was utilized to collect all articles on long-tailed learning in medicine published until December 2023. The suitability of all retrieved titles and abstracts was evaluated. For bibliometric analysis, all numerical data were extracted. CiteSpace was used to create clustered and visual knowledge graphs based on keywords.
    RESULTS: A total of 579 articles met the evaluation criteria. Over the last decade, the annual number of publications and citation frequency both showed significant growth, following a power-law and exponential trend, respectively. Noteworthy contributors to this field include Husanbir Singh Pannu, Fadi Thabtah, and Talha Mahboob Alam, while leading journals such as IEEE ACCESS, COMPUTERS IN BIOLOGY AND MEDICINE, IEEE TRANSACTIONS ON MEDICAL IMAGING, and COMPUTERIZED MEDICAL IMAGING AND GRAPHICS have emerged as pivotal platforms for disseminating research in this area. The core of long-tailed learning research within the medical domain is encapsulated in six principal themes: deep learning for imbalanced data, model optimization, neural networks in image analysis, data imbalance in health records, CNN in diagnostics and risk assessment, and genetic information in disease mechanisms.
    CONCLUSIONS: This study summarizes recent advancements in applying long-tail learning to deep learning in medicine through bibliometric analysis and visual knowledge graphs. It explains new trends, sources, core authors, journals, and research hotspots. Although this field has shown great promise in medical deep learning research, our findings will provide pertinent and valuable insights for future research and clinical practice.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • DOI:
    文章类型: Preprint
    许多现实世界的图像识别问题,如诊断医学成像检查,是\"长尾\"$\\unicode{x2013}$有一些常见的发现,后面还有许多相对罕见的情况。在胸部X线摄影中,诊断既是一个长尾问题,也是一个多标签问题,因为患者通常同时出现多个发现。尽管研究人员已经开始研究医学图像识别中的长尾学习问题,很少有人研究长尾造成的标签不平衡和标签共现的相互作用,多标签疾病分类。为了与研究界就这一新兴主题进行交流,我们进行了公开的挑战,CXR-LT,在长尾,胸部X光片(CXRs)的多标签胸部疾病分类。我们公开发布了超过350,000个CXR的大规模基准测试数据集,每个标记有长尾分布后的26个临床发现中的至少一个。我们综合了性能最好的解决方案的共同主题,为长尾提供实用建议,多标签医学图像分类。最后,我们利用这些见解提出了一条涉及视觉-语言基础模型的前进道路,用于少量和零剂量疾病分类.
    Many real-world image recognition problems, such as diagnostic medical imaging exams, are \"long-tailed\" - there are a few common findings followed by many more relatively rare conditions. In chest radiography, diagnosis is both a long-tailed and multi-label problem, as patients often present with multiple findings simultaneously. While researchers have begun to study the problem of long-tailed learning in medical image recognition, few have studied the interaction of label imbalance and label co-occurrence posed by long-tailed, multi-label disease classification. To engage with the research community on this emerging topic, we conducted an open challenge, CXR-LT, on long-tailed, multi-label thorax disease classification from chest X-rays (CXRs). We publicly release a large-scale benchmark dataset of over 350,000 CXRs, each labeled with at least one of 26 clinical findings following a long-tailed distribution. We synthesize common themes of top-performing solutions, providing practical recommendations for long-tailed, multi-label medical image classification. Finally, we use these insights to propose a path forward involving vision-language foundation models for few- and zero-shot disease classification.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    剪枝已经成为压缩深度神经网络的强大技术,减少内存使用和推理时间,而不会显著影响整体性能。然而,修剪影响模型行为的细微差别方式还没有得到很好的理解,特别是对于长尾,临床环境中常见的多标签数据集。在部署用于诊断的修剪模型时,这种知识差距可能会产生危险的影响,意外的模型行为可能会影响患者的健康。为了填补这个空白,我们首先分析了修剪对神经网络的影响,该神经网络被训练为通过胸部X线(CXR)诊断胸部疾病。在两个大型CXR数据集上,我们检查哪些疾病受修剪影响最大,并根据疾病频率和共现行为来描述“健忘性”类别。Further,我们确定未压缩和严重修剪的模型不一致的单个CXR,被称为修剪识别样本(PIE),并进行人类读者研究以评估他们的统一素质。我们发现放射科医生认为PIE具有更多的标签噪声,图像质量较低,诊断难度较高。这项工作代表了理解修剪对深度长尾模型行为影响的第一步,多标签医学图像分类。所有代码,模型权重,和数据访问说明可以在https://github.com/VITA-Group/PruneCXR找到。
    Pruning has emerged as a powerful technique for compressing deep neural networks, reducing memory usage and inference time without significantly affecting overall performance. However, the nuanced ways in which pruning impacts model behavior are not well understood, particularly for long-tailed, multi-label datasets commonly found in clinical settings. This knowledge gap could have dangerous implications when deploying a pruned model for diagnosis, where unexpected model behavior could impact patient well-being. To fill this gap, we perform the first analysis of pruning\'s effect on neural networks trained to diagnose thorax diseases from chest X-rays (CXRs). On two large CXR datasets, we examine which diseases are most affected by pruning and characterize class \"forgettability\" based on disease frequency and co-occurrence behavior. Further, we identify individual CXRs where uncompressed and heavily pruned models disagree, known as pruning-identified exemplars (PIEs), and conduct a human reader study to evaluate their unifying qualities. We find that radiologists perceive PIEs as having more label noise, lower image quality, and higher diagnosis difficulty. This work represents a first step toward understanding the impact of pruning on model behavior in deep long-tailed, multi-label medical image classification. All code, model weights, and data access instructions can be found at https://github.com/VITA-Group/PruneCXR.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • DOI:
    文章类型: Preprint
    剪枝已经成为压缩深度神经网络的强大技术,减少内存使用和推理时间,而不会显著影响整体性能。然而,修剪影响模型行为的细微差别方式还没有得到很好的理解,特别是对于长尾,临床环境中常见的多标签数据集。在部署用于诊断的修剪模型时,这种知识差距可能会产生危险的影响,意外的模型行为可能会影响患者的健康。为了填补这个空白,我们首先分析了修剪对神经网络的影响,该神经网络被训练为通过胸部X线(CXR)诊断胸部疾病。在两个大型CXR数据集上,我们检查哪些疾病受修剪影响最大,并根据疾病频率和共现行为来描述“健忘性”类别。Further,我们确定未压缩和严重修剪的模型不一致的单个CXR,被称为修剪识别样本(PIE),并进行人类读者研究以评估他们的统一素质。我们发现放射科医生认为PIE具有更多的标签噪声,图像质量较低,诊断难度较高。这项工作代表了理解修剪对深度长尾模型行为影响的第一步,多标签医学图像分类。所有代码,模型权重,和数据访问说明可以在https://github.com/VITA-Group/PruneCXR找到。
    Pruning has emerged as a powerful technique for compressing deep neural networks, reducing memory usage and inference time without significantly affecting overall performance. However, the nuanced ways in which pruning impacts model behavior are not well understood, particularly for long-tailed, multi-label datasets commonly found in clinical settings. This knowledge gap could have dangerous implications when deploying a pruned model for diagnosis, where unexpected model behavior could impact patient well-being. To fill this gap, we perform the first analysis of pruning\'s effect on neural networks trained to diagnose thorax diseases from chest X-rays (CXRs). On two large CXR datasets, we examine which diseases are most affected by pruning and characterize class \"forgettability\" based on disease frequency and co-occurrence behavior. Further, we identify individual CXRs where uncompressed and heavily pruned models disagree, known as pruning-identified exemplars (PIEs), and conduct a human reader study to evaluate their unifying qualities. We find that radiologists perceive PIEs as having more label noise, lower image quality, and higher diagnosis difficulty. This work represents a first step toward understanding the impact of pruning on model behavior in deep long-tailed, multi-label medical image classification. All code, model weights, and data access instructions can be found at https://github.com/VITA-Group/PruneCXR.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    影像检查,比如胸片,将产生一组小的共同发现和一组更大的不寻常的发现。虽然训练有素的放射科医生可以通过研究一些代表性的例子来学习罕见疾病的视觉表现,教机器从这种“长尾”分布中学习要困难得多,因为标准方法很容易偏向最常见的类。在本文中,我们在胸部X线片上对胸部疾病特定领域的长尾学习问题进行了全面的基准研究。我们专注于从自然分布的胸部X射线数据中学习,不仅在常见的“头”类上优化分类精度,但也是罕见但关键的“尾巴”类。要做到这一点,我们引入了一种具有挑战性的新的长尾胸部X线基准,以促进开发用于医学图像分类的长尾学习方法的研究.基准包括两个胸部X射线数据集,用于19和20路胸部疾病分类,包含多达53,000的类和少至7个标记的训练图像。我们在这个新的基准上评估了标准和最先进的长尾学习方法,分析这些方法的哪些方面最有利于长尾医学图像分类,并总结未来算法设计的见解。数据集,经过训练的模型,和代码可在https://github.com/VITA-Group/LongTailCXR获得。
    Imaging exams, such as chest radiography, will yield a small set of common findings and a much larger set of uncommon findings. While a trained radiologist can learn the visual presentation of rare conditions by studying a few representative examples, teaching a machine to learn from such a \"long-tailed\" distribution is much more difficult, as standard methods would be easily biased toward the most frequent classes. In this paper, we present a comprehensive benchmark study of the long-tailed learning problem in the specific domain of thorax diseases on chest X-rays. We focus on learning from naturally distributed chest X-ray data, optimizing classification accuracy over not only the common \"head\" classes, but also the rare yet critical \"tail\" classes. To accomplish this, we introduce a challenging new long-tailed chest X-ray benchmark to facilitate research on developing long-tailed learning methods for medical image classification. The benchmark consists of two chest X-ray datasets for 19- and 20-way thorax disease classification, containing classes with as many as 53,000 and as few as 7 labeled training images. We evaluate both standard and state-of-the-art long-tailed learning methods on this new benchmark, analyzing which aspects of these methods are most beneficial for long-tailed medical image classification and summarizing insights for future algorithm design. The datasets, trained models, and code are available at https://github.com/VITA-Group/LongTailCXR.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号