Deep metric learning

  • 文章类型: Journal Article
    在生物图像分析领域,深度学习(DL)已成为核心工具包,例如用于分割和分类。然而,传统的DL方法受到大型生物多样性数据集的挑战,这些数据集的特征是类别不平衡且难以区分它们之间的表型差异。这里我们介绍生物编码器,用于度量学习的用户友好的工具包,它通过专注于学习各个数据点之间的关系而不是类的可分离性来克服这些挑战。BioEncoder作为Python包发布,创建用于跨不同数据集的易用性和灵活性。它具有与分类单元无关的数据加载器,自定义增强选项,以及通过基于文本的配置文件进行简单的超参数调整。该工具包的意义在于它有潜力在生物图像分析中解锁新的研究途径,同时使对先进的深度度量学习技术的访问民主化。BioEncoder专注于对工具包的迫切需要,这些工具包弥合了复杂的DL管道与生物学研究中的实际应用之间的差距。
    In the realm of biological image analysis, deep learning (DL) has become a core toolkit, for example for segmentation and classification. However, conventional DL methods are challenged by large biodiversity datasets characterized by unbalanced classes and hard-to-distinguish phenotypic differences between them. Here we present BioEncoder, a user-friendly toolkit for metric learning, which overcomes these challenges by focussing on learning relationships between individual data points rather than on the separability of classes. BioEncoder is released as a Python package, created for ease of use and flexibility across diverse datasets. It features taxon-agnostic data loaders, custom augmentation options, and simple hyperparameter adjustments through text-based configuration files. The toolkit\'s significance lies in its potential to unlock new research avenues in biological image analysis while democratizing access to advanced deep metric learning techniques. BioEncoder focuses on the urgent need for toolkits bridging the gap between complex DL pipelines and practical applications in biological research.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    锥虫病,南美的一个重大健康问题,南亚,东南亚,需要积极调查才能有效控制疾病。为了解决这个问题,我们开发了一种结合深度度量学习(DML)和图像检索的混合模型。该模型精通在薄血膜检查的显微镜图像中识别锥虫物种。利用ResNet50骨干神经网络,经过训练的模型表现突出,准确率超过99.71%,召回率高达96%。承认在现场场景中需要自动化工具,我们展示了我们的模型作为自主筛查方法的潜力.这是通过使用流行的卷积神经网络(CNN)应用程序来实现的,和KNN算法返回的基于矢量数据库的图像。这一成就主要归因于三元组裕度损失函数的实现,精度为98%。在五次交叉验证中展示的模型的鲁棒性突出了ResNet50神经网络,基于DML,作为最先进的CNN模型,AUC>98%。DML的采用显着提高了模型的性能,保持不受数据集变化的影响,并使其成为实地考察研究的有用工具。与传统分类模型相比,DML在管理具有大量类的大规模数据集方面提供了若干优势,增强可扩展性。该模型有能力推广到训练期间没有遇到的新颖课程,证明在新类可能不断出现的情况下特别有利。它也非常适合需要精确识别的应用,特别是在区分密切相关的类。此外,DML对与班级不平衡有关的问题表现出更大的弹性,因为它专注于学习距离或相似性,对这种不平衡更宽容。这些贡献使得DML模型的有效性和实用性,特别是在田野调查中。
    Trypanosomiasis, a significant health concern in South America, South Asia, and Southeast Asia, requires active surveys to effectively control the disease. To address this, we have developed a hybrid model that combines deep metric learning (DML) and image retrieval. This model is proficient at identifying Trypanosoma species in microscopic images of thin-blood film examinations. Utilizing the ResNet50 backbone neural network, a trained-model has demonstrated outstanding performance, achieving an accuracy exceeding 99.71 % and up to 96 % in recall. Acknowledging the necessity for automated tools in field scenarios, we demonstrated the potential of our model as an autonomous screening approach. This was achieved by using prevailing convolutional neural network (CNN) applications, and vector database based-images returned by the KNN algorithm. This achievement is primarily attributed to the implementation of the Triplet Margin Loss function as 98 % of precision. The robustness of the model demonstrated in five-fold cross-validation highlights the ResNet50 neural network, based on DML, as a state-of-the-art CNN model as AUC >98 %. The adoption of DML significantly improves the performance of the model, remaining unaffected by variations in the dataset and rendering it a useful tool for fieldwork studies. DML offers several advantages over conventional classification model to manage large-scale datasets with a high volume of classes, enhancing scalability. The model has the capacity to generalize to novel classes that were not encountered during training, proving particularly advantageous in scenarios where new classes may consistently emerge. It is also well suited for applications requiring precise recognition, especially in discriminating between closely related classes. Furthermore, the DML exhibits greater resilience to issues related to class imbalance, as it concentrates on learning distances or similarities, which are more tolerant to such imbalances. These contributions significantly make the effectiveness and practicality of DML model, particularly in in fieldwork research.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    Polymicrogyria(PMG)是一种主要见于儿童的皮质组织疾病,这可能与癫痫发作有关,发育迟缓和运动无力。PMG通常在磁共振成像(MRI)上被诊断,但是即使对于有经验的放射科医师,一些病例也可能难以检测。在这项研究中,我们创建了一个开放的儿科MRI数据集(PPMR),其中包含来自东安大略省儿童医院(CHEO)的PMG和对照病例,渥太华,加拿大。PMG和对照MRI之间的差异是微妙的,疾病特征的真实分布未知。这使得难以在MRI中自动检测潜在的PMG病例。为了能够自动检测潜在的PMG病例,我们提出了一种基于新型中心的深度对比度量学习损失函数(cDCM)的异常检测方法。尽管使用小且不平衡的数据集,我们的方法在71.86%的精度下实现了88.07%的召回率。这将有助于放射科医师选择潜在的PMGMRI的计算机辅助工具。据我们所知,我们的研究首次将机器学习技术应用于仅从MRI识别PMG.我们的代码可在以下网址获得:https://github.com/RichardChangCA/Deep-Contrastive-Metric-Learning-Method-to-Detect-Polymicrogyria-in-Pediatric-Brain-MRI。我们的儿科MRI数据集可在以下网址获得:https://www。kaggle.com/datasets/lingfengzhang/儿科-polymicrogyria-mri-dataset.
    Polymicrogyria (PMG) is a disorder of cortical organization mainly seen in children, which can be associated with seizures, developmental delay and motor weakness. PMG is typically diagnosed on magnetic resonance imaging (MRI) but some cases can be challenging to detect even for experienced radiologists. In this study, we create an open pediatric MRI dataset (PPMR) containing both PMG and control cases from the Children\'s Hospital of Eastern Ontario (CHEO), Ottawa, Canada. The differences between PMG and control MRIs are subtle and the true distribution of the features of the disease is unknown. This makes automatic detection of potential PMG cases in MRI difficult. To enable the automatic detection of potential PMG cases, we propose an anomaly detection method based on a novel center-based deep contrastive metric learning loss function (cDCM). Despite working with a small and imbalanced dataset our method achieves 88.07% recall at 71.86% precision. This will facilitate a computer-aided tool for radiologists to select potential PMG MRIs. To the best of our knowledge, our research is the first to apply machine learning techniques to identify PMG solely from MRI. Our code is available at: https://github.com/RichardChangCA/Deep-Contrastive-Metric-Learning-Method-to-Detect-Polymicrogyria-in-Pediatric-Brain-MRI. Our pediatric MRI dataset is available at: https://www.kaggle.com/datasets/lingfengzhang/pediatric-polymicrogyria-mri-dataset.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在放射学解释期间,放射科医师从医学图像的元数据中读取患者标识符以识别被检查的患者。然而,对于放射科医生来说,识别“不正确的”元数据和患者识别错误是一项挑战。我们提出了一种方法,该方法使用患者重新识别技术将正确的元数据链接到丢失或错误分配元数据的躯干计算机断层扫描图像的图像集。该方法基于特征向量匹配技术,该技术使用深度特征提取器来适应侦察计算机断层扫描图像数据集中包含的跨供应商域。要识别“不正确”元数据,我们计算了随访图像与关联到正确元数据的已存储基线图像之间的最高相似性得分.重新识别性能测试相似性得分最高的图像是否属于同一患者,即,附加到图像的元数据是否正确。相同“正确”患者的随访图像和基线图像之间的相似性得分通常高于“不正确”患者。所提出的特征提取器具有足够的鲁棒性,可以在不进行额外训练的情况下提取单个可区分特征,即使是未知的侦察员计算机断层扫描图像。此外,所提出的增强技术通过合并由于每次检查期间患者表高度的变化而导致的宽度放大倍数的变化,进一步改善了不同供应商的子集的重新识别性能.我们认为,使用所提出的方法进行元数据检查将有助于检测由于不可避免的错误(例如人为错误)而分配的“不正确”患者标识符的元数据。
    During radiologic interpretation, radiologists read patient identifiers from the metadata of medical images to recognize the patient being examined. However, it is challenging for radiologists to identify \"incorrect\" metadata and patient identification errors. We propose a method that uses a patient re-identification technique to link correct metadata to an image set of computed tomography images of a trunk with lost or wrongly assigned metadata. This method is based on a feature vector matching technique that uses a deep feature extractor to adapt to the cross-vendor domain contained in the scout computed tomography image dataset. To identify \"incorrect\" metadata, we calculated the highest similarity score between a follow-up image and a stored baseline image linked to the correct metadata. The re-identification performance tests whether the image with the highest similarity score belongs to the same patient, i.e., whether the metadata attached to the image are correct. The similarity scores between the follow-up and baseline images for the same \"correct\" patients were generally greater than those for \"incorrect\" patients. The proposed feature extractor was sufficiently robust to extract individual distinguishable features without additional training, even for unknown scout computed tomography images. Furthermore, the proposed augmentation technique further improved the re-identification performance of the subset for different vendors by incorporating changes in width magnification due to changes in patient table height during each examination. We believe that metadata checking using the proposed method would help detect the metadata with an \"incorrect\" patient identifier assigned due to unavoidable errors such as human error.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    环境DNA(eDNA)元编码为记录海洋和陆地生态系统中的生物多样性模式提供了一种有效的方法。这些数据的复杂性使当前的方法无法提取和分析它们包含的所有相关生态信息,新的方法可以提供更好的降维和聚类。在这里,我们提出了两种新的基于深度学习的方法,它们结合了不同类型的神经网络(NN)来协调eDNA样本并在二维空间中可视化生态系统属性:第一种基于变分自编码器,第二种基于深度度量学习。我们新方法的优势在于两个输入的组合:为检测到的每个分子操作分类单位(MOTU)找到的序列数及其相应的核苷酸序列。使用三个不同的数据集,我们证明了我们的方法在二维潜在空间中准确地表示了几个生物多样性指标:每个样本的MOTU丰富度,每个样本的序列α-多样性,Jaccard和样本之间的序列β-多样性。我们表明,我们的非线性方法可以更好地从eDNA数据集中提取特征,同时避免与eDNA相关的主要偏见。我们的方法优于传统的降维方法,如主成分分析,t分布随机邻域嵌入,用于降维的非度量多维缩放和均匀流形逼近和投影。我们的结果表明,NN提供了一种从eDNA元编码数据中提取结构的更有效方法,从而改善其生态解释,从而改善生物多样性监测。
    Environmental DNA (eDNA) metabarcoding provides an efficient approach for documenting biodiversity patterns in marine and terrestrial ecosystems. The complexity of these data prevents current methods from extracting and analyzing all the relevant ecological information they contain, and new methods may provide better dimensionality reduction and clustering. Here we present two new deep learning-based methods that combine different types of neural networks (NNs) to ordinate eDNA samples and visualize ecosystem properties in a two-dimensional space: the first is based on variational autoencoders and the second on deep metric learning. The strength of our new methods lies in the combination of two inputs: the number of sequences found for each molecular operational taxonomic unit (MOTU) detected and their corresponding nucleotide sequence. Using three different datasets, we show that our methods accurately represent several biodiversity indicators in a two-dimensional latent space: MOTU richness per sample, sequence α-diversity per sample, Jaccard\'s and sequence β-diversity between samples. We show that our nonlinear methods are better at extracting features from eDNA datasets while avoiding the major biases associated with eDNA. Our methods outperform traditional dimension reduction methods such as Principal Component Analysis, t-distributed Stochastic Neighbour Embedding, Nonmetric Multidimensional Scaling and Uniform Manifold Approximation and Projection for dimension reduction. Our results suggest that NNs provide a more efficient way of extracting structure from eDNA metabarcoding data, thereby improving their ecological interpretation and thus biodiversity monitoring.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    目的:由于其在患者护理中的重要性,病理图像分析中的癌症分级是一项主要任务,治疗,和管理。用于计算病理学的人工神经网络的最新发展已显示出提高癌症诊断的准确性和质量的巨大潜力。这些改进通常归因于网络体系结构的进步,通常会导致计算和资源的增加。在这项工作中,我们提出了一种高效的卷积神经网络,旨在通过度量学习以准确和稳健的方式进行多类癌症分类。
    方法:我们提出了一种质心感知度量学习网络,用于改善病理图像中的癌症分级。所提出的网络利用特征嵌入空间内不同类别的质心来优化病理图像之间的相对距离,这表明了它们之间先天的相似性/差异性。为了改进优化,我们引入了一个新的损失函数和一个针对所提出的网络和度量学习的训练策略。
    结果:我们在结直肠癌和胃癌的多个数据集上评估了所提出的方法。对于结肠直肠癌,采用了从不同采集设置收集的两个不同数据集.所提出的方法达到了一定的准确性,F1分数,二次加权κ为88.7%,0.849,第一个数据集为0.946,占83.3%,0.764,第二个数据集为0.907,分别。对于胃癌,所提出的方法获得了85.9%的准确率,F1评分为0.793,二次加权κ为0.939。我们还发现,所提出的方法优于其他竞争模型,并且计算效率很高。
    结论:实验结果表明,所提出的网络预测结果既准确又可靠。所提出的网络不仅在癌症分类方面优于其他相关方法,而且在训练和推理过程中也实现了卓越的计算效率。未来的研究将需要进一步开发所提出的方法,并将该方法应用于其他问题和领域。
    OBJECTIVE: Cancer grading in pathology image analysis is a major task due to its importance in patient care, treatment, and management. The recent developments in artificial neural networks for computational pathology have demonstrated great potential to improve the accuracy and quality of cancer diagnosis. These improvements are generally ascribable to the advance in the architecture of the networks, often leading to increase in the computation and resources. In this work, we propose an efficient convolutional neural network that is designed to conduct multi-class cancer classification in an accurate and robust manner via metric learning.
    METHODS: We propose a centroid-aware metric learning network for an improved cancer grading in pathology images. The proposed network utilizes centroids of different classes within the feature embedding space to optimize the relative distances between pathology images, which manifest the innate similarities/dissimilarities between them. For improved optimization, we introduce a new loss function and a training strategy that are tailored to the proposed network and metric learning.
    RESULTS: We evaluated the proposed approach on multiple datasets of colorectal and gastric cancers. For the colorectal cancer, two different datasets were employed that were collected from different acquisition settings. the proposed method achieved an accuracy, F1-score, quadratic weighted kappa of 88.7%, 0.849, and 0.946 for the first dataset and 83.3%, 0.764, and 0.907 for the second dataset, respectively. For the gastric cancer, the proposed method obtained an accuracy of 85.9%, F1-score of 0.793, and quadratic weighted kappa of 0.939. We also found that the proposed method outperforms other competing models and is computationally efficient.
    CONCLUSIONS: The experimental results demonstrate that the prediction results by the proposed network are both accurate and reliable. The proposed network not only outperformed other related methods in cancer classification but also achieved superior computational efficiency during training and inference. The future study will entail further development of the proposed method and the application of the method to other problems and domains.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    未知疾病的出现通常很少或没有可用的样品。零射学习和少射学习在医学图像分析中具有广阔的应用前景。在本文中,我们提出了一种跨模态深度度量学习广义零分学习(CM-DML-GZSL)模型。拟议的网络由视觉特征提取器组成,固定的语义特征提取器,和深度回归模块。该网络属于用于多种模态的双流网络。在多标签设置中,每个样本平均包含少量阳性标签和大量阴性标签。这种正负不平衡主导了优化过程,并可能阻止在训练期间建立视觉特征和语义向量之间的有效对应关系。导致精度较低。在这方面引入了一种新颖的加权聚焦欧几里得距离度量损失。这种损失不仅可以动态增加硬质样品的重量,而且可以减少简单样品的重量,但它也可以促进样本和与其正标签相对应的语义向量之间的联系,这有助于减轻在广义零拍学习设置中预测看不见的类的偏差。加权聚焦欧氏距离度量损失函数可以动态调整样本权重,为胸部X光诊断提供零拍多标签学习,正如在大型公开数据集上的实验结果表明的那样。
    The emergence of unknown diseases is often with few or no samples available. Zero-shot learning and few-shot learning have promising applications in medical image analysis. In this paper, we propose a Cross-Modal Deep Metric Learning Generalized Zero-Shot Learning (CM-DML-GZSL) model. The proposed network consists of a visual feature extractor, a fixed semantic feature extractor, and a deep regression module. The network belongs to a two-stream network for multiple modalities. In a multi-label setting, each sample contains a small number of positive labels and a large number of negative labels on average. This positive-negative imbalance dominates the optimization procedure and may prevent the establishment of an effective correspondence between visual features and semantic vectors during training, resulting in a low degree of accuracy. A novel weighted focused Euclidean distance metric loss is introduced in this regard. This loss not only can dynamically increase the weight of hard samples and decrease the weight of simple samples, but it can also promote the connection between samples and semantic vectors corresponding to their positive labels, which helps mitigate bias in predicting unseen classes in the generalized zero-shot learning setting. The weighted focused Euclidean distance metric loss function can dynamically adjust sample weights, enabling zero-shot multi-label learning for chest X-ray diagnosis, as experimental results on large publicly available datasets demonstrate.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在人脸识别系统中,光的方向,反射,面部的情绪和身体变化是使识别困难的一些主要因素。研究人员继续致力于基于深度学习的算法来克服这些困难。开发能够高精度工作并降低计算成本的模型至关重要,尤其是在实时人脸识别系统中。称为代表性学习的深度度量学习算法在该领域中经常是优选的。然而,除了提取突出的代表性特征外,这些特征向量的适当分类也是影响性能的重要因素。本研究中的场景变化指标(SCI)旨在通过深度度量学习模型来降低或消除滑动窗口中的错误识别率。该模型检测场景不改变的块,并尝试更精确地用新值识别分类器阶段中使用的比较阈值。增加跨不变场景块的灵敏度比率允许在数据库中的样本之间进行更少的比较。实验研究中提出的模型与原始深度度量学习模型相比,准确率达到了99.25%,F-1得分值达到了99.28%。实验结果表明,即使在不变的场景中,同一个人的面部图像存在差异,错误识别可以被最小化,因为被比较的样本面积被缩小。
    In face recognition systems, light direction, reflection, and emotional and physical changes on the face are some of the main factors that make recognition difficult. Researchers continue to work on deep learning-based algorithms to overcome these difficulties. It is essential to develop models that will work with high accuracy and reduce the computational cost, especially in real-time face recognition systems. Deep metric learning algorithms called representative learning are frequently preferred in this field. However, in addition to the extraction of outstanding representative features, the appropriate classification of these feature vectors is also an essential factor affecting the performance. The Scene Change Indicator (SCI) in this study is proposed to reduce or eliminate false recognition rates in sliding windows with a deep metric learning model. This model detects the blocks where the scene does not change and tries to identify the comparison threshold value used in the classifier stage with a new value more precisely. Increasing the sensitivity ratio across the unchanging scene blocks allows for fewer comparisons among the samples in the database. The model proposed in the experimental study reached 99.25% accuracy and 99.28% F-1 score values ​​compared to the original deep metric learning model. Experimental results show that even if there are differences in facial images of the same person in unchanging scenes, misrecognition can be minimized because the sample area being compared is narrowed.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    从临床图像中提取的生物指纹可用于患者身份验证,以确定图片存档和通信系统中的错误归档的临床图像。然而,这些方法尚未纳入临床使用,并且它们的性能会随着临床图像的变化而降低。深度学习可以用来提高这些方法的性能。提出了一种新颖的方法,可以使用后前(PA)和前后(AP)胸部X射线图像在被检查的患者中自动识别个体。所提出的方法使用基于深度卷积神经网络(DCNN)的深度度量学习来克服患者验证和识别的极端分类要求。它在NIH胸部X射线数据集(ChestX-ray8)上进行了三个步骤的训练:预处理,具有EfficientNetV2-S骨干的DCNN特征提取,和深度度量学习的分类。所提出的方法是使用两个公共数据集和两个临床胸部X射线图像数据集进行评估,这些数据集包含来自接受筛查和医院护理的患者的数据。在包含PA和AP视图位置的PadChest数据集上,预训练了300个时期的1280维特征提取器在接收器工作特性曲线下的面积为0.9894,误差率为0.0269,并且前1精度为0.839时表现最佳。这项研究的结果为自动患者识别的发展提供了相当多的见解,以减少由于人为错误而导致医疗事故的可能性。
    Biological fingerprints extracted from clinical images can be used for patient identity verification to determine misfiled clinical images in picture archiving and communication systems. However, such methods have not been incorporated into clinical use, and their performance can degrade with variability in the clinical images. Deep learning can be used to improve the performance of these methods. A novel method is proposed to automatically identify individuals among examined patients using posteroanterior (PA) and anteroposterior (AP) chest X-ray images. The proposed method uses deep metric learning based on a deep convolutional neural network (DCNN) to overcome the extreme classification requirements for patient validation and identification. It was trained on the NIH chest X-ray dataset (ChestX-ray8) in three steps: preprocessing, DCNN feature extraction with an EfficientNetV2-S backbone, and classification with deep metric learning. The proposed method was evaluated using two public datasets and two clinical chest X-ray image datasets containing data from patients undergoing screening and hospital care. A 1280-dimensional feature extractor pretrained for 300 epochs performed the best with an area under the receiver operating characteristic curve of 0.9894, an equal error rate of 0.0269, and a top-1 accuracy of 0.839 on the PadChest dataset containing both PA and AP view positions. The findings of this study provide considerable insights into the development of automated patient identification to reduce the possibility of medical malpractice due to human errors.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:结直肠癌是全球癌症相关死亡的主要原因。预防CRC的最佳方法是结肠镜检查。然而,并非所有结肠息肉都有癌变的风险。因此,息肉使用不同的分类系统进行分类。分类后,进一步的治疗和程序是基于息肉的分类。然而,分类并不容易。因此,我们建议使用两种新型自动分类系统,帮助胃肠病学家根据NICE和Paris分类对息肉进行分类.
    方法:我们建立了两个分类系统。一种是根据息肉的形状对息肉进行分类(巴黎)。另一种是根据息肉的纹理和表面图案(NICE)对息肉进行分类。介绍了巴黎分类的两步过程:首先,检测和裁剪图像上的息肉,其次,用变压器网络根据裁剪区域对息肉进行分类。对于NICE分类,我们设计了一种基于深度度量学习方法的少射学习算法。该算法为息肉创建了一个嵌入空间,它允许从几个例子中进行分类,以说明我们数据库中NICE注释图像的数据稀缺性。
    结果:对于巴黎分类,我们达到了89.35%的准确率,超越了文献中的所有论文,并为公共数据集上的其他出版物建立了新的最新技术和基线准确性。对于NICE分类,我们获得了81.13%的竞争准确率,从而证明了在数据稀缺的环境中,少射学习范式在息肉分类中的可行性.此外,我们展示了算法的不同烧蚀。最后,通过显示解释神经激活的神经网络的热图,我们进一步阐述了系统的可解释性。
    结论:总的来说,我们介绍了两种息肉分类系统来帮助胃肠病学家。我们在巴黎分类中实现了最先进的性能,并在NICE分类中展示了少射学习范式的可行性,解决医疗机器学习中面临的普遍数据稀缺问题。
    Colorectal cancer is a leading cause of cancer-related deaths worldwide. The best method to prevent CRC is a colonoscopy. However, not all colon polyps have the risk of becoming cancerous. Therefore, polyps are classified using different classification systems. After the classification, further treatment and procedures are based on the classification of the polyp. Nevertheless, classification is not easy. Therefore, we suggest two novel automated classifications system assisting gastroenterologists in classifying polyps based on the NICE and Paris classification.
    We build two classification systems. One is classifying polyps based on their shape (Paris). The other classifies polyps based on their texture and surface patterns (NICE). A two-step process for the Paris classification is introduced: First, detecting and cropping the polyp on the image, and secondly, classifying the polyp based on the cropped area with a transformer network. For the NICE classification, we design a few-shot learning algorithm based on the Deep Metric Learning approach. The algorithm creates an embedding space for polyps, which allows classification from a few examples to account for the data scarcity of NICE annotated images in our database.
    For the Paris classification, we achieve an accuracy of 89.35 %, surpassing all papers in the literature and establishing a new state-of-the-art and baseline accuracy for other publications on a public data set. For the NICE classification, we achieve a competitive accuracy of 81.13 % and demonstrate thereby the viability of the few-shot learning paradigm in polyp classification in data-scarce environments. Additionally, we show different ablations of the algorithms. Finally, we further elaborate on the explainability of the system by showing heat maps of the neural network explaining neural activations.
    Overall we introduce two polyp classification systems to assist gastroenterologists. We achieve state-of-the-art performance in the Paris classification and demonstrate the viability of the few-shot learning paradigm in the NICE classification, addressing the prevalent data scarcity issues faced in medical machine learning.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号