pelvis radiographs

  • 文章类型: Journal Article
    背景:医学图像分析管道通常涉及分割,这需要大量注释的训练数据,这是耗时且昂贵的。为了解决这个问题,我们提出了利用生成模型来实现少拍图像分割。
    方法:我们在480,407个骨盆射线照片上训练了去噪扩散概率模型(DDPM),以生成256±256px的合成图像。DDPM以人口统计学和放射学特征为条件,并由领域专家和客观图像质量指标(Frechet起始距离[FID]和起始评分[IS])进行严格验证。下一步,三个地标(大转子[GT],小转子[LT],和闭孔[OF])在45例真实患者的X光片上进行了注释;25例用于训练,20例用于测试。要提取特征,每个图像在三个时间步长和每次通过预先训练的DDPM,从特定块中提取特征。将特征与真实图像连接以形成具有4225个通道的图像。功能集被分成随机补丁,被送到U网。使用骰子相似系数(DSC)与在射线照片上训练的香草U-Net进行性能比较。
    结果:确定真实图像与生成图像的专家准确度为57.5%,而模型达到FID=7.2和IS=210。在20个特征集上训练的分割UNet对于OF实现了0.90、0.84和0.61的DSC,GT,和LT分割,分别,至少比天真训练的模型高0.30点。
    结论:我们证明了DDPM作为特征提取器的适用性,促进医学图像分割,很少有注释样本。
    BACKGROUND: Medical image analysis pipelines often involve segmentation, which requires a large amount of annotated training data, which is time-consuming and costly. To address this issue, we proposed leveraging generative models to achieve few-shot image segmentation.
    METHODS: We trained a denoising diffusion probabilistic model (DDPM) on 480,407 pelvis radiographs to generate 256 ✕ 256 px synthetic images. The DDPM was conditioned on demographic and radiologic characteristics and was rigorously validated by domain experts and objective image quality metrics (Frechet inception distance [FID] and inception score [IS]). For the next step, three landmarks (greater trochanter [GT], lesser trochanter [LT], and obturator foramen [OF]) were annotated on 45 real-patient radiographs; 25 for training and 20 for testing. To extract features, each image was passed through the pre-trained DDPM at three timesteps and for each pass, features from specific blocks were extracted. The features were concatenated with the real image to form an image with 4225 channels. The feature-set was broken into random patches, which were fed to a U-Net. Dice Similarity Coefficient (DSC) was used to compare the performance with a vanilla U-Net trained on radiographs.
    RESULTS: Expert accuracy was 57.5 % in determining real versus generated images, while the model reached an FID = 7.2 and IS = 210. The segmentation UNet trained on the 20 feature-sets achieved a DSC of 0.90, 0.84, and 0.61 for OF, GT, and LT segmentation, respectively, which was at least 0.30 points higher than the naively trained model.
    CONCLUSIONS: We demonstrated the applicability of DDPMs as feature extractors, facilitating medical image segmentation with few annotated samples.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:在这项工作中,我们应用并验证了一种称为生成对抗网络(GAN)的人工智能(AI)技术,以创建大量高保真合成前后(AP)骨盆X射线照片,可以实现基于深度学习(DL)的图像分析。确保患者隐私。
    方法:从1998年至2018年的机构注册中收集了具有天然臀部的AP骨盆X射线照片。数据用于训练模型以创建512×512像素的合成AP骨盆图像。该网络接受了通过增强生成的2500万张图像的培训。由三名整形外科医生和两名放射科医生评估了一组100张随机图像(50/50真实/合成),以区分真实图像与合成图像。使用合成图像训练了两个模型(联合定位和分割),并在真实图像上进行了测试。
    结果:最终模型在37,640张真实X射线照片(16,782名患者)上进行了训练。在图像保真度的计算机评估中,最终模型获得了“优秀”评级。在对配对图像的盲目审查中(1真实,1合成),整形外科医生评审人员无法正确识别哪幅图像是合成的(准确率=55%,Kappa=0.11),突出显示合成图像保真度。当通过建立的DL模型评估合成图像和真实图像时,它们显示出等效的性能。
    结论:这项工作显示了使用DL技术生成大量高保真合成骨盆图像的能力,这些图像无法从计算机或专家的真实成像中辨别。这些图像可用于跨机构共享和模型预培训,进一步推进DL模型的性能,而不会对患者数据安全造成风险。
    In this work, we applied and validated an artificial intelligence technique known as generative adversarial networks (GANs) to create large volumes of high-fidelity synthetic anteroposterior (AP) pelvis radiographs that can enable deep learning (DL)-based image analyses, while ensuring patient privacy.
    AP pelvis radiographs with native hips were gathered from an institutional registry between 1998 and 2018. The data was used to train a model to create 512 × 512 pixel synthetic AP pelvis images. The network was trained on 25 million images produced through augmentation. A set of 100 random images (50/50 real/synthetic) was evaluated by 3 orthopaedic surgeons and 2 radiologists to discern real versus synthetic images. Two models (joint localization and segmentation) were trained using synthetic images and tested on real images.
    The final model was trained on 37,640 real radiographs (16,782 patients). In a computer assessment of image fidelity, the final model achieved an \"excellent\" rating. In a blinded review of paired images (1 real, 1 synthetic), orthopaedic surgeon reviewers were unable to correctly identify which image was synthetic (accuracy = 55%, Kappa = 0.11), highlighting synthetic image fidelity. The synthetic and real images showed equivalent performance when they were assessed by established DL models.
    This work shows the ability to use a DL technique to generate a large volume of high-fidelity synthetic pelvis images not discernible from real imaging by computers or experts. These images can be used for cross-institutional sharing and model pretraining, further advancing the performance of DL models without risk to patient data safety.
    Level III.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号