UNet

UNet
  • 文章类型: Journal Article
    腹部有多个重要器官,与各种疾病有关,对人类健康构成重大风险。及早发现腹部器官状况,可以及时进行干预和治疗,防止患者健康恶化。分割腹部器官有助于医生更准确地诊断器官病变。然而,腹部器官的解剖结构相对复杂,器官相互重叠,共享类似的功能,从而为细分任务提出了挑战。在真实的医疗场景中,模型必须展示实时和低延迟功能,需要在最小化参数数量的同时提高分割精度。研究人员开发了各种腹部器官分割方法,从卷积神经网络(CNN)到变形金刚。然而,这些方法在准确识别器官分割边界时经常遇到困难。MetaFormer抽象了变形金刚的框架,不包括多头自我关注,为解决计算机视觉问题和克服视觉变形金刚和CNN骨干网络的局限性提供了新的视角。为了进一步提高分割效果,我们提出了一个U形网络,集成SEFormer和深度级联上采样(dCUP)作为编码器和解码器,分别,进入UNet结构,名为SEF-UNet。SEFormer将挤压和激励模块与深度可分离卷积相结合,实例化MetaFormer框架,增强局部细节和纹理信息的捕获,从而提高边缘分割精度。dCUP在上采样过程中进一步集成了浅层和深层信息层。我们的模型显着提高了分割精度,同时减少了参数计数,并在分割彼此重叠的器官边缘方面表现出卓越的性能,从而在真实的医疗场景中提供潜在的部署。
    The abdomen houses multiple vital organs, which are associated with various diseases posing significant risks to human health. Early detection of abdominal organ conditions allows for timely intervention and treatment, preventing deterioration of patients\' health. Segmenting abdominal organs aids physicians in more accurately diagnosing organ lesions. However, the anatomical structures of abdominal organs are relatively complex, with organs overlapping each other, sharing similar features, thereby presenting challenges for segmentation tasks. In real medical scenarios, models must demonstrate real-time and low-latency features, necessitating an improvement in segmentation accuracy while minimizing the number of parameters. Researchers have developed various methods for abdominal organ segmentation, ranging from convolutional neural networks (CNNs) to Transformers. However, these methods often encounter difficulties in accurately identifying organ segmentation boundaries. MetaFormer abstracts the framework of Transformers, excluding the multi-head Self-Attention, offering a new perspective for solving computer vision problems and overcoming the limitations of Vision Transformers and CNN backbone networks. To further enhance segmentation effectiveness, we propose a U-shaped network, integrating SEFormer and depthwise cascaded upsampling (dCUP) as the encoder and decoder, respectively, into the UNet structure, named SEF-UNet. SEFormer combines Squeeze-and-Excitation modules with depthwise separable convolutions, instantiating the MetaFormer framework, enhancing the capture of local details and texture information, thereby improving edge segmentation accuracy. dCUP further integrates shallow and deep information layers during the upsampling process. Our model significantly improves segmentation accuracy while reducing the parameter count and exhibits superior performance in segmenting organ edges that overlap each other, thereby offering potential deployment in real medical scenarios.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:MRI图像上的膀胱癌(BC)分割是确定是否存在肌肉浸润的第一步。本研究旨在评估三种深度学习(DL)模型在多参数MRI(mp-MRI)图像上的肿瘤分割性能。
    方法:我们研究了53例膀胱癌患者。膀胱肿瘤在T2加权(T2WI)的每个切片上进行分割,扩散加权成像/表观扩散系数(DWI/ADC),和在3TeslaMRI扫描仪上采集的T1加权对比增强(T1WI)图像。我们训练了Unet,MAnet,和PSPnet使用三个损失函数:交叉熵(CE),骰子相似系数损失(DSC),和病灶丢失(FL)。我们使用DSC评估了模型性能,Hausdorff距离(HD),和预期校准误差(ECE)。
    结果:具有CE+DSC损失函数的MAnet算法在ADC上给出了最高的DSC值,T2WI,和T1WI图像。PSPnet与CE+DSC在ADC上获得了最小的HDs,T2WI,和T1WI图像。总体上,ADC和T1WI的分割精度优于T2WI。在ADC图像上,带FL的PSPnet的ECE最小,而在T2WI和T1WI上使用CE+DSC的MAnet是最小的。
    结论:与Unet相比,根据评估指标的选择,具有混合CEDSC损失函数的MAnet和PSPnet在BC分割中显示出更好的性能。
    BACKGROUND: Bladder cancer (BC) segmentation on MRI images is the first step to determining the presence of muscular invasion. This study aimed to assess the tumor segmentation performance of three deep learning (DL) models on multi-parametric MRI (mp-MRI) images.
    METHODS: We studied 53 patients with bladder cancer. Bladder tumors were segmented on each slice of T2-weighted (T2WI), diffusion-weighted imaging/apparent diffusion coefficient (DWI/ADC), and T1-weighted contrast-enhanced (T1WI) images acquired at a 3Tesla MRI scanner. We trained Unet, MAnet, and PSPnet using three loss functions: cross-entropy (CE), dice similarity coefficient loss (DSC), and focal loss (FL). We evaluated the model performances using DSC, Hausdorff distance (HD), and expected calibration error (ECE).
    RESULTS: The MAnet algorithm with the CE+DSC loss function gave the highest DSC values on the ADC, T2WI, and T1WI images. PSPnet with CE+DSC obtained the smallest HDs on the ADC, T2WI, and T1WI images. The segmentation accuracy overall was better on the ADC and T1WI than on the T2WI. The ECEs were the smallest for PSPnet with FL on the ADC images, while they were the smallest for MAnet with CE+DSC on the T2WI and T1WI.
    CONCLUSIONS: Compared to Unet, MAnet and PSPnet with a hybrid CE+DSC loss function displayed better performances in BC segmentation depending on the choice of the evaluation metric.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    ECG通过记录心脏活动来帮助诊断心脏病。在长期测量中,由于传感器分离而发生数据丢失。因此,心电信号缺失数据的重建研究至关重要。然而,ECG需要用户参与并且不能用于连续心脏监测。PPG信号的连续监测相反是低成本的并且易于执行。在这项研究中,提出了一种深度神经网络模型,用于使用PPG数据重建丢失的ECG信号。该模型是以WNet架构为基础的端到端深度学习神经网络,在建立第二个模型时,添加了双向长短期记忆网络。使用来自MIMICIII匹配子集的146条记录来验证两个模型的性能。与参考文献相比,使用所提出的模型重建的ECG的皮尔逊相关系数为0.851,均方根误差(RMSE)为0.075,均方根差异百分比(PRD)为5.452,Fréchet距离(FD)为0.302。实验结果表明,从PPG重建丢失的ECG信号是可行的。
    ECG helps in diagnosing heart disease by recording heart activity. During long-term measurements, data loss occurs due to sensor detachment. Therefore, research into the reconstruction of missing ECG data is essential. However, ECG requires user participation and cannot be used for continuous heart monitoring. Continuous monitoring of PPG signals is conversely low-cost and easy to carry out. In this study, a deep neural network model is proposed for the reconstruction of missing ECG signals using PPG data. This model is an end-to-end deep learning neural network utilizing WNet architecture as a basis, on which a bidirectional long short-term memory network is added in establishing a second model. The performance of both models is verified using 146 records from the MIMIC III matched subset. Compared with the reference, the ECG reconstructed using the proposed model has a Pearson\'s correlation coefficient of 0.851, root mean square error (RMSE) of 0.075, percentage root mean square difference (PRD) of 5.452, and a Fréchet distance (FD) of 0.302. The experimental results demonstrate that it is feasible to reconstruct missing ECG signals from PPG.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    锥形束计算机断层扫描(CBCT)广泛应用于现代牙科,牙齿分割构成了基于这些成像数据的数字工作流程的组成部分。先前的方法严重依赖于手动分割,并且在临床实践中是耗时且费力的。最近,随着计算机视觉技术的进步,学者们进行了深入的研究,提出了各种快速准确的牙齿分割方法。在这次审查中,我们回顾了该领域的55篇文章,并讨论了其有效性,优势,以及每种方法的缺点。除了简单的分类和讨论,本文旨在揭示如何通过应用和改进现有的图像分割算法来改进牙齿分割方法,以解决诸如牙齿形态不规则和边界模糊等问题。假设随着这些方法的优化,手工操作将减少,和更高的精度和鲁棒性的牙齿分割将实现。最后,我们强调了这一领域仍然存在的挑战,并为未来的方向提供了前景。
    Cone beam computed tomography (CBCT) is widely employed in modern dentistry, and tooth segmentation constitutes an integral part of the digital workflow based on these imaging data. Previous methodologies rely heavily on manual segmentation and are time-consuming and labor-intensive in clinical practice. Recently, with advancements in computer vision technology, scholars have conducted in-depth research, proposing various fast and accurate tooth segmentation methods. In this review, we review 55 articles in this field and discuss the effectiveness, advantages, and disadvantages of each approach. In addition to simple classification and discussion, this review aims to reveal how tooth segmentation methods can be improved by the application and refinement of existing image segmentation algorithms to solve problems such as irregular morphology and fuzzy boundaries of teeth. It is assumed that with the optimization of these methods, manual operation will be reduced, and greater accuracy and robustness in tooth segmentation will be achieved. Finally, we highlight the challenges that still exist in this field and provide prospects for future directions.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    胶质瘤是一种常见于中枢神经系统的原发性恶性颅脑肿瘤。根据研究,术前诊断和充分了解胶质瘤的影像学特征是非常重要的。尽管如此,传统的图像分配和机器智慧的分割方法在神经胶质瘤分割中是不可接受的。该分析探讨了磁共振成像(MRI)脑肿瘤图像作为脑胶质瘤有效分割方法的潜力。
    这项研究使用了来自附属医院的200张MRI图像,并应用了2维残差块UNet(2DResUNet)。使用2×2内核大小(64内核)1步2D卷积(Conv)层从输入图像中提取特征。本研究中实现的2DDDenseUNet模型在UNet架构中引入了ResBlock机制,以及在输入阶段用于数据增强的高斯噪声层,以及用于替换传统2D卷积层的池化层。最后,验证了所提出的协议的性能及其在胶质瘤分割中的有效措施。
    5倍交叉验证评估的结果表明,尽管对Dice评分的评估结果略低,但提出的2DSesUNet和2DDenseUNet结构具有很高的灵敏度。同时,与实验中使用的其他模型相比,本文提出的DM-DA-UNet模型在各项指标上有了显著的改进,提高了模型的可靠性,为临床治疗策略的准确制定提供了参考和依据。本研究中使用的方法显示出比UNet模型更强的特征提取能力。此外,我们的发现表明,在训练过程中使用广义死亡伤害和偏见交叉熵作为损失函数有效地缓解了神经胶质瘤数据的类失衡,并有效地分割了神经胶质瘤。
    基于改进的UNet网络的方法在MRI脑肿瘤纵向分割程序中具有明显的优势。结果表明,我们开发了一个2D残差块UNet,这可以提高胶质瘤分割在临床过程中的结合。
    UNASSIGNED: Glioma is a primary malignant craniocerebral tumor commonly found in the central nervous system. According to research, preoperative diagnosis of glioma and a full understanding of its imaging features are very significant. Still, the traditional segmentation methods of image dispensation and machine wisdom are not acceptable in glioma segmentation. This analysis explores the potential of magnetic resonance imaging (MRI) brain tumor images as an effective segmentation method of glioma.
    UNASSIGNED: This study used 200 MRI images from the affiliated hospital and applied the 2-dimensional residual block UNet (2DResUNet). Features were extracted from input images using a 2×2 kernel size (64-kernel) 1-step 2D convolution (Conv) layer. The 2DDenseUNet model implemented in this study incorporates a ResBlock mechanism within the UNet architecture, as well as a Gaussian noise layer for data augmentation at the input stage, and a pooling layer for replacing the conventional 2D convolutional layers. Finally, the performance of the proposed protocol and its effective measures in glioma segmentation were verified.
    UNASSIGNED: The outcomes of the 5-fold cross-validation evaluation show that the proposed 2DResUNet and 2DDenseUNet structure has a high sensitivity despite the slightly lower evaluation result on the Dice score. At the same time, compared with other models used in the experiment, the DM-DA-UNet model proposed in this paper was significantly improved in various indicators, increasing the reliability of the model and providing a reference and basis for the accurate formulation of clinical treatment strategies. The method used in this study showed stronger feature extraction ability than the UNet model. In addition, our findings demonstrated that using generalized die harm and prejudiced cross entropy as loss functions in the training process effectively alleviated the class imbalance of glioma data and effectively segmented glioma.
    UNASSIGNED: The method based on the improved UNet network has obvious advantages in the MRI brain tumor portrait segmentation procedure. The result showed that we developed a 2D residual block UNet, which can improve the incorporation of glioma segmentation into the clinical process.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    全球范围内获得乳腺癌诊断的机会有限导致治疗延迟。超声波,一种有效但未得到充分利用的方法,需要对超声波检查者进行专门的培训,这阻碍了它的广泛使用。体积扫描成像(VSI)是一种创新方法,可使未经培训的操作员捕获高质量的超声图像。结合深度学习,比如卷积神经网络,它有可能改变乳腺癌的诊断,提高准确性,节省时间和成本,改善患者预后。广泛使用的UNet架构,以医学图像分割而闻名,有局限性,如梯度消失,缺乏多尺度特征提取和选择性区域注意。在这项研究中,我们提出了一种新的分割模型,称为小波_注意力_UNet(WATUNet)。在这个模型中,我们在编码器和解码器之间合并了小波门和注意门,而不是简单的连接来克服上述限制,从而提高模型性能。使用两个数据集进行分析:780张图像的公共“乳腺超声图像”数据集和3818张图像的私人VSI数据集,作者在罗切斯特大学拍摄。两个数据集都包含分为三种类型的分段病变:无肿块,良性肿块,和恶性肿块。我们的细分结果显示,与其他深度网络相比,性能更优越。所提出的算法在VSI数据集上获得了0.94的Dice系数和0.94的F1得分,在公共数据集上得分为0.93和0.94。分别。此外,我们的模型在McNemar的测试中显著优于其他模型,在381图像VSI集上进行了错误发现率校正。实验结果表明,所提出的WATUNet模型在标准护理和VSI图像中均实现了乳腺病变的精确分割,超越最先进的模型。因此,该模型在协助病变识别方面具有相当大的前景,临床诊断乳腺病变的重要步骤。
    Limited access to breast cancer diagnosis globally leads to delayed treatment. Ultrasound, an effective yet underutilized method, requires specialized training for sonographers, which hinders its widespread use. Volume sweep imaging (VSI) is an innovative approach that enables untrained operators to capture high-quality ultrasound images. Combined with deep learning, like convolutional neural networks, it can potentially transform breast cancer diagnosis, enhancing accuracy, saving time and costs, and improving patient outcomes. The widely used UNet architecture, known for medical image segmentation, has limitations, such as vanishing gradients and a lack of multi-scale feature extraction and selective region attention. In this study, we present a novel segmentation model known as Wavelet_Attention_UNet (WATUNet). In this model, we incorporate wavelet gates and attention gates between the encoder and decoder instead of a simple connection to overcome the limitations mentioned, thereby improving model performance. Two datasets are utilized for the analysis: the public \'Breast Ultrasound Images\' dataset of 780 images and a private VSI dataset of 3818 images, captured at the University of Rochester by the authors. Both datasets contained segmented lesions categorized into three types: no mass, benign mass, and malignant mass. Our segmentation results show superior performance compared to other deep networks. The proposed algorithm attained a Dice coefficient of 0.94 and an F1 score of 0.94 on the VSI dataset and scored 0.93 and 0.94 on the public dataset, respectively. Moreover, our model significantly outperformed other models in McNemar\'s test with false discovery rate correction on a 381-image VSI set. The experimental findings demonstrate that the proposed WATUNet model achieves precise segmentation of breast lesions in both standard-of-care and VSI images, surpassing state-of-the-art models. Hence, the model holds considerable promise for assisting in lesion identification, an essential step in the clinical diagnosis of breast lesions.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    基于SwinTransformer的各种分割网络已在医疗分割任务中显示出希望。尽管如此,精度较低和训练收敛较慢等挑战一直存在。为了解决这些问题,我们介绍了一种新颖的方法,结合了Swin变压器和可变形变压器,以提高整体模型性能。我们利用SwinTransformer的窗口注意机制来捕获局部特征信息,并使用可变形Transformer动态调整采样位置,加速模型收敛并使其与对象形状和大小更紧密地对齐。通过合并两个变压器模块并合并额外的跳过连接,以最大程度地减少信息丢失,我们提出的模型擅长快速准确地分割CT或X线肺部图像.实验结果表明,展示了我们模型的非凡能力。它超越了独立的SwinTransformer的SwinUnet的性能,并在相同条件下更快地收敛,在COVID-19CT扫描病变分割数据集和胸部X射线掩模和标签数据集上,准确率分别提高了0.7%(88.18%)和2.7%(98.01%),分别。这一进步有可能帮助医生进行早期诊断和治疗决策。
    Various segmentation networks based on Swin Transformer have shown promise in medical segmentation tasks. Nonetheless, challenges such as lower accuracy and slower training convergence have persisted. To tackle these issues, we introduce a novel approach that combines the Swin Transformer and Deformable Transformer to enhance overall model performance. We leverage the Swin Transformer\'s window attention mechanism to capture local feature information and employ the Deformable Transformer to adjust sampling positions dynamically, accelerating model convergence and aligning it more closely with object shapes and sizes. By amalgamating both Transformer modules and incorporating additional skip connections to minimize information loss, our proposed model excels at rapidly and accurately segmenting CT or X-ray lung images. Experimental results demonstrate the remarkable, showcasing the significant prowess of our model. It surpasses the performance of the standalone Swin Transformer\'s Swin Unet and converges more rapidly under identical conditions, yielding accuracy improvements of 0.7% (resulting in 88.18%) and 2.7% (resulting in 98.01%) on the COVID-19 CT scan lesion segmentation dataset and Chest X-ray Masks and Labels dataset, respectively. This advancement has the potential to aid medical practitioners in early diagnosis and treatment decision-making.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    慢性伤口极大地影响了生活质量。它需要比急性伤口更多的重症监护。与他们的医生安排后续预约以跟踪愈合情况。良好的伤口治疗促进愈合和减少问题。伤口护理需要精确可靠的伤口测量,以根据循证最佳实践优化患者治疗和结果。图像用于通过量化关键愈合参数来客观地评估伤口状态。然而,由于伤口类型和成像条件的高度多样性,伤口图像的鲁棒分割是复杂的。本研究提出并评估了一种用于医学图像中伤口分割的新型混合模型。该模型将先进的深度学习技术与传统的图像处理方法相结合,提高了伤口分割的准确性和可靠性。主要目标是通过利用两种范例的综合优势来克服现有分割方法(UNet)的局限性。在我们的调查中,我们引入了一种混合模型架构,其中ResNet34用作编码器,并且采用UNet作为解码器。ResNet34的深度表示学习和UNet的高效特征提取的结合产生了显著的好处。建筑设计成功地融合了高层和低层的特征,能够生成高精度和准确性的分割图。在将我们的模型实现到实际数据之后,我们能够确定以下联合交集(IOU)的值,骰子得分,精度分别为0.973、0.986和0.9736。根据取得的成果,所提出的方法比当前最先进的方法更精确和准确。
    Quality of life is greatly affected by chronic wounds. It requires more intensive care than acute wounds. Schedule follow-up appointments with their doctor to track healing. Good wound treatment promotes healing and fewer problems. Wound care requires precise and reliable wound measurement to optimize patient treatment and outcomes according to evidence-based best practices. Images are used to objectively assess wound state by quantifying key healing parameters. Nevertheless, the robust segmentation of wound images is complex because of the high diversity of wound types and imaging conditions. This study proposes and evaluates a novel hybrid model developed for wound segmentation in medical images. The model combines advanced deep learning techniques with traditional image processing methods to improve the accuracy and reliability of wound segmentation. The main objective is to overcome the limitations of existing segmentation methods (UNet) by leveraging the combined advantages of both paradigms. In our investigation, we introduced a hybrid model architecture, wherein a ResNet34 is utilized as the encoder, and a UNet is employed as the decoder. The combination of ResNet34\'s deep representation learning and UNet\'s efficient feature extraction yields notable benefits. The architectural design successfully integrated high-level and low-level features, enabling the generation of segmentation maps with high precision and accuracy. Following the implementation of our model to the actual data, we were able to determine the following values for the Intersection over Union (IOU), Dice score, and accuracy: 0.973, 0.986, and 0.9736, respectively. According to the achieved results, the proposed method is more precise and accurate than the current state-of-the-art.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    最近,视觉变换器(ViT)和多层感知器(MLP)相结合的医学图像分割方案得到了广泛的应用。然而,其缺点之一是不同级别的特征融合能力弱,缺乏灵活的定位信息。为了减少编码和解码阶段之间的语义差距,我们提出了一种具有多尺度特征融合Unet的混合conv-MLP网络(MCNMF-Unet)用于医学图像分割。MCNMF-Unet是基于卷积和MLP的U形网络,它不仅继承了卷积在提取底层特征和视觉结构方面的优势,而且还利用MLP来融合网络各层的局部和全局信息。MCNMF-Unet在每个网络阶段进行多层融合和多尺度特征图跳过连接,从而可以充分利用所有特征信息并缓解梯度消失问题。此外,MCNMF-Unet包含多轴和多窗口MLP模块。该模块是完全端到端的,无需考虑图像裁剪的负面影响。它不仅融合了来自多个维度和接收域的信息,而且减少了参数的数量和计算复杂度。我们在布西评估了提出的模型,ISIC2018和CVC-ClinicDB数据集。实验结果表明,我们提出的模型的性能优于大多数现有的网络,IoU为84.04%,F1得分为91.18%。
    Recently, the medical image segmentation scheme combining Vision Transformer (ViT) and multilayer perceptron (MLP) has been widely used. However, one of its disadvantages is that the feature fusion ability of different levels is weak and lacks flexible localization information. To reduce the semantic gap between the encoding and decoding stages, we propose a mixture conv-MLP network with multi-scale features fusion Unet (MCNMF-Unet) for medical image segmentation. MCNMF-Unet is a U-shaped network based on convolution and MLP, which not only inherits the advantages of convolutional in extracting underlying features and visual structures, but also utilizes MLP to fuse local and global information of each layer of the network. MCNMF-Unet performs multi-layer fusion and multi-scale feature map skip connections in each network stage so that all the feature information can be fully utilized and the gradient disappearance problem can be alleviated. Additionally, MCNMF-Unet incorporates a multi-axis and multi-windows MLP module. This module is fully end-to-end and eliminates the need to consider the negative impact of image cropping. It not only fuses information from multiple dimensions and receptive fields but also reduces the number of parameters and computational complexity. We evaluated the proposed model on BUSI, ISIC2018 and CVC-ClinicDB datasets. The experimental results show that the performance of our proposed model is superior to most existing networks, with an IoU of 84.04% and a F1-score of 91.18%.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    为了实现输电线路的自动规划,一个关键步骤是准确识别遥感图像的特征信息。考虑到特征信息深度不同,特征分布不均匀,提出了一种基于AS-Unet++的语义分割方法。首先,在传统的Unet中增加了atrous空间金字塔池化(ASPP)和挤压激励(SE)模块,这样可以扩展传感场,增强重要特征,这就是所谓的AS-Unet。第二,AS-Unet++结构是通过使用不同的AS-Unet层构建的,使得AS-Unet的每一层的特征提取部分堆叠在一起。与Unet相比,提出的AS-Unet++自动学习不同深度的特征,并确定具有最佳性能的深度。一旦确定了最佳网络层数,多余的层可以修剪,这将大大减少训练参数的数量。实验结果表明,与Unet相比,AS-Unet++的整体识别精度明显提高。
    In order to achieve the automatic planning of power transmission lines, a key step is to precisely recognize the feature information of remote sensing images. Considering that the feature information has different depths and the feature distribution is not uniform, a semantic segmentation method based on a new AS-Unet++ is proposed in this paper. First, the atrous spatial pyramid pooling (ASPP) and the squeeze-and-excitation (SE) module are added to traditional Unet, such that the sensing field can be expanded and the important features can be enhanced, which is called AS-Unet. Second, an AS-Unet++ structure is built by using different layers of AS-Unet, such that the feature extraction parts of each layer of AS-Unet are stacked together. Compared with Unet, the proposed AS-Unet++ automatically learns features at different depths and determines a depth with optimal performance. Once the optimal number of network layers is determined, the excess layers can be pruned, which will greatly reduce the number of trained parameters. The experimental results show that the overall recognition accuracy of AS-Unet++ is significantly improved compared to Unet.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号