Spatial pyramid pooling

空间金字塔池
  • 文章类型: Journal Article
    背景:睡眠纺锤体已成为评估认知能力和相关疾病的有价值的生物标志物,强调其在临床研究中检测的重要性。然而,使用固定模板的基于模板匹配的算法可能无法完全适应不同持续时间的主轴。此外,受图像多尺度特征提取的启发,使用多尺度特征提取方法可以更好地适应不同频率和持续时间的主轴。
    方法:因此,本研究提出了一种新颖的基于弹性时间窗和空间金字塔池化(SPP)的自动主轴检测算法,用于提取多尺度特征。该算法利用弹性时间窗对脑电图(EEG)信号进行分割,实现跨多个尺度的特征提取。这种方法可以适应不同EEG时期纺锤体持续时间和极化定位的显着变化。此外,空间金字塔池化被集成到深度可分离卷积(DSC)网络中,以在不同尺度上对分段主轴信号特征执行多尺度池化。
    结果:与现有模板匹配算法相比,该算法的主轴波偏振定位更符合实际情况。在公开数据集DREAMS上进行的实验结果表明,该算法的平均准确率达到95.75%,平均阴性预测值(NPV)为96.55%,表明其先进的性能。
    结论:通过彻底的消融实验验证了每个模块的有效性。更重要的是,当面对不同实验对象的变化时,该算法具有较强的鲁棒性。这一特点使算法在识别睡眠纺锤时更加准确,有望帮助专家自动检测睡眠脑电图信号中的纺锤,减少人工检测的工作量和时间,提高效率。
    BACKGROUND: Sleep spindles have emerged as valuable biomarkers for assessing cognitive abilities and related disorders, underscoring the importance of their detection in clinical research. However, template matching-based algorithms using fixed templates may not be able to fully adapt to spindles of different durations. Moreover, inspired by the multiscale feature extraction of images, the use of multiscale feature extraction methods can be used to better adapt to spindles of different frequencies and durations.
    METHODS: Therefore, this study proposes a novel automatic spindle detection algorithm based on elastic time windows and spatial pyramid pooling (SPP) for extracting multiscale features. The algorithm utilizes elastic time windows to segment electroencephalogram (EEG) signals, enabling the extraction of features across multiple scales. This approach accommodates significant variations in spindle duration and polarization positioning during different EEG epochs. Additionally, spatial pyramid pooling is integrated into a depthwise separable convolutional (DSC) network to perform multiscale pooling on the segmented spindle signal features at different scales.
    RESULTS: Compared with existing template matching algorithms, this algorithm\'s spindle wave polarization positioning is more consistent with the real situation. Experimental results conducted on the public dataset DREAMS show that the average accuracy of this algorithm reaches 95.75%, with an average negative predictive value (NPV) of 96.55%, indicating its advanced performance.
    CONCLUSIONS: The effectiveness of each module was verified through thorough ablation experiments. More importantly, the algorithm shows strong robustness when faced with changes in different experimental subjects. This feature makes the algorithm more accurate at identifying sleep spindles and is expected to help experts automatically detect spindles in sleep EEG signals, reduce the workload and time of manual detection, and improve efficiency.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    如果各种植物营养缺乏,可以提高作物产量以促进农业生长,疾病在早期阶段被识别和发现。因此,植物的持续健康监测对于处理植物应激至关重要。深度学习方法已经证明了其在从叶子的视觉症状自动检测植物病害和营养缺乏方面的卓越性能。本文提出了一种使用图卷积网络(GNN)进行植物营养缺乏和疾病分类的新深度学习方法,添加到基础卷积神经网络(CNN)上。有时候,全局特征描述符可能无法捕获病叶的重要区域,导致疾病分类不准确。为了解决这个问题,区域特征学习对于整体特征聚合至关重要。在这项工作中,使用空间金字塔池进行区分性特征表示,探索了多尺度的基于区域的特征摘要。此外,开发了GCN,以学习更精细的细节,以对植物病害和营养素不足进行分类。所提出的方法,称为植物营养缺乏和疾病网络(PND-Net),已经在两个营养缺乏的公共数据集上进行了评估,和两个使用四个骨干CNN进行疾病分类。建议的PND-Net的最佳分类性能如下:(a)90.00%香蕉和90.54%咖啡营养缺乏;(b)使用Xception骨干的PlantDoc数据集上的马铃薯病和84.30%。此外,为了推广,已经进行了额外的实验,并且所提出的方法在两个公共数据集上实现了最先进的性能,即乳腺癌组织病理学图像分类(BreakHis40×:95.50%,和BreakHis100×:96.79%的准确率)和子宫颈抹片图像中的单细胞用于宫颈癌分类(SIPaKMeD:99.18%的准确率)。此外,所提出的方法已使用五折交叉验证进行了评估,并在这些数据集上实现了改进的性能。显然,提出的PND-Net有效地提高了各种植物在真实和复杂的田间环境中的自动健康分析的性能,暗示PND-Net适合农业增长和人类癌症分类。
    Crop yield production could be enhanced for agricultural growth if various plant nutrition deficiencies, and diseases are identified and detected at early stages. Hence, continuous health monitoring of plant is very crucial for handling plant stress. The deep learning methods have proven its superior performances in the automated detection of plant diseases and nutrition deficiencies from visual symptoms in leaves. This article proposes a new deep learning method for plant nutrition deficiencies and disease classification using a graph convolutional network (GNN), added upon a base convolutional neural network (CNN). Sometimes, a global feature descriptor might fail to capture the vital region of a diseased leaf, which causes inaccurate classification of disease. To address this issue, regional feature learning is crucial for a holistic feature aggregation. In this work, region-based feature summarization at multi-scales is explored using spatial pyramidal pooling for discriminative feature representation. Furthermore, a GCN is developed to capacitate learning of finer details for classifying plant diseases and insufficiency of nutrients. The proposed method, called Plant Nutrition Deficiency and Disease Network (PND-Net), has been evaluated on two public datasets for nutrition deficiency, and two for disease classification using four backbone CNNs. The best classification performances of the proposed PND-Net are as follows: (a) 90.00% Banana and 90.54% Coffee nutrition deficiency; and (b) 96.18% Potato diseases and 84.30% on PlantDoc datasets using Xception backbone. Furthermore, additional experiments have been carried out for generalization, and the proposed method has achieved state-of-the-art performances on two public datasets, namely the Breast Cancer Histopathology Image Classification (BreakHis 40 × : 95.50%, and BreakHis 100 × : 96.79% accuracy) and Single cells in Pap smear images for cervical cancer classification (SIPaKMeD: 99.18% accuracy). Also, the proposed method has been evaluated using five-fold cross validation and achieved improved performances on these datasets. Clearly, the proposed PND-Net effectively boosts the performances of automated health analysis of various plants in real and intricate field environments, implying PND-Net\'s aptness for agricultural growth as well as human cancer classification.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    蛋白质-蛋白质相互作用(PPIs)在生命活动中起着至关重要的作用。已经开发了许多基于蛋白质序列信息的人工智能算法来预测PPI。然而,这些模型难以处理各种序列长度,并且泛化和预测精度较低。在这项研究中,我们提出了一个新的端到端深度学习框架,RSPPI,结合残差神经网络(ResNet)和空间金字塔池化(SPP),基于蛋白质序列理化性质和空间结构信息预测PPI。在RSPPI模型中,ResNet用于从蛋白质三维结构和一级序列中提取结构和物理化学信息;SPP层用于将特征图转换为单个向量并避免固定长度要求。RSPPI模型具有出色的跨物种性能,并且在大多数评估指标中都优于基于蛋白质序列或基因本体论的几种最新方法。RSPPI模型为开发AIPPI预测算法提供了一种新颖的策略。
    Protein-protein interactions (PPIs) play an essential role in life activities. Many artificial intelligence algorithms based on protein sequence information have been developed to predict PPIs. However, these models have difficulty dealing with various sequence lengths and suffer from low generalization and prediction accuracy. In this study, we proposed a novel end-to-end deep learning framework, RSPPI, combining residual neural network (ResNet) and spatial pyramid pooling (SPP), to predict PPIs based on the protein sequence physicochemistry properties and spatial structural information. In the RSPPI model, ResNet was employed to extract the structural and physicochemical information from the protein three-dimensional structure and primary sequence; the SPP layer was used to transform feature maps to a single vector and avoid the fixed-length requirement. The RSPPI model possessed excellent cross-species performance and outperformed several state-of-the-art methods based either on protein sequence or gene ontology in most evaluation metrics. The RSPPI model provides a novel strategy to develop an AI PPI prediction algorithm.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    解决由于潜在的严重影响而导致的准确跌倒事件检测的关键需求,本文介绍了空间信道和池化增强YouOnlyLookOnce版本5小(SCPE-YOLOv5s)模型。跌倒事件由于其变化的尺度和微妙的姿势特征而对检测提出了挑战。为了解决这个问题,SCPE-YOLOv5将空间注意力引入了高效信道注意力(ECA)网络,这显著增强了模型从空间姿态分布中提取特征的能力。此外,该模型将平均池化层集成到空间金字塔池(SPP)网络中,以支持跌倒姿势的多尺度提取。同时,通过将ECA网络纳入SPP,该模型有效地结合了全局和局部特征,进一步增强了特征提取。本文在公共数据集上验证了SCPE-YOLOv5,证明它达到了88.29%的平均精度,表现优于你只看一次版本5小4.87%。此外,该模型实现每秒57.4帧。因此,SCPE-YOLOv5s为跌倒事件检测提供了一种新颖的解决方案。
    Addressing the critical need for accurate fall event detection due to their potentially severe impacts, this paper introduces the Spatial Channel and Pooling Enhanced You Only Look Once version 5 small (SCPE-YOLOv5s) model. Fall events pose a challenge for detection due to their varying scales and subtle pose features. To address this problem, SCPE-YOLOv5s introduces spatial attention to the Efficient Channel Attention (ECA) network, which significantly enhances the model\'s ability to extract features from spatial pose distribution. Moreover, the model integrates average pooling layers into the Spatial Pyramid Pooling (SPP) network to support the multi-scale extraction of fall poses. Meanwhile, by incorporating the ECA network into SPP, the model effectively combines global and local features to further enhance the feature extraction. This paper validates the SCPE-YOLOv5s on a public dataset, demonstrating that it achieves a mean Average Precision of 88.29 %, outperforming the You Only Look Once version 5 small by 4.87 %. Additionally, the model achieves 57.4 frames per second. Therefore, SCPE-YOLOv5s provides a novel solution for fall event detection.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    为了提高复杂背景下混凝土桥梁表观病害检测的准确性,提出了一种基于增强型YOLOv3算法的桥梁病害识别方法。首先,YOLOv3网络结构得到增强,以更好地适应疾病规模特征的密集分布和大变化,检测层结合了挤压和激励(SE)网络注意机制模块和空间金字塔池化模块,以增强语义特征提取能力。其次,选择具有更好定位能力的CIoU作为训练的损失函数。最后,采用K-means算法对桥面病害数据集进行锚帧聚类。1363个包含裸露钢筋的数据集,剥落,并产生桥梁的水蚀破坏,并在手动标记和数据改进后进行网络训练,以测试本文所述算法的有效性。根据试验结果,YOLOv3模型在准确率方面比原始模型提高了更多,召回率,平均精度(AP),其他指标。其总体平均平均精度(mAP)值也增长了5.5%。使用RTX2080Ti显卡,检测帧速率增加到每秒84帧,实现更精确和实时的桥梁疾病检测。
    A bridge disease identification approach based on an enhanced YOLO v3 algorithm is suggested to increase the accuracy of apparent disease detection of concrete bridges under complex backgrounds. First, the YOLO v3 network structure is enhanced to better accommodate the dense distribution and large variation of disease scale characteristics, and the detection layer incorporates the squeeze and excitation (SE) networks attention mechanism module and spatial pyramid pooling module to strengthen the semantic feature extraction ability. Secondly, CIoU with better localization ability is selected as the loss function for training. Finally, the K-means algorithm is used for anchor frame clustering on the bridge surface disease defects dataset. 1363 datasets containing exposed reinforcement, spalling, and water erosion damage of bridges are produced, and network training is done after manual labelling and data improvement in order to test the efficacy of the algorithm described in this paper. According to the trial results, the YOLO v3 model has enhanced more than the original model in terms of precision rate, recall rate, Average Precision (AP), and other indicators. Its overall mean Average Precision (mAP) value has also grown by 5.5%. With the RTX2080Ti graphics card, the detection frame rate increases to 84 Frames Per Second, enabling more precise and real-time bridge illness detection.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    核磁共振(NMR)是分析由小分子组成的混合物的关键技术,提供非破坏性的,快,可重复,和无偏见的好处。然而,这是具有挑战性的,因为化学位移和峰值重叠的偏移,往往存在于混合物,如植物风味。这里,我们提出了一种基于深度学习的混合物识别方法(DeepMID),该方法可用于识别配方风味(由几种植物风味组成的混合物)中的植物风味(混合物),而无需了解植物风味中的特定成分。伪连体卷积神经网络(pSCNN)和空间金字塔池化(SPP)层由于其高精度和鲁棒性而用于解决问题。DeepMID模型经过训练,已验证,并在包含50,000对配方和植物口味的增强数据集上进行了测试。我们证明DeepMID可以在增强测试集中实现出色的预测结果:ACC=99.58%,TPR=99.48%,FPR=0.32%;和两个实验获得的数据集:一个显示ACC=97.60%,TPR=92.81%,FPR=0.78%,另一个显示ACC=92.31%,TPR=80.00%,FPR=0.00%。总之,DeepMID是一种基于NMR光谱识别配方香料中植物香料的可靠方法,这可以帮助研究人员加快风味配方的设计。
    Nuclear magnetic resonance (NMR) is a crucial technique for analyzing mixtures consisting of small molecules, providing non-destructive, fast, reproducible, and unbiased benefits. However, it is challenging to perform mixture identification because of the offset of chemical shifts and peak overlaps that often exist in mixtures such as plant flavors. Here, we propose a deep-learning-based mixture identification method (DeepMID) that can be used to identify plant flavors (mixtures) in a formulated flavor (mixture consisting of several plant flavors) without the need to know the specific components in the plant flavors. A pseudo-Siamese convolutional neural network (pSCNN) and a spatial pyramid pooling (SPP) layer were used to solve the problems due to their high accuracy and robustness. The DeepMID model is trained, validated, and tested on an augmented data set containing 50,000 pairs of formulated and plant flavors. We demonstrate that DeepMID can achieve excellent prediction results in the augmented test set: ACC = 99.58%, TPR = 99.48%, FPR = 0.32%; and two experimentally obtained data sets: one shows ACC = 97.60%, TPR = 92.81%, FPR = 0.78% and the other shows ACC = 92.31%, TPR = 80.00%, FPR = 0.00%. In conclusion, DeepMID is a reliable method for identifying plant flavors in formulated flavors based on NMR spectroscopy, which can assist researchers in accelerating the design of flavor formulations.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    由于脑肿瘤组织的复杂性,从磁共振图像(MRI)中分割脑肿瘤被认为是一个巨大的挑战,并且当由放射科医师进行手动分割时,从健康组织分割这些组织是更加乏味的挑战。在本文中,我们提出了一种实验方法,以强调深度学习元素(如优化器和损失函数)对脑肿瘤分割深度学习最佳解决方案的影响和有效性。我们在最受欢迎的脑肿瘤数据集(MICCAIBraTS2020和RSNA-ASNR-MICCAIBraTS2021)上评估了我们的性能结果。此外,引入了一种新的BridgedU-Net-ASPP-EVO,它利用Atrous空间金字塔池来增强捕获多尺度信息,以帮助分割不同的肿瘤大小,不断发展的归一化层,挤压和激励剩余块,和下采样的最大平均池化。构建了该架构的两个变体(桥接U-Net_ASPP_EVOv1和桥接U-Net_ASPP_EVOv2)。与其他最先进的模型相比,使用这两个模型取得了最好的结果;对于增强肿瘤(ET),我们从变量1获得了0.84、0.85和0.91的平均分割骰子得分,从v2获得了0.83、0.86和0.92的平均分割骰子得分,肿瘤核心(TC),和全肿瘤(WT)肿瘤亚区,分别,在BraTS2021验证数据集中。
    Brain tumor segmentation from Magnetic Resonance Images (MRI) is considered a big challenge due to the complexity of brain tumor tissues, and segmenting these tissues from the healthy tissues is an even more tedious challenge when manual segmentation is undertaken by radiologists. In this paper, we have presented an experimental approach to emphasize the impact and effectiveness of deep learning elements like optimizers and loss functions towards a deep learning optimal solution for brain tumor segmentation. We evaluated our performance results on the most popular brain tumor datasets (MICCAI BraTS 2020 and RSNA-ASNR-MICCAI BraTS 2021). Furthermore, a new Bridged U-Net-ASPP-EVO was introduced that exploits Atrous Spatial Pyramid Pooling to enhance capturing multi-scale information to help in segmenting different tumor sizes, Evolving Normalization layers, squeeze and excitation residual blocks, and the max-average pooling for down sampling. Two variants of this architecture were constructed (Bridged U-Net_ASPP_EVO v1 and Bridged U-Net_ASPP_EVO v2). The best results were achieved using these two models when compared with other state-of-the-art models; we have achieved average segmentation dice scores of 0.84, 0.85, and 0.91 from variant1, and 0.83, 0.86, and 0.92 from v2 for the Enhanced Tumor (ET), Tumor Core (TC), and Whole Tumor (WT) tumor sub-regions, respectively, in the BraTS 2021validation dataset.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    解决高分辨率带来的低检测精度和过大的参数体积的挑战,显著的尺度变化,以及无人机航拍图像的复杂背景,本文介绍了MFP-YOLO,基于YOLOv5s的轻量级检测算法。最初,设计了一个多路径逆残差模块,并纳入了一种关注机制,以管理与重大规模变化和来自复杂背景的大量干扰相关的问题。然后,并行解卷积空间金字塔池化用于提取特定尺度的信息,增强多尺度目标检测。此外,Focal-EIoU损失函数用于增强模型对高质量样本的关注,从而提高训练的稳定性和检测的准确性。最后,轻巧的解耦头部替换了原始模型的检测头部,加快网络收敛速度,提高检测精度。实验结果表明,MFP-YOLO在VisDrone2019验证和测试集上的mAP50分别提高了12.9%和8.0%,分别,与原来的YOLOV5S相比。同时,模型的参数体积和重量尺寸分别减少了79.2%和73.7%,分别,说明MFP-YOLO在无人机航拍影像探测任务中的性能优于其他主流算法。
    Addressing the challenges of low detection precision and excessive parameter volume presented by the high resolution, significant scale variations, and complex backgrounds in UAV aerial imagery, this paper introduces MFP-YOLO, a lightweight detection algorithm based on YOLOv5s. Initially, a multipath inverse residual module is designed, and an attention mechanism is incorporated to manage the issues associated with significant scale variations and abundant interference from complex backgrounds. Then, parallel deconvolutional spatial pyramid pooling is employed to extract scale-specific information, enhancing multi-scale target detection. Furthermore, the Focal-EIoU loss function is utilized to augment the model\'s focus on high-quality samples, consequently improving training stability and detection accuracy. Finally, a lightweight decoupled head replaces the original model\'s detection head, accelerating network convergence speed and enhancing detection precision. Experimental results demonstrate that MFP-YOLO improved the mAP50 on the VisDrone 2019 validation and test sets by 12.9% and 8.0%, respectively, compared to the original YOLOv5s. At the same time, the model\'s parameter volume and weight size were reduced by 79.2% and 73.7%, respectively, indicating that MFP-YOLO outperforms other mainstream algorithms in UAV aerial imagery detection tasks.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    脑肿瘤的诊断是一个漫长的过程,和自动化的过程,如脑肿瘤分割加快了时间线。U-Nets一直是语义分割的常用解决方案,它使用下采样-上采样方法来分割肿瘤。U-Nets依靠残差连接在上采样过程中传递信息;然而,上采样块仅从一个下采样块接收信息。这限制了上采样块的上下文和范围。在本文中,我们提出了SPP-U-Net,其中剩余连接被空间金字塔池(SPP)和注意力块的组合所取代。这里,SPP提供来自各种下采样块的信息,这将增加重建的范围,而注意力通过将局部特征与其相应的全局依赖性相结合来提供必要的上下文。现有文献使用沉重的方法,例如使用嵌套和密集的跳过连接和变压器。这些方法增加了模型内的训练参数,因此增加了模型的训练时间和复杂性。另一方面,所提出的方法获得了与现有文献相当的结果,而无需在160×192×192等较大尺寸上改变可训练参数的数量。总而言之,在Brats2021交叉验证中,所提出的模型的平均骰子得分为0.883,Hausdorff距离为7.84.
    Brain tumor diagnosis has been a lengthy process, and automation of a process such as brain tumor segmentation speeds up the timeline. U-Nets have been a commonly used solution for semantic segmentation, and it uses a downsampling-upsampling approach to segment tumors. U-Nets rely on residual connections to pass information during upsampling; however, an upsampling block only receives information from one downsampling block. This restricts the context and scope of an upsampling block. In this paper, we propose SPP-U-Net where the residual connections are replaced with a combination of Spatial Pyramid Pooling (SPP) and Attention blocks. Here, SPP provides information from various downsampling blocks, which will increase the scope of reconstruction while attention provides the necessary context by incorporating local characteristics with their corresponding global dependencies. Existing literature uses heavy approaches such as the usage of nested and dense skip connections and transformers. These approaches increase the training parameters within the model which therefore increase the training time and complexity of the model. The proposed approach on the other hand attains comparable results to existing literature without changing the number of trainable parameters over larger dimensions such as 160 × 192 × 192. All in all, the proposed model scores an average dice score of 0.883 and a Hausdorff distance of 7.84 on Brats 2021 cross validation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    猪肉是世界上消费最广泛的肉类产品,实现对猪个体的准确检测,对猪的智能养殖和健康监测具有重要意义。改进生猪检测对提高猪肉产量和质量具有重要意义,以及经济学。然而,目前的大多数方法都是基于体力劳动,导致不可行的表现。为了提高个体猪检测的效率和效果,本文介绍了注意力模块增强型YOLOv3-SC模型(YOLOv3-SPP-CBAM.SPP表示空间金字塔池模块,CBAM表示卷积块注意模块)。具体来说,利用注意力模块,网络将提取更丰富的特征信息,领先改进的性能。此外,通过集成SPP结构化网络,可以实现多尺度特征融合,这使得网络更加健壮。在构建的4019个样本的数据集上,实验结果表明,YOLOv3-SC网络在识别个体猪方面的mAP达到99.24%,检测时间为16ms。与其他流行的四种型号相比,包括YOLOv1,YOLOv2,更快-RCNN,和YOLOv3,猪识别的mAP提高了2.31%,1.44%,1.28%,和0.61%,分别。本文提出的YOLOv3-SC可以实现猪的准确个体检测。因此,这种新提出的模型可用于快速检测农场的单个猪,为个体猪的检测提供了新思路。
    Pork is the most widely consumed meat product in the world, and achieving accurate detection of individual pigs is of great significance for intelligent pig breeding and health monitoring. Improved pig detection has important implications for improving pork production and quality, as well as economics. However, most of the current approaches are based on manual labor, resulting in unfeasible performance. In order to improve the efficiency and effectiveness of individual pig detection, this paper describes the development of an attention module enhanced YOLOv3-SC model (YOLOv3-SPP-CBAM. SPP denotes the Spatial Pyramid Pooling module and CBAM indicates the Convolutional Block Attention Module). Specifically, leveraging the attention module, the network will extract much richer feature information, leading the improved performance. Furthermore, by integrating the SPP structured network, multi-scale feature fusion can be achieved, which makes the network more robust. On the constructed dataset of 4019 samples, the experimental results showed that the YOLOv3-SC network achieved 99.24% mAP in identifying individual pigs with a detection time of 16 ms. Compared with the other popular four models, including YOLOv1, YOLOv2, Faster-RCNN, and YOLOv3, the mAP of pig identification was improved by 2.31%, 1.44%, 1.28%, and 0.61%, respectively. The YOLOv3-SC proposed in this paper can achieve accurate individual detection of pigs. Consequently, this novel proposed model can be employed for the rapid detection of individual pigs on farms, and provides new ideas for individual pig detection.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号