Feature Extraction

特征提取
  • 文章类型: Journal Article
    通过结合诸如标签和逻辑规则之类的先验信息来学习有区别的特征,最近的图像分类工作取得了一定的成功。然而,这些方法忽略了特征的可变性,导致特征不一致和模型参数更新的波动,这进一步降低了图像分类的准确性和模型的不稳定性。为了解决这个问题,本文提出了一种将结构先验驱动特征提取与梯度动量(SPGM)相结合的新方法,从一致的特征学习和精确的参数更新的角度来看,提高图像分类的准确性和稳定性。具体来说,SPGM利用结构先验驱动的特征提取(SPFE)方法来计算多级特征和原始图像的梯度,以构建结构信息,然后将其转化为先验知识,以驱动网络学习与原始图像一致的特征。此外,引入了梯度和动量(GMO)集成优化策略,根据梯度和动量之和的角度和范数,动态调整参数更新的方向和步长,启用精确的模型参数更新。在CIFAR10和CIFAR100数据集上进行的大量实验表明,SPGM方法显着降低了图像分类中的前1位错误率,提高分类性能,并优于最先进的方法。
    Recent image classification efforts have achieved certain success by incorporating prior information such as labels and logical rules to learn discriminative features. However, these methods overlook the variability of features, resulting in feature inconsistency and fluctuations in model parameter updates, which further contribute to decreased image classification accuracy and model instability. To address this issue, this paper proposes a novel method combining structural prior-driven feature extraction with gradient-momentum (SPGM), from the perspectives of consistent feature learning and precise parameter updates, to enhance the accuracy and stability of image classification. Specifically, SPGM leverages a structural prior-driven feature extraction (SPFE) approach to calculate gradients of multi-level features and original images to construct structural information, which is then transformed into prior knowledge to drive the network to learn features consistent with the original images. Additionally, an optimization strategy integrating gradients and momentum (GMO) is introduced, dynamically adjusting the direction and step size of parameter updates based on the angle and norm of the sum of gradients and momentum, enabling precise model parameter updates. Extensive experiments on CIFAR10 and CIFAR100 datasets demonstrate that the SPGM method significantly reduces the top-1 error rate in image classification, enhances the classification performance, and outperforms state-of-the-art methods.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    输电线路走廊点云场景中目标对象的语义分割是电力线树屏障检测的关键步骤。大量的,无序分布,输电线路走廊场景中点云的非均匀性对特征提取提出了重大挑战。以往的研究往往忽视了空间信息的核心利用,限制了网络理解复杂几何形状的能力。为了克服这个限制,本文着眼于增强分割网络中空间几何信息的深度表达,并提出了一种称为BDF-Net的方法来改进RandLA-Net。对于每个输入的3D点云数据,BDF-Net首先通过空间信息编码块将相对坐标和相对距离信息编码为空间几何特征表示,以捕获点云数据的局部空间结构。随后,双线性池块通过利用其双线性相互作用能力有效地将点云的特征信息与空间几何表示相结合,从而学习更多的区别性局部特征描述符。全局特征提取块利用点位置与相对位置的比值捕获点云数据中的全局结构信息,从而增强网络的语义理解能力。为了验证BDF-Net的性能,本文构建了一个数据集,PPCD,针对输电线路走廊的点云场景进行了详细的实验。实验结果表明,BDF-Net在各种评估指标上实现了显著的性能提升,具体实现97.16%的OA,77.48%的mIoU,mAcc为87.6%,为3.03%,16.23%,比RandLA-Net高18.44%,分别。此外,与其他最新方法的比较也验证了BDF-Net在点云语义分割任务中的优越性。
    Semantic segmentation of target objects in power transmission line corridor point cloud scenes is a crucial step in powerline tree barrier detection. The massive quantity, disordered distribution, and non-uniformity of point clouds in power transmission line corridor scenes pose significant challenges for feature extraction. Previous studies have often overlooked the core utilization of spatial information, limiting the network\'s ability to understand complex geometric shapes. To overcome this limitation, this paper focuses on enhancing the deep expression of spatial geometric information in segmentation networks and proposes a method called BDF-Net to improve RandLA-Net. For each input 3D point cloud data, BDF-Net first encodes the relative coordinates and relative distance information into spatial geometric feature representations through the Spatial Information Encoding block to capture the local spatial structure of the point cloud data. Subsequently, the Bilinear Pooling block effectively combines the feature information of the point cloud with the spatial geometric representation by leveraging its bilinear interaction capability thus learning more discriminative local feature descriptors. The Global Feature Extraction block captures the global structure information in the point cloud data by using the ratio between the point position and the relative position, so as to enhance the semantic understanding ability of the network. In order to verify the performance of BDF-Net, this paper constructs a dataset, PPCD, for the point cloud scenario of transmission line corridors and conducts detailed experiments on it. The experimental results show that BDF-Net achieves significant performance improvements in various evaluation metrics, specifically achieving an OA of 97.16%, a mIoU of 77.48%, and a mAcc of 87.6%, which are 3.03%, 16.23%, and 18.44% higher than RandLA-Net, respectively. Moreover, comparisons with other state-of-the-art methods also verify the superiority of BDF-Net in point cloud semantic segmentation tasks.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    对于行业图像数据,提出了一种基于随机配置网络和多尺度特征提取的图像分类方法。使用深度2DSCN从不同尺度的图像中提取多尺度特征,并且多层的隐藏特征也被连接在一起以获得更多的信息特征。集成的特征被馈送到SCN中以学习分类器,该分类器提高了不同类别的识别率。在实验中,使用手写数字数据库和工业热轧带钢数据库,对比结果表明,该方法能有效提高分类精度。
    For industry image data, this paper proposes an image classification method based on stochastic configuration networks and multi-scale feature extraction. The multi-scale features are extracted from images of different scales using deep 2DSCN, and the hidden features of multiple layers are also connected together to obtain more informational features. The integrated features are fed into SCNs to learn a classifier which improves the recognition rate for different categories. In the experiments, a handwritten digit database and an industry hot-rolled steel strip database are used, and the comparison results demonstrate the proposed method can effectively improve the classification accuracy.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在本文中,引入了一种新的音乐流派分类方法。所提出的方法涉及将音频信号转换为称为声谱的统一表示,使用增强的Rigdelet神经网络(RNN)从其中提取纹理特征。此外,RNN已使用部分强化效果优化器(IPREO)的改进版本进行了优化,有效地避免了局部优化并增强了RNN的泛化能力。GTZAN数据集已在实验中使用,以评估所提出的RNN/IPREO模型对音乐流派分类的有效性。结果表明,通过结合光谱质心的组合,精度达到了92%,梅尔谱图,和Mel频率倒谱系数(MFCC)作为特征。该性能显著优于K-均值(58%)和支持向量机(高达68%)。此外,RNN/IPREO模型超越了各种深度学习架构,如神经网络(65%),RNN(84%),CNN(88%),DNN(86%),VGG-16(91%),和ResNet-50(90%)。值得注意的是,RNN/IPREO模型能够获得与VGG-16、ResNet-50和RNN-LSTM等知名深度模型相当的结果。有时甚至超过他们的分数。这突出了其混合CNN-双向RNN设计与IPREO参数优化算法相结合的优势,用于提取复杂和顺序的听觉数据。
    In this paper, a new approach has been introduced for classifying the music genres. The proposed approach involves transforming an audio signal into a unified representation known as a sound spectrum, from which texture features have been extracted using an enhanced Rigdelet Neural Network (RNN). Additionally, the RNN has been optimized using an improved version of the partial reinforcement effect optimizer (IPREO) that effectively avoids local optima and enhances the RNN\'s generalization capability. The GTZAN dataset has been utilized in experiments to assess the effectiveness of the proposed RNN/IPREO model for music genre classification. The results show an impressive accuracy of 92 % by incorporating a combination of spectral centroid, Mel-spectrogram, and Mel-frequency cepstral coefficients (MFCCs) as features. This performance significantly outperformed K-Means (58 %) and Support Vector Machines (up to 68 %). Furthermore, the RNN/IPREO model outshined various deep learning architectures such as Neural Networks (65 %), RNNs (84 %), CNNs (88 %), DNNs (86 %), VGG-16 (91 %), and ResNet-50 (90 %). It is worth noting that the RNN/IPREO model was able to achieve comparable results to well-known deep models like VGG-16, ResNet-50, and RNN-LSTM, sometimes even surpassing their scores. This highlights the strength of its hybrid CNN-Bi-directional RNN design in conjunction with the IPREO parameter optimization algorithm for extracting intricate and sequential auditory data.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    这项研究探讨了采用高光谱成像(HSI)技术定量评估硅(Si)对油菜叶片中铅(Pb)含量的影响的可行性。针对高光谱数据维数高、信息冗余的缺陷,本文提出了两种改进的特征波长提取算法,重复区间组合优化(RICO)和区间组合优化(ICO)结合逐步回归(ICO-SR)。将整个油菜籽油菜叶片作为感兴趣区域(ROI),以提取400.89-1002.19nm范围内的可见近红外高光谱数据。在数据处理中,Savitzky-Golay(SG)平滑,去趋势(DT),并利用多重散射校正(MSC)进行光谱数据预处理,而递归特征消除(RFE),迭代可变子集优化(IVSO),ICO,并采用两种增强算法来识别特征波长。随后,基于光谱数据的预处理和特征提取,采用偏最小二乘回归(PLSR)和支持向量回归(SVR)方法构建了油菜叶片Pb含量预测模型,并对每个模型的性能进行了比较和分析。结果表明,两种改进算法在提取有代表性的光谱信息方面比传统方法更有效,SVR模型的性能优于PLSR模型。最后,为了进一步提高SVR模型的预测精度和鲁棒性,引入鲸鱼优化算法(WOA)对其参数进行优化。结果表明,MSC-RICO-WOA-SVR模型取得了最佳的综合性能,Rp2为0.9436,RMSEP为0.0501mg/kg,RPD为3.4651。研究结果进一步证实了HSI结合特征提取算法在评价Si缓解油菜Pb胁迫效果方面的巨大潜力,为确定Si适宜施用量缓解油菜Pb污染提供了理论依据。
    This study explored the feasibility of employing hyperspectral imaging (HSI) technology to quantitatively assess the effect of silicon (Si) on lead (Pb) content in oilseed rape leaves. Aiming at the defects of hyperspectral data with high dimension and redundant information, this paper proposed two improved feature wavelength extraction algorithms, repetitive interval combination optimization (RICO) and interval combination optimization (ICO) combined with stepwise regression (ICO-SR). The entire oilseed rape leaves were taken as the region of interest (ROI) to extract the visible near-infrared hyperspectral data within the 400.89-1002.19 nm range. In data processing, Savitzky-Golay (SG) smoothing, detrending (DT), and multiple scatter correction (MSC) were utilized for spectral data preprocessing, while recursive feature elimination (RFE), iteratively variable subset optimization (IVSO), ICO, and the two enhanced algorithms were employed to identify characteristic wavelengths. Subsequently, based on the spectral data of preprocessing and feature extraction, partial least squares regression (PLSR) and support vector regression (SVR) methods were used to construct various Pb content prediction models in oilseed rape leaves, with a comparison and analysis of each model performance. The results indicated that the two improved algorithms were more efficient in extracting representative spectral information than conventional methods, and the performance of SVR models was better than PLSR models. Finally, to further improve the prediction accuracy and robustness of the SVR models, the whale optimization algorithm (WOA) was introduced to optimize their parameters. The findings demonstrated that the MSC-RICO-WOA-SVR model achieved the best comprehensive performance, with Rp2 of 0.9436, RMSEP of 0.0501 mg/kg, and RPD of 3.4651. The results further confirmed the great potential of HSI combined with feature extraction algorithms to evaluate the effectiveness of Si in alleviating Pb stress in oilseed rape and provided a theoretical basis for determining the appropriate amount of Si application to alleviate Pb pollution in oilseed rape.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    高光谱图像(HSI)分类是HSI应用领域的重要组成部分。由于HSI包含丰富的光谱信息,有效地提取深层表示特征是一大挑战。在现有方法中,尽管边缘数据增强用于增强边缘表示,在边缘也引入了大量的高频噪声。此外,不同光谱对于分类决策的重要性尚未得到强调。为应对上述挑战,我们提出了一种边缘感知和谱空间特征学习网络(ESSN)。ESSN包含边缘特征增强块和频谱空间特征提取块。首先,在边缘特征增强块中,图像的边缘被感知,不同光谱波段的边缘特征得到自适应加强。然后,在光谱空间特征提取块中,自适应调整不同光谱的权重,在此基础上提取更全面的深度表征特征。已经对三个公开的高光谱数据集进行了广泛的实验,实验结果表明,与现有技术的SOTA方法相比,该方法具有更高的准确性和抗干扰性。
    Hyperspectral image (HSI) classification is a vital part of the HSI application field. Since HSIs contain rich spectral information, it is a major challenge to effectively extract deep representation features. In existing methods, although edge data augmentation is used to strengthen the edge representation, a large amount of high-frequency noise is also introduced at the edges. In addition, the importance of different spectra for classification decisions has not been emphasized. Responding to the above challenges, we propose an edge-aware and spectral-spatial feature learning network (ESSN). ESSN contains an edge feature augment block and a spectral-spatial feature extraction block. Firstly, in the edge feature augment block, the edges of the image are sensed, and the edge features of different spectral bands are adaptively strengthened. Then, in the spectral-spatial feature extraction block, the weights of different spectra are adaptively adjusted, and more comprehensive depth representation features are extracted on this basis. Extensive experiments on three publicly available hyperspectral datasets have been conducted, and the experimental results indicate that the proposed method has higher accuracy and immunity to interference compared to state-of-the-art (SOTA) method.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    作物生长过程图像的高通量和全时采集,并分析其特征的形态参数,是实现快速育种技术的基础,从而加速作物育种者对种质资源的探索和品种选择。发芽过程中大豆胚根的进化特征是大豆种子活力的重要指标。直接影响大豆后续的生长过程和产量。为了解决人工测量胚胎胚根特征耗时耗力的问题,以及大错误的问题,利用连续时间序列作物生长活力监测系统采集大豆发芽的全时间序列图像。通过引入注意机制SegNext_attention,改进Segment模块,并添加CAL模块,构建了YOLOv8-segANDcal模型,用于分割和提取大豆胚胎胚根特征和胚根长度计算。与YOLOv8-seg模型相比,该模型在mAP50-95中分别提高了2%和1%的胚胎胚根的检测和分割,并计算了胚胎胚根的轮廓特征和胚根长度,获得胚根轮廓随发芽时间的形态演变特征。该模型为作物育种者和农学家选择作物品种提供了一种快速准确的方法。
    The high-throughput and full-time acquisition of images of crop growth processes, and the analysis of the morphological parameters of their features, is the foundation for achieving fast breeding technology, thereby accelerating the exploration of germplasm resources and variety selection by crop breeders. The evolution of embryonic soybean radicle characteristics during germination is an important indicator of soybean seed vitality, which directly affects the subsequent growth process and yield of soybeans. In order to address the time-consuming and labor-intensive manual measurement of embryonic radicle characteristics, as well as the issue of large errors, this paper utilizes continuous time-series crop growth vitality monitoring system to collect full-time sequence images of soybean germination. By introducing the attention mechanism SegNext_Attention, improving the Segment module, and adding the CAL module, a YOLOv8-segANDcal model for the segmentation and extraction of soybean embryonic radicle features and radicle length calculation was constructed. Compared to the YOLOv8-seg model, the model respectively improved the detection and segmentation of embryonic radicles by 2% and 1% in mAP50-95, and calculated the contour features and radicle length of the embryonic radicles, obtaining the morphological evolution of the embryonic radicle contour features over germination time. This model provides a rapid and accurate method for crop breeders and agronomists to select crop varieties.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    基于稳态视觉诱发电位(SSVEP)的脑机接口(BCI)由于其高的信息传递率而受到广泛关注,精度高,丰富的指令集。然而,其识别方法的性能在很大程度上取决于受试者内部分类的校准数据量。一些研究使用深度学习(DL)算法进行学科间分类,这可以减少计算过程,但是与受试者内部分类相比,性能仍有很大的改进空间。为了解决这些问题,提出了一种基于软饱和非线性模块的高效SSVEP信号识别深度学习网络模型e-SSVEPNet。软饱和非线性模块在输出小于零时采用类似的指数计算方法,提高对噪声的鲁棒性。在SSVEP数据集的条件下,两个滑动时间窗口长度(1s和0.5s),和三个训练数据大小,本文对提出的网络模型进行了评估,并将其与其他传统和深度学习模型基线方法进行了比较。对非线性模块的实验结果进行了分类和比较。大量实验结果表明,所提出的网络在SSVEP数据集上具有最高的主体内分类平均准确率,提高了SSVEP信号分类识别的性能,并且在短信号下具有更高的解码精度,因此,它具有巨大的潜在能力,实现基于高速SSVEP的BCI。
    Brain-computer interfaces (BCIs) based on steady-state visual evoked potentials (SSVEP) have received widespread attention due to their high information transmission rate, high accuracy, and rich instruction set. However, the performance of its identification methods strongly depends on the amount of calibration data for within-subject classification. Some studies use deep learning (DL) algorithms for inter-subject classification, which can reduce the calculation process, but there is still much room for improvement in performance compared with intra-subject classification. To solve these problems, an efficient SSVEP signal recognition deep learning network model e-SSVEPNet based on the soft saturation nonlinear module is proposed in this paper. The soft saturation nonlinear module uses a similar exponential calculation method for output when it is less than zero, improving robustness to noise. Under the conditions of the SSVEP data set, two sliding time window lengths (1 s and 0.5 s), and three training data sizes, this paper evaluates the proposed network model and compares it with other traditional and deep learning model baseline methods. The experimental results of the nonlinear module were classified and compared. A large number of experimental results show that the proposed network has the highest average accuracy of intra-subject classification on the SSVEP data set, improves the performance of SSVEP signal classification and recognition, and has higher decoding accuracy under short signals, so it has huge potential ability to realize high-speed SSVEP-based for BCI.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    针对传统图像识别技术难以提取有用特征和延长识别时间的问题,对AlexNet模型进行了改进,提高了图像分类识别的效果。本研究重点研究了8种番茄叶病和健康叶片。利用HOG和LBP加权融合提取图像特征,提出了一种基于AlexNet模型的番茄叶部病害识别模型,迁移学习用于训练AlexNet模型。将AlexNet模型在PlantVillage图像数据集上学到的知识转移到此模型,同时减少完全连接的层数。使用Keras深度学习框架和编程语言Python。该模型已实现,并对番茄叶部病害进行了分类鉴定。特征加权融合分类的识别率高于串行和并行,识别时间最短。当HOG和LBP的权重系数比为3:7时,图像识别率最高,其值为97.2%。从模型性能曲线看,当迭代次数超过150次时,训练集和测试准确率均超过97%,损失率呈梯度下降,并且变化相对稳定;与传统的AlexNet模型相比,HOG+LBP+SVM模型,和VGG模型,改进的AlexNet模型具有最高的识别率,它具有很高的召回价值,准确度,和F1值;与最新的卷积神经网络疾病识别模型相比,改进的AlexNet模型识别准确率为98.83%,F1值为0.994。结果表明,该模型具有良好的收敛性能,预测速度快,且损失率低,能有效识别8种番茄叶片图像,为作物病害识别研究提供参考。
    Aiming at the problems that the traditional image recognition technology is challenging to extract useful features and the recognition time is extended; the AlexNet model is improved to improve the effect of image classification and recognition. This study focuses on 8 types of tomato leaf diseases and healthy leaves. By using HOG and LBP weighted fusion to extract image features, a tomato leaf disease recognition model based on the AlexNet model is proposed, and transfer learning is used to train the AlexNet model. Transfer the knowledge learned by the AlexNet model on the PlantVillage image dataset to this model while reducing the number of fully connected layers. Keras deep learning framework and programming language Python were used. The model was implemented, and the classification and identification of tomato leaf diseases were carried out. The recognition rate of feature-weighted fusion classification is higher than that of serial and parallel methods, and the recognition time is the shortest. When the weight coefficient ratio of HOG and LBP is 3:7, the image recognition rate is the highest, and its value is 97.2 %. From the model performance curve See, when the number of iterations is more than 150 times, the training set and test accuracy rate both exceed 97 %, the loss rate shows a gradient decline, and the change is relatively stable; compared with the traditional AlexNet model, HOG + LBP + SVM model, and VGG model, improved AlexNet model has the highest recognition rate, and it has high recall value, accuracy, and F1 value; Compared with the latest convolutional neural network disease recognition models, improved AlexNet model recognition accuracy was 98.83 %, and the F1 value was 0.994. It shows that the model has good convergence performance, fast prediction speed, and low loss rate and can effectively identify 8 types of tomato leaf images, which provides a reference for the research on crop disease identification.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    诈骗行为严重威胁个人利益和社会稳定,因此,欺诈检测近年来备受关注。在社交媒体等场景中,欺诈者通常隐藏在众多良性用户中,只构成一小部分,经常形成“小帮派”。由于欺诈者的稀缺,传统的图神经网络可能会忽略或掩盖关键的欺诈信息,导致欺诈特征代表性不足。为了解决这些问题,本研究提出了图上的tran-smote(GTS)欺诈检测方法。利用子图神经网络提取器深度挖掘各类节点的结构特征,使用变压器技术将这些功能与属性功能集成在一起,丰富了节点的信息表示,从而解决了特征表示不足的问题。此外,这种方法涉及设置一个特征嵌入空间来生成代表少数类的新节点,边缘生成器用于为这些新节点提供相关的连接信息,缓解阶级不平衡问题。在两个真实数据集上的实验结果表明,所提出的GTS,性能优于当前最先进的基线。
    Fraud seriously threatens individual interests and social stability, so fraud detection has attracted much attention in recent years. In scenarios such as social media, fraudsters typically hide among numerous benign users, constituting only a small minority and often forming \"small gangs\". Due to the scarcity of fraudsters, the conventional graph neural network might overlook or obscure critical fraud information, leading to insufficient representation of fraud characteristics. To address these issues, the tran-smote on graphs (GTS) method for fraud detection is proposed by this study. Structural features of each type of node are deeply mined using a subgraph neural network extractor, these features are integrated with attribute features using transformer technology, and the node\'s information representation is enriched, thereby addressing the issue of inadequate feature representation. Additionally, this approach involves setting a feature embedding space to generate new nodes representing minority classes, and an edge generator is used to provide relevant connection information for these new nodes, alleviating the class imbalance problem. The results from experiments on two real datasets demonstrate that the proposed GTS, performs better than the current state-of-the-art baseline.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号