GoogleNet

GoogleNet
  • 文章类型: Journal Article
    医学图像分析(MI)是先进医学的重要组成部分,因为它有助于早期发现和诊断各种疾病。通过磁共振成像(MRI)对脑肿瘤进行分类是一项挑战,需要准确的模型来进行有效的诊断和治疗计划。本文介绍了AG-MSTLN-EL,利用多源迁移学习的注意力辅助多源迁移学习集成学习模型(VisualGeometryGroupResNet和GoogLeNet),注意机制,和集成学习,以实现健壮和准确的脑肿瘤分类。多源迁移学习允许从不同领域提取知识,增强泛化。注意机制集中在特定的MRI区域,提高可解释性和分类性能。集成学习结合了k-最近邻,Softmax,和支持向量机分类器,提高准确性和可靠性。在具有3064个脑肿瘤MRI图像的数据集上评估模型的性能,AG-MSTLN-EL在所有分类措施方面都优于最先进的模型。迁移学习模式的创新组合,注意机制,集成学习为脑肿瘤分类提供了可靠的解决方案。其卓越的性能和高解释性使AG-MSTLN-EL成为临床医生和研究人员在医学图像分析中的宝贵工具。
    The analysis of medical images (MI) is an important part of advanced medicine as it helps detect and diagnose various diseases early. Classifying brain tumors through magnetic resonance imaging (MRI) poses a challenge demanding accurate models for effective diagnosis and treatment planning. This paper introduces AG-MSTLN-EL, an attention-aided multi-source transfer learning ensemble learning model leveraging multi-source transfer learning (Visual Geometry Group ResNet and GoogLeNet), attention mechanisms, and ensemble learning to achieve robust and accurate brain tumor classification. Multi-source transfer learning allows knowledge extraction from diverse domains, enhancing generalization. The attention mechanism focuses on specific MRI regions, increasing interpretability and classification performance. Ensemble learning combines k-nearest neighbor, Softmax, and support vector machine classifiers, improving both accuracy and reliability. Evaluating the model\'s performance on a dataset with 3064 brain tumor MRI images, AG-MSTLN-EL outperforms state-of-the-art models in terms of all classification measures. The model\'s innovative combination of transfer learning, attention mechanism, and ensemble learning provides a reliable solution for brain tumor classification. Its superior performance and high interpretability make AG-MSTLN-EL a valuable tool for clinicians and researchers in medical image analysis.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    对微藻物种进行准确分类对于监测海洋生态系统和管理海洋粘液的出现至关重要,这对于监测海洋环境中的粘液现象至关重要。由于耗时的过程和对专家知识的需求,传统方法已经不足。本文的目的是采用卷积神经网络(CNN)和支持向量机(SVM)来提高分类精度和效率。通过采用先进的计算技术,包括MobileNet和GoogleNet模型,与SVM分类一起,这项研究表明,与传统的识别方法相比,有了显著的进步。在使用四种不同的SVM核函数对由7820图像组成的数据集进行分类时,线性内核的成功率最高,为98.79%。其次是RBF内核,占98.73%,多项式内核为97.84%,乙状核为97.20%。这项研究不仅为海洋生物多样性监测的未来研究提供了方法论框架,而且还强调了在生态保护和了解气候变化和环境污染中的粘液动态方面实时应用的潜力。
    Accurately classifying microalgae species is vital for monitoring marine ecosystems and managing the emergence of marine mucilage, which is crucial for monitoring mucilage phenomena in marine environments. Traditional methods have been inadequate due to time-consuming processes and the need for expert knowledge. The purpose of this article is to employ convolutional neural networks (CNNs) and support vector machines (SVMs) to improve classification accuracy and efficiency. By employing advanced computational techniques, including MobileNet and GoogleNet models, alongside SVM classification, the study demonstrates significant advancements over conventional identification methods. In the classification of a dataset consisting of 7820 images using four different SVM kernel functions, the linear kernel achieved the highest success rate at 98.79 %. It is followed by the RBF kernel at 98.73 %, the polynomial kernel at 97.84 %, and the sigmoid kernel at 97.20 %. This research not only provides a methodological framework for future studies in marine biodiversity monitoring but also highlights the potential for real-time applications in ecological conservation and understanding mucilage dynamics amidst climate change and environmental pollution.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    分层是复合板中最重要和最危险的损坏之一。最近,许多论文提出了结构健康监测(SHM)技术的能力,用于研究具有各种形状和厚度深度的结构分层。然而,关于利用卷积神经网络(CNN)方法来自动化无损检测(NDT)技术数据库以识别分层大小和深度的研究很少。在本文中,提出了一种自动化系统,该系统能够区分原始结构和受损结构,并对具有不同深度的三类分层进行分类。该系统包括提出的CNN模型和兰姆波技术。在这项工作中,准备了一个单向复合板,该复合板在不同深度插入了三个分层样品,用于数值和实验研究。在数值部分,研究了导波的传播和与三个分层样品的相互作用,以观察分层深度如何影响分层区域的散射和捕获波。使用有效的超声导波技术对该数值研究进行了实验验证。该技术涉及压电晶片有源传感器(PWAS)和扫描激光多普勒振动计(SLDV)。数值和实验研究都表明,分层深度对捕获波的能量和分布有直接影响。从数值和实验研究中收集了三个不同的数据集,涉及数值波场图像数据集,实验波场图像数据集,和实验波数光谱图像数据集。这三个数据集与提出的CNN模型一起独立使用,以开发一个系统,该系统可以自动分类四个类别(原始类别和三个不同的分层类别)。所有三个数据集的结果显示了所提出的CNN模型以高精度预测分层深度的能力。使用GoogLeNetCNN验证了三个不同数据集的CNN模型结果。两种方法的结果都显示出极好的一致性。结果证明了波场图像和波数谱数据集用作CNN的输入数据以检测分层深度的能力。
    Delamination represents one of the most significant and dangerous damages in composite plates. Recently, many papers have presented the capability of structural health monitoring (SHM) techniques for the investigation of structural delamination with various shapes and thickness depths. However, few studies have been conducted regarding the utilization of convolutional neural network (CNN) methods for automating the non-destructive testing (NDT) techniques database to identify the delamination size and depth. In this paper, an automated system qualified for distinguishing between pristine and damaged structures and classifying three classes of delamination with various depths is presented. This system includes a proposed CNN model and the Lamb wave technique. In this work, a unidirectional composite plate with three samples of delamination inserted at different depths was prepared for numerical and experimental investigations. In the numerical part, the guided wave propagation and interaction with three samples of delamination were studied to observe how the delamination depth can affect the scattered and trapped waves over the delamination region. This numerical study was validated experimentally using an efficient ultrasonic guided waves technique. This technique involved piezoelectric wafer active sensors (PWASs) and a scanning laser Doppler vibrometer (SLDV). Both numerical and experimental studies demonstrate that the delamination depth has a direct effect on the trapped waves\' energy and distribution. Three different datasets were collected from the numerical and experimental studies, involving the numerical wavefield image dataset, experimental wavefield image dataset, and experimental wavenumber spectrum image dataset. These three datasets were used independently with the proposed CNN model to develop a system that can automatically classify four classes (pristine class and three different delamination classes). The results of all three datasets show the capability of the proposed CNN model for predicting the delamination depth with high accuracy. The proposed CNN model results of the three different datasets were validated using the GoogLeNet CNN. The results of both methods show an excellent agreement. The results proved the capability of the wavefield image and wavenumber spectrum datasets to be used as input data to the CNN for the detection of delamination depth.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    PIN二极管,由于其结构简单,在高频大功率激励下具有可变电阻特性,通常在雷达前端用作限制器,以过滤高功率微波(HPM),以防止其电源进入内部电路并造成损坏。本文对PIN二极管的HPM效应进行了理论推导和研究,然后用优化的神经网络算法代替传统的物理建模来计算和预测PIN二极管限幅器的两类HPM限幅指标。我们针对以下两种预测场景中的每一种提出了神经网络模型:在不同HPM辐照下的时间-结温曲线的场景中,来自测试数据集的预测值与模拟值之间的加权均方误差(MSE)低于0.004。在预测PIN限制器的功率限制阈值时,插入损耗,以及不同HPM辐照下的最大隔离度,测试集预测值和模拟值的MSE均小于0.03。本研究提出的方法,应用优化的神经网络算法代替传统的物理建模算法来研究PIN二极管限幅器的高功率微波效应,显著提高了计算和仿真速度,降低了计算成本,为研究PIN二极管限幅器的高功率微波效应提供了一种新的方法。
    PIN diodes, due to their simple structure and variable resistance characteristics under high-frequency high-power excitation, are often used in radar front-end as limiters to filter high power microwaves (HPM) to prevent its power from entering the internal circuit and causing damage. This paper carries out theoretical derivation and research on the HPM effects of PIN diodes, and then uses an optimized neural network algorithm to replace traditional physical modeling to calculate and predict two types of HPM limiting indicators of PIN diode limiters. We proposes a neural network model for each of the following two prediction scenarios: in the scenario of time-junction temperature curves under different HPM irradiation, the weighted mean squared error (MSE) between the predicted values from the test dataset and the simulated values is below 0.004. While in predicting PIN limiter\'s power limitation threshold, insertion loss, and maximum isolation under different HPM irradiation, the MSE of the test set prediction values and simulation values are all less than 0.03. The method proposed in this research, which applies an optimized neural network algorithm to replace traditional physical modeling algorithms for studying the high-power microwave effects of PIN diode limiters, significantly improves the computational and simulation speed, reduces the calculation cost, and provides a new method for studying the high-power microwave effects of PIN diode limiters.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    背景:检查DNA序列的功能和特征是一项极具挑战性的任务。说到人类基因组,由外显子和内含子组成,这项任务更具挑战性。人类外显子和内含子含有数百万到数十亿个核苷酸,这有助于在该序列中观察到的复杂性。考虑到基因组学的主题有多复杂,很明显,利用信号处理技术和深度学习工具建立一个强有力的预测模型,对人类基因组研究的发展是非常有帮助的。
    结果:使用频率混沌游戏表示用彩色图像表示人类外显子和内含子后,使用两个预训练的卷积神经网络模型(Resnet-50和GoogleNet)和一个建议的具有13个隐藏层的CNN模型对我们获得的图像进行分类。我们已经达到了92%的值的准确率为Resnet-50模型在大约7小时的执行时间,GoogleNet模型在2小时半的执行时间内的准确率值为91.5%。对于我们提出的CNN模型,在2h和37min内,我们的准确率达到91.6%。
    结论:我们提出的CNN模型在执行时间方面比Resnet-50模型更快。它能够稍微超过GoogleNet模型的准确率值。
    BACKGROUND: Examining functions and characteristics of DNA sequences is a highly challenging task. When it comes to the human genome, which is made up of exons and introns, this task is more challenging. Human exons and introns contain millions to billions of nucleotides, which contributes to the complexity observed in this sequences. Considering how complicated the subject of genomics is, it is obvious that using signal processing techniques and deep learning tools to build a strong predictive model can be very helpful for the development of the research of the human genome.
    RESULTS: After representing human exons and introns with color images using Frequency Chaos Game Representation, two pre-trained convolutional neural network models (Resnet-50 and GoogleNet) and a proposed CNN model having 13 hidden layers were used to classify our obtained images. We have reached a value of 92% for the accuracy rate for Resnet-50 model in about 7 h for the execution time, a value of 91.5% for the accuracy rate for the GoogleNet model in 2 h and a half for the execution time. For our proposed CNN model, we have reached 91.6% for the accuracy rate in 2 h and 37 min.
    CONCLUSIONS: Our proposed CNN model is faster than the Resnet-50 model in terms of execution time. It was able to slightly exceed the GoogleNet model for the accuracy rate value.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    多模式机器人音乐表演艺术领域由于其创新潜力而引起了极大的兴趣。传统的机器人在理解音乐表演中的情感和艺术表达方面面临局限性。因此,本文探讨了融合视觉和听觉感知的多模态机器人在音乐表演中的应用,以提高音乐表演的质量和艺术表现力。我们的方法涉及集成GRU(门控递归单元)和GoogLeNet模型进行情感分析。GRU模型处理音频数据并捕获音乐元素的时间动态,包括长期依赖,提取情感信息。GoogLeNet模型擅长图像处理,提取复杂的视觉细节和审美特征。这种协同作用加深了对音乐和视觉元素的理解,旨在产生更多情感共鸣和互动的机器人表演。实验结果证明了我们方法的有效性,显示出多模态机器人在音乐表现方面的显著改善。这些机器人,配备了我们的方法,提供高质量,有效唤起观众情感参与的艺术表演。在音乐表演中融合视听感知的多模态机器人丰富了艺术形式,并提供了多样化的人机交互。这项研究证明了多模式机器人在音乐表演中的潜力,促进技术与艺术的融合。它开辟了表演艺术和人机交互的新领域,提供独特和创新的体验。我们的发现为表演艺术领域多模式机器人的发展提供了宝贵的见解。
    The field of multimodal robotic musical performing arts has garnered significant interest due to its innovative potential. Conventional robots face limitations in understanding emotions and artistic expression in musical performances. Therefore, this paper explores the application of multimodal robots that integrate visual and auditory perception to enhance the quality and artistic expression in music performance. Our approach involves integrating GRU (Gated Recurrent Unit) and GoogLeNet models for sentiment analysis. The GRU model processes audio data and captures the temporal dynamics of musical elements, including long-term dependencies, to extract emotional information. The GoogLeNet model excels in image processing, extracting complex visual details and aesthetic features. This synergy deepens the understanding of musical and visual elements, aiming to produce more emotionally resonant and interactive robot performances. Experimental results demonstrate the effectiveness of our approach, showing significant improvements in music performance by multimodal robots. These robots, equipped with our method, deliver high-quality, artistic performances that effectively evoke emotional engagement from the audience. Multimodal robots that merge audio-visual perception in music performance enrich the art form and offer diverse human-machine interactions. This research demonstrates the potential of multimodal robots in music performance, promoting the integration of technology and art. It opens new realms in performing arts and human-robot interactions, offering a unique and innovative experience. Our findings provide valuable insights for the development of multimodal robots in the performing arts sector.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    开发有效的方法来推断由众多表情组成的不同面孔之间的关系或在不同时间在同一面孔上的关系(例如,疾病进展)是影像学相关研究中的一个悬而未决的问题。在这项研究中,我们提出了一种新颖的面部特征提取方法,表征,基于经典计算机视觉和深度学习的识别,更具体地说,卷积神经网络。
    我们描述了名为FRetrAival(FRAI)的混合面部表征系统,它是GoogleNet和AlexNet神经网络(NN)模型的混合。通过FRAI网络分析的图像通过计算机视觉技术进行预处理,例如基于梯度的定向算法,该算法只能从任何类型的图片中提取人脸区域。使用对齐人脸数据集(AFD)来训练和测试用于提取图像特征的FRAI解决方案。野生(LFW)保持数据集中的标记面已用于外部验证。
    总的来说,与以前的技术相比,我们的方法通过产生最大精度,在k-最近邻(KNN)上显示出更好的结果,召回,F1和F2得分值(92.00、92.66、92.33和92.52%,分别)对于AFD和(每个变量为95.00%)对于LFW数据集,它们被用作训练和测试数据集。FRAI模型可能会用于医疗保健和犯罪学以及许多其他应用中,在这些应用中,快速识别特定识别目标的指纹等面部特征非常重要。
    UNASSIGNED: Developing efficient methods to infer relations among different faces consisting of numerous expressions or on the same face at different times (e.g., disease progression) is an open issue in imaging related research. In this study, we present a novel method for facial feature extraction, characterization, and identification based on classical computer vision coupled with deep learning and, more specifically, convolutional neural networks.
    UNASSIGNED: We describe the hybrid face characterization system named FRetrAIval (FRAI), which is a hybrid of the GoogleNet and the AlexNet Neural Network (NN) models. Images analyzed by the FRAI network are preprocessed by computer vision techniques such as the oriented gradient-based algorithm that can extract only the face region from any kind of picture. The Aligned Face dataset (AFD) was used to train and test the FRAI solution for extracting image features. The Labeled Faces in the Wild (LFW) holdout dataset has been used for external validation.
    UNASSIGNED: Overall, in comparison to previous techniques, our methodology has shown much better results on k-Nearest Neighbors (KNN) by yielding the maximum precision, recall, F1, and F2 score values (92.00, 92.66, 92.33, and 92.52%, respectively) for AFD and (95.00% for each variable) for LFW dataset, which were used as training and testing datasets. The FRAI model may be potentially used in healthcare and criminology as well as many other applications where it is important to quickly identify face features such as fingerprint for a specific identification target.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:肺癌是最常见的癌症类型,占全球癌症病例的12.8%。当最初出现非特异性症状时,很难在早期诊断。
    目的:使用机器学习方法开发的图像处理技术在决策支持系统的开发中发挥了至关重要的作用。本研究旨在通过深度学习方法和卷积神经网络(CNN)对良性和恶性肺部病变进行分类。
    方法:图像数据集包括4459次计算机断层扫描(CT)扫描(良性,2242;恶性,2217).研究类型为回顾性;病例对照分析。一种基于GoogLeNet架构的方法,这是一种深度学习方法,用于对图像进行最大推断,并最大限度地减少手动控制。
    结果:用于开发CNN模型的数据集包含在训练(3567)和测试(892)数据集中。模型在训练阶段的最高准确率估计为0.98。根据准确性,灵敏度,特异性,正预测值,和测试数据的阴性预测值,最高的分类性能比为阳性预测值,为0.984.
    结论:深度学习方法有助于通过计算机断层扫描图像对肺癌进行诊断和分类。
    UNASSIGNED: Lung cancer is the most common type of cancer, accounting for 12.8% of cancer cases worldwide. As initially non-specific symptoms occur, it is difficult to diagnose in the early stages.
    UNASSIGNED: Image processing techniques developed using machine learning methods have played a crucial role in the development of decision support systems. This study aimed to classify benign and malignant lung lesions with a deep learning approach and convolutional neural networks (CNNs).
    UNASSIGNED: The image dataset includes 4459 Computed tomography (CT) scans (benign, 2242; malignant, 2217). The research type was retrospective; the case-control analysis. A method based on GoogLeNet architecture, which is one of the deep learning approaches, was used to make maximum inference on images and minimize manual control.
    UNASSIGNED: The dataset used to develop the CNNs model is included in the training (3567) and testing (892) datasets. The model\'s highest accuracy rate in the training phase was estimated as 0.98. According to accuracy, sensitivity, specificity, positive predictive value, and negative predictive values of testing data, the highest classification performance ratio was positive predictive value with 0.984.
    UNASSIGNED: The deep learning methods are beneficial in the diagnosis and classification of lung cancer through computed tomography images.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    深度学习在识别复杂结构方面发挥着重要作用,与传统算法相比,它在训练和分类任务方面的表现优于传统算法。在这项工作中,开发了一种基于云的本地解决方案,用于将阿尔茨海默病(AD)分类为MRI扫描作为输入模态。多分类用于AD品种,分为四个阶段。为了利用预先训练的GoogLeNet模型的功能,采用迁移学习。GoogLeNet模型,它是为图像分类任务预先训练的,针对多类别AD分类的特定目的进行了微调。通过这个过程,达到了98%的更好的准确度。因此,使用GoogLeNet提出的体系结构开发了用于阿尔茨海默病预测的本地云Web应用程序。此应用程序使医生能够远程检查患者是否存在AD。
    Deep learning is playing a major role in identifying complicated structure, and it outperforms in term of training and classification tasks in comparison to traditional algorithms. In this work, a local cloud-based solution is developed for classification of Alzheimer\'s disease (AD) as MRI scans as input modality. The multi-classification is used for AD variety and is classified into four stages. In order to leverage the capabilities of the pre-trained GoogLeNet model, transfer learning is employed. The GoogLeNet model, which is pre-trained for image classification tasks, is fine-tuned for the specific purpose of multi-class AD classification. Through this process, a better accuracy of 98% is achieved. As a result, a local cloud web application for Alzheimer\'s prediction is developed using the proposed architectures of GoogLeNet. This application enables doctors to remotely check for the presence of AD in patients.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    基于卷积神经网络(CNN)的辣椒叶病识别是有趣的研究领域之一。然而,大多数现有的基于CNN的辣椒叶病检测模型在准确性和计算性能方面都是次优的。特别是,由于在大型领域中进行叶片病害识别需要大量的计算和内存消耗,因此在嵌入式便携式设备上应用CNN具有挑战性。因此,本文介绍了一种基于GoogLeNet架构的增强型轻量级模型。初始步骤涉及压缩Inception结构以减少模型参数,导致识别速度显着提高。此外,该网络结合了空间金字塔池化结构,以无缝地整合局部和全局特征。随后,所提出的改进模型已经在9183张图像的真实数据集上进行了训练,含有6种辣椒病。交叉验证结果表明,模型准确率为97.87%,比基于Inception-V1和Inception-V3的GoogLeNet高出6%。该模型的内存需求仅为10.3MB,减少了52.31%-86.69%,与GoogLeNet相比。我们还将该模型与现有的基于CNN的模型进行了比较,包括AlexNet,ResNet-50和MobileNet-V2。结果表明,该模型的平均推理时间减少了61.49%,41.78%和23.81%,分别。结果表明,所提出的增强模型在精度和计算效率方面都能显著提高性能,这有可能提高辣椒种植业的生产力。
    Pepper leaf disease identification based on convolutional neural networks (CNNs) is one of the interesting research areas. However, most existing CNN-based pepper leaf disease detection models are suboptimal in terms of accuracy and computing performance. In particular, it is challenging to apply CNNs on embedded portable devices due to a large amount of computation and memory consumption for leaf disease recognition in large fields. Therefore, this paper introduces an enhanced lightweight model based on GoogLeNet architecture. The initial step involves compressing the Inception structure to reduce model parameters, leading to a remarkable enhancement in recognition speed. Furthermore, the network incorporates the spatial pyramid pooling structure to seamlessly integrate local and global features. Subsequently, the proposed improved model has been trained on the real dataset of 9183 images, containing 6 types of pepper diseases. The cross-validation results show that the model accuracy is 97.87%, which is 6% higher than that of GoogLeNet based on Inception-V1 and Inception-V3. The memory requirement of the model is only 10.3 MB, which is reduced by 52.31%-86.69%, comparing to GoogLeNet. We have also compared the model with the existing CNN-based models including AlexNet, ResNet-50 and MobileNet-V2. The result shows that the average inference time of the proposed model decreases by 61.49%, 41.78% and 23.81%, respectively. The results show that the proposed enhanced model can significantly improve performance in terms of accuracy and computing efficiency, which has potential to improve productivity in the pepper farming industry.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号