image recognition

图像识别
  • 文章类型: Journal Article
    致密砂岩油气藏是石油地质勘探研究的主要热点。然而,由于致密砂岩储层固有的强非均质性和复杂的微观孔隙结构,许多储层分类标准的适用性有限。本次调查的重点是位于鄂尔多斯盆地济源地区的长8致密储层。高压压汞实验,铸造薄截面,并进行了扫描电镜实验。采用图像识别技术提取各样品的孔隙形状参数。根据上述情况,通过灰色关联分析(GRA),层次分析法(AHP),熵权法(EWM)和综合权重法,通过拟合得到初始产能与高压压汞参数的关系指数Q1和初始产能与孔隙形状参数的关系指数Q2。建立了基于孔隙结构和形状参数的致密砂岩储层双耦合综合定量分类预测模型。对目标储层进行了定量分类研究,分析储层质量与孔隙结构和形状参数的相关性,导致有利勘探区的建议。研究结果表明,当Q1≥0.5和Q2≥0.5时,储层分类为I型,当Q1>0.7和Q2>0.57时,储层分类为I1型,表明为高产储层。当0.32 Tight sandstone reservoirs are a primary focus of research on the geological exploration of petroleum. However, many reservoir classification criteria are of limited applicability due to the inherent strong heterogeneity and complex micropore structure of tight sandstone reservoirs. This investigation focused on the Chang 8 tight reservoir situated in the Jiyuan region of the Ordos Basin. High-pressure mercury intrusion experiments, casting thin sections, and scanning electron microscopy experiments were conducted. Image recognition technology was used to extract the pore shape parameters of each sample. Based on the above, through grey relational analysis (GRA), analytic hierarchy process (AHP), entropy weight method (EWM) and comprehensive weight method, the relationship index Q1 between initial productivity and high pressure mercury injection parameters and the relationship index Q2 between initial productivity and pore shape parameters are obtained by fitting. Then a dual-coupled comprehensive quantitative classification prediction model for tight sandstone reservoirs was developed based on pore structure and shape parameters. A quantitative classification study was conducted on the target reservoir, analyzing the correlation between reservoir quality and pore structure and shape parameters, leading to the proposal of favourable exploration areas. The research results showed that when Q1 ≥ 0.5 and Q2 ≥ 0.5, the reservoir was classified as type I. When Q1 > 0.7 and Q2 > 0.57, it was classified as type I1, indicating a high-yield reservoir. When 0.32 < Q1 < 0.47 and 0.44 < Q2 < 0.56, was classified as type II. When 0.1 < Q1 < 0.32 and 0.3 < Q2 < 0.44, it was classified as type III. Type I reservoirs exhibit a zigzag pattern in the northwest part of the study area. Thus, the northwest should be prioritized in actual exploration and development. Additionally, the initial productivity of tight sandstone reservoirs showed a positive correlation with the porosity, permeability, sorting coefficient, coefficient of variation, and median radius. Conversely, it demonstrated a negative correlation with the median pressure and displacement pressure. The perimeters of pores, their circularity, and the length of the major axis showed a positive correlation with the porosity, permeability, sorting coefficient, coefficient of variation, and median radius. On the other hand, they exhibited a negative correlation with the median pressure and displacement pressure. This study quantitatively constructed a new classification and evaluation system for tight sandstone reservoirs from the perspective of microscopic pore structure, achieving an overall model accuracy of 93.3%. This model effectively predicts and evaluates tight sandstone reservoirs. It provides new guidance for identifying favorable areas in the study region and other tight sandstone reservoirs.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    煤流中的异物容易造成输送带的损坏,大多数异物经常被遮挡,让他们很难被发现。针对低照度和沙尘雾环境中遮挡目标检测精度和效率低的问题,提出了一种图像异物检测方法。首先,YOLOv5s后端处理通过软非最大抑制进行优化,以减少密集对象的影响。其次,SimOTA标签分配用于减少密集遮挡下模糊样本的影响。然后,滑动损失用于挖掘困难样品,和Inner-SIoU用于优化边界框回归损失。最后,组-泰勒修剪用于压缩模型。实验结果表明,该方法参数仅为4.20×105,计算量为1.00×109,模型大小为1.20MB,自建数据集上的mAP0.5高达91.30%。不同计算设备上的检测速度高达66.31、41.90和33.03FPS。证明了该方法实现了对多层闭塞煤流异物的快速、高精度检测。
    Foreign objects in coal flow easily cause damage to conveyor belts, and most foreign objects are often occluded, making them difficult to detect. Aiming at solving the problems of low accuracy and efficiency in the detection of occluded targets in a low-illumination and dust fog environment, an image detection method for foreign objects is proposed. Firstly, YOLOv5s back-end processing is optimized by soft non-maximum suppression to reduce the influence of dense objects. Secondly, SimOTA label allocation is used to reduce the influence of ambiguous samples under dense occlusion. Then, Slide Loss is used to excavate difficult samples, and Inner-SIoU is used to optimize the bounding box regression loss. Finally, Group-Taylor pruning is used to compress the model. The experimental results show that the proposed method has only 4.20 × 105 parameters, a computational amount of 1.00 × 109, a model size of 1.20 MB, and an mAP0.5 of up to 91.30% on the self-built dataset. The detection speed on the different computing devices is as high as 66.31, 41.90, and 33.03 FPS. This proves that the proposed method achieves fast and high-accuracy detection of multi-layer occluded coal flow foreign objects.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    泡沫浮选是一种广泛而重要的选矿方法,显著影响提取矿物的纯度和质量。传统上,工人需要通过观察浮选泡沫的视觉特性来控制化学剂量,但这需要相当的经验和操作技能。本文设计了一种基于深度集成学习的浮选泡沫图像识别传感器,用于监测实际的浮选泡沫工作条件,以协助操作人员促进化学剂量调整,并实现促进精矿品位和矿物回收的工业目标。在我们的方法中,浮选泡沫图像的训练和验证数据在K折交叉验证中进行划分,基于深度神经网络(DNN)的学习器是通过在图像增强的训练数据中预先训练的DNN模型生成的,以提高其泛化性和鲁棒性。然后,提出了一种利用基于DNN的学习者在验证过程中的性能信息的隶属函数,以提高基于DNN的学习者的识别准确性。随后,提出了一种基于F1分数的通过相似于理想解决方案(TOPSIS)的订单偏好技术,通过由基于DNN的学习者预测组成的决策矩阵,通过隶属函数选择浮选泡沫图像的最可能工作条件,该方法用于优化深度集成学习的组合过程。在实际工业金锑泡沫浮选应用中验证了所设计传感器的有效性和优越性。
    Froth flotation is a widespread and important method for mineral separation, significantly influencing the purity and quality of extracted minerals. Traditionally, workers need to control chemical dosages by observing the visual characteristics of flotation froth, but this requires considerable experience and operational skills. This paper designs a deep ensemble learning-based sensor for flotation froth image recognition to monitor actual flotation froth working conditions, so as to assist operators in facilitating chemical dosage adjustments and achieve the industrial goals of promoting concentrate grade and mineral recovery. In our approach, training and validation data on flotation froth images are partitioned in K-fold cross validation, and deep neural network (DNN) based learners are generated through pre-trained DNN models in image-enhanced training data, in order to improve their generalization and robustness. Then, a membership function utilizing the performance information of the DNN-based learners during the validation is proposed to improve the recognition accuracy of the DNN-based learners. Subsequently, a technique for order preference by similarity to an ideal solution (TOPSIS) based on the F1 score is proposed to select the most probable working condition of flotation froth images through a decision matrix composed of the DNN-based learners\' predictions via a membership function, which is adopted to optimize the combination process of deep ensemble learning. The effectiveness and superiority of the designed sensor are verified in a real industrial gold-antimony froth flotation application.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在神经形态硬件上直接训练尖峰神经网络(SNN)具有显着降低人工神经网络训练能耗的潜力。使用Spike时序相关可塑性(STDP)训练的SNN受益于无梯度和无监督的局部学习,可以在超低功耗神经形态硬件上轻松实现。然而,分类任务不能单独使用无监督的STDP执行。在本文中,我们提出了稳定监督的STDP(S2-STDP),有监督的STDP学习规则,用于训练配备无监督STDP的SNN的分类层,以进行特征提取。S2-STDP集成误差调制权重更新,其将神经元尖峰与从层内的平均激发时间导出的期望时间戳对齐。然后,我们引入了一种称为配对竞争神经元(PCN)的训练架构,以进一步增强使用S2-STDP训练的分类层的学习能力。PCN将每个类与配对的神经元相关联,并通过类内竞争鼓励神经元对目标或非目标样本的专业化。我们在图像识别数据集上评估我们的方法,包括MNIST,Fashion-MNIST,和CIFAR-10.结果表明,我们的方法优于最先进的监督STDP学习规则,用于类似的结构和神经元数量。进一步的分析表明,PCN的使用增强了S2-STDP的性能,不考虑超参数集,也不引入任何额外的超参数。
    Direct training of Spiking Neural Networks (SNNs) on neuromorphic hardware has the potential to significantly reduce the energy consumption of artificial neural network training. SNNs trained with Spike Timing-Dependent Plasticity (STDP) benefit from gradient-free and unsupervised local learning, which can be easily implemented on ultra-low-power neuromorphic hardware. However, classification tasks cannot be performed solely with unsupervised STDP. In this paper, we propose Stabilized Supervised STDP (S2-STDP), a supervised STDP learning rule to train the classification layer of an SNN equipped with unsupervised STDP for feature extraction. S2-STDP integrates error-modulated weight updates that align neuron spikes with desired timestamps derived from the average firing time within the layer. Then, we introduce a training architecture called Paired Competing Neurons (PCN) to further enhance the learning capabilities of our classification layer trained with S2-STDP. PCN associates each class with paired neurons and encourages neuron specialization toward target or non-target samples through intra-class competition. We evaluate our methods on image recognition datasets, including MNIST, Fashion-MNIST, and CIFAR-10. Results show that our methods outperform state-of-the-art supervised STDP learning rules, for comparable architectures and numbers of neurons. Further analysis demonstrates that the use of PCN enhances the performance of S2-STDP, regardless of the hyperparameter set and without introducing any additional hyperparameters.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    传统的手工血涂片诊断方法耗时长,容易出错,通常在很大程度上依赖于临床实验室分析师的经验来保证准确性。随着神经网络和深度学习等关键技术的突破不断推动医疗领域的数字化转型,图像识别技术正越来越多地被利用来增强现有的医疗流程。近年来,计算机技术的进步通过使用图像识别技术提高了血液涂片中血细胞识别的效率。本文全面总结了利用图像识别算法诊断血涂片疾病的方法和步骤,重点是疟疾和白血病。此外,它为开发全面的血细胞病理检测系统提供了前瞻性的研究方向。
    Traditional manual blood smear diagnosis methods are time-consuming and prone to errors, often relying heavily on the experience of clinical laboratory analysts for accuracy. As breakthroughs in key technologies such as neural networks and deep learning continue to drive digital transformation in the medical field, image recognition technology is increasingly being leveraged to enhance existing medical processes. In recent years, advancements in computer technology have led to improved efficiency in the identification of blood cells in blood smears through the use of image recognition technology. This paper provides a comprehensive summary of the methods and steps involved in utilizing image recognition algorithms for diagnosing diseases in blood smears, with a focus on malaria and leukemia. Furthermore, it offers a forward-looking research direction for the development of a comprehensive blood cell pathological detection system.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    流量数据的水文监测对于防洪和现代河流管理具有重要意义。然而,传统的接触方法越来越难以满足简单的要求,准确度,和连续性。基于视频的河流流量测量是一种通过使用图像识别算法来监视不接触水体的流速的技术。与传统接触技术相比,具有全覆盖、全自动化的优势。为了及时总结现有结果,并为进一步的研究和应用提供信息,本文回顾并综合了有关基于视频的河流流量测量技术的一般实现路线以及当今流行的速度检测图像识别算法的原理和进展的文献。然后,它讨论了图像识别算法在图像采集条件方面的挑战,参数不确定性,以及复杂的气象和水环境。结论是,可以通过增强基于视频的放电测量算法的鲁棒性和准确性来提高该技术的性能,尽量减少天气影响,提高计算效率。最后,概述了进一步完善该技术的未来发展方向。
    The hydrological monitoring of flow data is important for flood prevention and modern river management. However, traditional contact methods are increasingly struggling to meet the requirements of simplicity, accuracy, and continuity. The video-based river discharge measurement is a technique to monitor flow velocity without contacting the water body by using the image-recognition algorithms, which has been verified to have the advantages of full coverage and full automation compared with the traditional contact technique. In order to provide a timely summary of the available results and to inform further research and applications, this paper reviews and synthesizes the literature on the general implementation routes of the video-based river discharge measurement technique and the principles and advances of today\'s popular image-recognition algorithms for velocity detection. Then, it discusses the challenges of image-recognition algorithms in terms of image acquisition conditions, parameter uncertainties, and complex meteorological and water environments. It is concluded that the performance of this technique can be improved by enhancing the robustness and accuracy of video-based discharge measurement algorithms, minimizing weather effects, and improving computational efficiency. Finally, future development directions for further perfecting this technique are outlined.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    化学分子结构是表达化学知识的直接和方便的手段,在学术交流中起着至关重要的作用。在化学方面,手绘是学生和研究人员的共同任务。如果我们能把手绘的化学分子结构转换成机器可读的格式,像SMILES编码,计算机可以有效地处理和分析这些结构,显著提高了化学研究的效率。此外,随着教育技术的进步,自动评分越来越受欢迎。当机器自动识别化学分子结构并评估图纸的正确性时,它为教师提供了极大的便利。我们创建了ChemReco,一种用于识别涉及三个原子的化学分子结构的工具:C,H,O,为化学研究人员提供方便。目前,对手绘化学分子结构的研究有限。因此,本文的主要重点是构建数据集。我们提出了一种合成图像方法来快速生成类似于手绘化学分子结构的图像,提高数据采集效率。关于模型选择,本文开发的手绘化学分子结构识别模型最终识别准确率达96.90%。该模型采用了EfficientNetTransformer的编码器-解码器架构,与其他编码器-解码器组合相比,具有卓越的性能。
    Chemical molecular structures are a direct and convenient means of expressing chemical knowledge, playing a vital role in academic communication. In chemistry, hand drawing is a common task for students and researchers. If we can convert hand-drawn chemical molecular structures into machine-readable formats, like SMILES encoding, computers can efficiently process and analyze these structures, significantly enhancing the efficiency of chemical research. Furthermore, with the progress of educational technology, automated grading is gaining popularity. When machines automatically recognize chemical molecular structures and assess the correctness of the drawings, it offers great convenience to teachers. We created ChemReco, a tool designed to identify chemical molecular structures involving three atoms: C, H, and O, providing convenience for chemical researchers. Currently, there are limited studies on hand-drawn chemical molecular structures. Therefore, the primary focus of this paper is constructing datasets. We propose a synthetic image method to rapidly generate images resembling hand-drawn chemical molecular structures, enhancing dataset acquisition efficiency. Regarding model selection, the hand-drawn chemical molecule structural recognition model developed in this article achieves a final recognition accuracy of 96.90%. This model employs the encoder-decoder architecture of EfficientNet + Transformer, demonstrating superior performance compared to other encoder-decoder combinations.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    AI技术的快速发展引起了人们对其在各个领域的应用的极大兴趣,包括医学和牙科。这项研究旨在评估具有图像识别功能的ChatGPT-4V在回答日本国家牙科考试(JNDE)中基于图像的问题时的能力,以探索其作为牙科学生教育支持工具的潜力。
    数据集使用了JNDE的问题,该项目于2023年1月进行,重点是与图像相关的查询。使用了ChatGPT-4V,和标准化的提示,问题文本,和图像被输入。使用QlikSense®和GraphPadPrism进行数据和统计分析。
    ChatGPT-4V对基于图像的JNDE问题的总体正确响应率为35.0%。强制性问题的正确回答率为57.1%,一般问题占43.6%,临床实际问题占28.6%。在牙科麻醉学和牙髓学等专业中,ChatGPT-4V的正确反应率超过70%,而正畸和口腔手术的反应率较低。问题中更多的图像与更低的准确性相关,表明图像数量对正确和不正确响应的影响。
    虽然创新,ChatGPT-4V的图像识别功能表现出局限性,特别是在处理图像密集和复杂的临床实践问题时,目前尚不完全适合作为牙科学生的教育支持工具。建议使用更广泛的数据集进行进一步的技术改进和重新评估。
    UNASSIGNED: Rapid advancements in AI technology have led to significant interest in its application across various fields, including medicine and dentistry. This study aimed to assess the capabilities of ChatGPT-4V with image recognition in answering image-based questions from the Japanese National Dental Examination (JNDE) to explore its potential as an educational support tool for dental students.
    UNASSIGNED: The dataset used questions from the JNDE, which was conducted in January 2023, with a focus on image-related queries. ChatGPT-4V was utilized, and standardized prompts, question texts, and images were input. Data and statistical analyses were conducted using Qlik Sense® and GraphPad Prism.
    UNASSIGNED: The overall correct response rate of ChatGPT-4V for image-based JNDE questions was 35.0 %. The correct response rates were 57.1 % for compulsory questions, 43.6 % for general questions, and 28.6 % for clinical practical questions. In specialties like Dental Anesthesiology and Endodontics, ChatGPT-4V achieved correct response rates above 70 %, while response rates for Orthodontics and Oral Surgery were lower. A higher number of images in questions was correlated with lower accuracy, suggesting an impact of the number of images on correct and incorrect responses.
    UNASSIGNED: While innovative, ChatGPT-4V\'s image recognition feature exhibited limitations, especially in handling image-intensive and complex clinical practical questions, and is not yet fully suitable as an educational support tool for dental students at its current stage. Further technological refinement and re-evaluation with a broader dataset are recommended.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:为了生产优质的茉莉花茶,在收获过程中,选择处于最佳生长阶段的茉莉花至关重要。然而,由于环境和人工因素,实现这一目标仍然是一个挑战。本研究通过使用YOLOv7算法根据视觉属性对不同的茉莉花进行分类来解决这个问题,卷积神经网络中最先进的算法之一。
    结果:使用该模型检测茉莉花的平均精度(mAP值)为0.948,对于茉莉花的五种不同开放程度的精度,即小芽,芽,半开,全开而狂野,是87.7%,90.3%,89%,93.9%和86.4%,分别。同时,处理数据集中图像的其他方法,比如模糊和改变亮度,也提高了算法的可信度。
    结论:本研究表明,使用深度学习算法区分不同生长阶段的茉莉花是可行的。该研究可为茉莉花产量估算和开发智能精准采花应用以减少花卉浪费和生产成本提供参考。©2024化学工业学会。
    BACKGROUND: To produce jasmine tea of excellent quality, it is crucial to select jasmine flowers at their optimal growth stage during harvesting. However, achieving this goal remains a challenge due to environmental and manual factors. This study addresses this issue by classifying different jasmine flowers based on visual attributes using the YOLOv7 algorithm, one of the most advanced algorithms in convolutional neural networks.
    RESULTS: The mean average precision (mAP value) for detecting jasmine flowers using this model is 0.948, and the accuracy for five different degrees of openness of jasmine flowers, namely small buds, buds, half-open, full-open and wiltered, is 87.7%, 90.3%, 89%, 93.9% and 86.4%, respectively. Meanwhile, other ways of processing the images in the dataset, such as blurring and changing the brightness, also increased the credibility of the algorithm.
    CONCLUSIONS: This study shows that it is feasible to use deep learning algorithms for distinguishing jasmine flowers at different growth stages. This study can provide a reference for jasmine production estimation and for the development of intelligent and precise flower-picking applications to reduce flower waste and production costs. © 2024 Society of Chemical Industry.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    物联网的出现预计将为所谓的智能边缘设备创造一个广阔的市场,在无数领域打开无数机会,包括个性化医疗保健和先进的机器人技术。利用3D集成,边缘设备可以实现前所未有的小型化,同时提高处理能力和减少能耗。这里,我们展示了一个后端兼容的光电突触与转移学习方法在医疗保健应用,包括基于脑电图(EEG)的癫痫发作预测,基于肌电图(EMG)的手势识别,和基于心电图(ECG)的心律失常检测。通过对三个生物医学数据集的实验,我们观察到预训练模型在脑电图上的分类精度提高了2.93%,心电图4.90%,肌电图为7.92%,分别。该设备的光学编程特性可实现超低功耗(2.8×10-13J)微调过程,并为边缘计算场景中的患者特定问题提供解决方案。此外,该设备具有令人印象深刻的光敏特性,可以实现一系列光触发的突触功能,使其有希望用于神经形态视觉应用。为了展示这些复杂的突触特性的好处,开发了5×5的光电突触阵列,有效地模拟人类的视觉感知和记忆功能。所提出的柔性光电突触在推进可穿戴应用中的神经形态生理信号处理和人工视觉系统领域方面具有巨大潜力。
    The emergence of the Internet-of-Things is anticipated to create a vast market for what are known as smart edge devices, opening numerous opportunities across countless domains, including personalized healthcare and advanced robotics. Leveraging 3D integration, edge devices can achieve unprecedented miniaturization while simultaneously boosting processing power and minimizing energy consumption. Here, we demonstrate a back-end-of-line compatible optoelectronic synapse with a transfer learning method on health care applications, including electroencephalogram (EEG)-based seizure prediction, electromyography (EMG)-based gesture recognition, and electrocardiogram (ECG)-based arrhythmia detection. With experiments on three biomedical datasets, we observe the classification accuracy improvement for the pretrained model with 2.93% on EEG, 4.90% on ECG, and 7.92% on EMG, respectively. The optical programming property of the device enables an ultra-low power (2.8 × 10-13 J) fine-tuning process and offers solutions for patient-specific issues in edge computing scenarios. Moreover, the device exhibits impressive light-sensitive characteristics that enable a range of light-triggered synaptic functions, making it promising for neuromorphic vision application. To display the benefits of these intricate synaptic properties, a 5 × 5 optoelectronic synapse array is developed, effectively simulating human visual perception and memory functions. The proposed flexible optoelectronic synapse holds immense potential for advancing the fields of neuromorphic physiological signal processing and artificial visual systems in wearable applications.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号