机器视觉 machine vision-医云文献数字医云科研云海量医学决策数据服务

machine vision 关注

机器视觉

文献(167篇)

百科

视频

1 Automatic recognition system for concrete cracks with support vector machine based on crack features.

基于裂缝特征的支持向量机混凝土裂缝自动识别系统 [J]. 影响指数 : 4.996
发表时间：Aug 2024 29
来源期刊：Sci Rep PMID：39209999

DOI：10.1038/s41598-024-71075-1
文章类型： Journal Article

裂缝是混凝土表面的常见问题。随着基于机器视觉的检测系统的不断优化，有效的裂纹检测与识别是整个系统的核心。在这项研究中,支持向量机(SVM)用于区分裂缝与其他区域。为了完成SVM的识别系统,提出了一个由图像处理和识别模型组成的框架。提出了一种将Prewitt算子与Otsu阈值相结合的图像分割算法。新算法结合数学形态学处理的二值图像可以得到更完整的裂纹区域和更少的干涉区域。初始参数提取后，大多数杂质区域都是通过初步区分来筛选的，去除99%的杂质。该处理步骤确保了样品的平衡和有效性。建立基于径向基函数支持向量机的自动识别模型,紧密度,入住率，在将这三个特征与裂缝的所有六个特征进行比较后，选择了长宽比作为输入参数。该系统的识别准确率达到97.14%,证明了所提出的方法是有效的，满足了实际需求。
Cracks are a common problem in concrete surfaces. With the continuous optimization of machine vision-based inspection systems, effective crack detection and recognition is the core of the entire system. In this study, support vector machine (SVM) was used to distinguish cracks from other regions. To complete the recognition system of the SVM, a framework consisting of an image processing and recognition model was proposed. An algorithm combining the Prewitt operator with the Otsu threshold was proposed for image segmentation. The binary image processed by the new algorithm combined with mathematical morphology can result in a more complete crack zone and fewer interference regions. After the initial parameter extraction, most of the impurity areas were screened by preliminary discrimination, removing 99% of the impurities. This processing step ensured the balance and effectiveness of the samples. To establish an automatic identification model based on SVM with a radial basis function, compactness, occupancy rate, and length-width ratio were selected as input parameters after comparing these three features with all six features of the crack. The recognition accuracy of this system reaches 97.14%, demonstrating that the proposed method is effective and satisfies practical requirements.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
2 Study on Calcination Characteristics of Diaspore-Kaolin Bauxite Based on Machine Vision.

基于机器视觉的洋水高岭土铝土矿煅烧特性研究 [J]. 影响指数 : 4.927
发表时间：Aug 2024 11
来源期刊：Molecules PMID：39202891

DOI：10.3390/molecules29163813
文章类型： Journal Article

贵州的D-K型铝土矿可用作未燃烧的陶瓷，吸附剂，和低温煅烧后的地质聚合物。旨在解决D-K型铝土矿在不同温度下煅烧后颜色发生变化的问题。采用数字图像处理技术提取铝土矿在不同温度下煅烧10min后图像的颜色特征。然后，我们分析了铝土矿煅烧前后的化学组成和微观形貌的变化，并研究了铝土矿煅烧后图像的颜色特征与组成变化之间的相关性。试验结果表明,铝土矿在500℃~1000℃煅烧10min后,脱水脱碳反应较为明显。主成分从硬铝石逐渐变为Al2O3，图像的色度值从0.0980下降到0.0515，饱和度值从0.0161上升到0.2433，亮度值从0.5890上升到0.7177。研究表明，铝土矿颜色特征的变化与组成变化密切相关。这对于从工程角度指导基于数字图像处理的铝土矿煅烧非常重要。
D-K-type bauxite from Guizhou can be used as an unburned ceramic, adsorbent, and geopolymer after low-temperature calcination. It aims to solve the problem where the color of the D-K-type bauxite changes after calcination at different temperatures. Digital image processing technology was used to extract the color characteristics of bauxite images after 10 min of calcination at various temperatures. Then, we analyzed changes in the chemical composition and micromorphology of bauxite before and after calcination and investigated the correlation between the color characteristics of images and composition changes after bauxite calcination. The test results indicated that after calcining bauxite at 500 °C to 1000 °C for 10 min, more obvious dehydration and decarburization reactions occurred. The main component gradually changed from diaspore to Al2O3, the chromaticity value of the image decreased from 0.0980 to 0.0515, the saturation value increased from 0.0161 to 0.2433, and the brightness value increased from 0.5890 to 0.7177. Studies have shown that changes in bauxite color characteristics are strongly correlated with changes in composition. This is important for directing bauxite calcination based on digital image processing from engineering viewpoints.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
3 A lightweight dual-attention network for tomato leaf disease identification.

番茄叶部病害识别的轻量级双注意网络影响指数 : 6.627
发表时间：2024
来源期刊：Front Plant Sci PMID：39166234

DOI：10.3389/fpls.2024.1420584
文章类型： Journal Article

番茄病害图像识别在农业生产中起着至关重要的作用。今天,基于深度学习的机器视觉方法在疾病识别方面取得了一定的成功，他们仍然面临着一些挑战。这些问题包括不平衡的数据集，疾病特征不明确,阶级间的小差异，和大的类内变化。为了应对这些挑战，提出了一种基于机器视觉的番茄叶部病害分类识别方法。首先,为了增强图像中的疾病特征细节，分段线性变换方法用于图像增强，并且使用过采样来扩展数据集，补偿不平衡的数据集。接下来,本文介绍了一种具有双重注意机制的卷积块，称为DAC块，用于构造名为LDAMNet的轻量级模型。DAC模块创新性地使用混合通道注意力（HCA）和协调注意力（CSA）分别处理输入图像的通道信息和空间信息，增强模型的特征提取能力。此外,本文提出了一种对噪声标签鲁棒的鲁棒交叉熵(RCE)损失函数，旨在减少训练过程中噪声标签对LDAMNet模型的影响。实验结果表明,该方法在番茄病害数据集上的平均识别准确率为98.71%,有效地将疾病信息保留在图像中并捕获疾病区域。此外，该方法还对水稻作物病害数据集具有很强的识别能力，表明良好的泛化性能和在不同作物的病害识别中有效发挥作用的能力。本文的研究成果为作物病害识别领域提供了新的思路和方法。然而,未来的研究需要进一步优化模型的结构和计算效率，并在更实际的场景中验证其应用效果。
Tomato disease image recognition plays a crucial role in agricultural production. Today, while machine vision methods based on deep learning have achieved some success in disease recognition, they still face several challenges. These include issues such as imbalanced datasets, unclear disease features, small inter-class differences, and large intra-class variations. To address these challenges, this paper proposes a method for classifying and recognizing tomato leaf diseases based on machine vision. First, to enhance the disease feature details in images, a piecewise linear transformation method is used for image enhancement, and oversampling is employed to expand the dataset, compensating for the imbalanced dataset. Next, this paper introduces a convolutional block with a dual attention mechanism called DAC Block, which is used to construct a lightweight model named LDAMNet. The DAC Block innovatively uses Hybrid Channel Attention (HCA) and Coordinate Attention (CSA) to process channel information and spatial information of input images respectively, enhancing the model\'s feature extraction capabilities. Additionally, this paper proposes a Robust Cross-Entropy (RCE) loss function that is robust to noisy labels, aimed at reducing the impact of noisy labels on the LDAMNet model during training. Experimental results show that this method achieves an average recognition accuracy of 98.71% on the tomato disease dataset, effectively retaining disease information in images and capturing disease areas. Furthermore, the method also demonstrates strong recognition capabilities on rice crop disease datasets, indicating good generalization performance and the ability to function effectively in disease recognition across different crops. The research findings of this paper provide new ideas and methods for the field of crop disease recognition. However, future research needs to further optimize the model\'s structure and computational efficiency, and validate its application effects in more practical scenarios.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
4 Rapid and non-destructive identification of Panax ginseng origins using hyperspectral imaging, visible light imaging, and X-ray imaging combined with multi-source data fusion strategies.

利用高光谱成像技术快速无损鉴定人参产地,可见光成像,X射线成像与多源数据融合相结合的策略。影响指数 : 7.425
发表时间：Sep 2024
来源期刊：Food Res Int PMID：39147491

DOI：10.1016/j.foodres.2024.114758
文章类型： Journal Article

人参的地理来源显著影响其营养价值和化学成分,进而影响其市场价格。分析这些差异的传统方法通常很耗时，并且需要大量的试剂，使他们效率低下。因此,高光谱成像（HSI）结合X射线技术用于人参产地的快速和无损可追溯性。最初，通过采用组合的孤立森林算法和密度峰值聚类（DPC）算法，可以有效地拒绝离群样本。随后，利用高光谱数据构建随机森林(RF)和支持向量机(SVM)分类模型。通过应用72种预处理方法及其组合，进一步优化了这些模型。此外,为了提高模型的性能，采用了四种变量筛选算法：SelectKBest，遗传算法(GA)，最小绝对收缩和选择运算符(LASSO)，和排列特征重要性(PFI)。优化后的模型,利用二阶导数，自动缩放，排列特征重要性，和支持向量机(2ndDer-AS-PFI-SVM)，实现了93.4%的预测精度，在独立预测集上，Kappa值为0.876，Brier评分为0.030，F1评分为0.932，AUC为0.994。此外，从彩色和X射线图像中提取的图像数据（包括颜色信息和纹理信息）用于构建分类模型并评估其性能。其中,利用X射线图像的纹理信息构建的SVM模型表现最好,在验证集上实现了63.0%的预测精度，Brier评分为0.181，F1评分为0.518，AUC为0.553。通过基于Stacking策略实现中层融合和高层数据融合,发现采用高光谱光谱信息和X射线图像纹理信息的高级融合的模型明显优于仅使用高光谱光谱信息的模型。这种先进的模型达到了95.2%的预测精度，在独立预测集上，Kappa值为0.912，Brier评分为0.027，F1评分为0.952，AUC为0.997。总之,本研究不仅为人参产地的快速、无损溯源提供了一条新的技术路径,同时也展示了HSI和X射线技术在医药和食品可追溯性领域的联合应用的巨大潜力。
The geographical origin of Panax ginseng significantly influences its nutritional value and chemical composition, which in turn affects its market price. Traditional methods for analyzing these differences are often time-consuming and require substantial quantities of reagents, rendering them inefficient. Therefore, hyperspectral imaging (HSI) in conjunction with X-ray technology were used for the swift and non-destructive traceability of Panax ginseng origin. Initially, outlier samples were effectively rejected by employing a combined isolated forest algorithm and density peak clustering (DPC) algorithm. Subsequently, random forest (RF) and support vector machine (SVM) classification models were constructed using hyperspectral spectral data. These models were further optimized through the application of 72 preprocessing methods and their combinations. Additionally, to enhance the model\'s performance, four variable screening algorithms were employed: SelectKBest, genetic algorithm (GA), least absolute shrinkage and selection operator (LASSO), and permutation feature importance (PFI). The optimized model, utilizing second derivative, auto scaling, permutation feature importance, and support vector machine (2nd Der-AS-PFI-SVM), achieved a prediction accuracy of 93.4 %, a Kappa value of 0.876, a Brier score of 0.030, an F1 score of 0.932, and an AUC of 0.994 on an independent prediction set. Moreover, the image data (including color information and texture information) extracted from color and X-ray images were used to construct classification models and evaluate their performance. Among them, the SVM model constructed using texture information from X -ray images performed the best, and it achieved a prediction accuracy of 63.0 % on the validation set, with a Brier score of 0.181, an F1 score of 0.518, and an AUC of 0.553. By implementing mid-level fusion and high-level data fusion based on the Stacking strategy, it was found that the model employing a high-level fusion of hyperspectral spectral information and X-ray images texture information significantly outperformed the model using only hyperspectral spectral information. This advanced model attained a prediction accuracy of 95.2 %, a Kappa value of 0.912, a Brier score of 0.027, an F1 score of 0.952, and an AUC of 0.997 on the independent prediction set. In summary, this study not only provides a novel technical path for fast and non-destructive traceability of Panax ginseng origin, but also demonstrates the great potential of the combined application of HSI and X-ray technology in the field of traceability of both medicinal and food products.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

求助全文
5 A machine vision tool for multi-color H2O2 sensing by MoOx nanoparticles with oxygen vacancies.

具有氧空位的 MoOx 纳米粒子用于多色 H2O2 传感的机器视觉工具。影响指数 : 4.831
发表时间：Aug 2024 8
来源期刊：Spectrochim Acta A Mol Biomol Spectrosc PMID：39128385

DOI：10.1016/j.saa.2024.124953
文章类型： Journal Article

提高过氧化氢(H2O2)检测在日常生产生活中的易操作性和便携性,具有重要的应用价值。然而,实现H2O2的快速比色检测和颜色变化定量仍然是一个挑战。在这项研究中,利用机器视觉实现了具有富氧空位的MoOx(2≤x≤3)纳米粒子对H2O2的快速、直观检测。随着H2O2浓度的增加，检测系统表现出从蓝色到绿色再到黄色的可见多色变化，紫外-可见分光光度计在680nm附近的吸收峰逐渐减小。具有出色的灵敏度，线性范围为0.1-600μmol/L，浓度低至0.1μmol/L可以检测到对H2O2具有良好的选择性。通过XPS等表征方法揭示了通过MoOx中氧空位的变化检测H2O2的传感机理,EPR,和DFT。此外,色调,饱和度,构建了基于MoOx的数值(HSV)可视化分析系统,便携式，以及在实际应用场景中对H2O2的敏感监测。这项工作提供了一个易于操作，低成本,便于实现H2O2的快速比色测定,在日常生活和工业生产中具有广阔的应用前景。
Improving the ease of operation and portability of hydrogen peroxide (H2O2) detection in daily production and life holds significant application value. However, it remains a challenge to achieve rapid colorimetric detection of H2O2 and color change quantification. In this study, we achieved rapid and visual detection of H2O2 by MoOx (2 ≤ x ≤ 3) nanoparticles with rich oxygen vacancies using machine vision. As the concentration of H2O2 increases, the detection system exhibited a visible multi-color change from blue to green and then yellow and the absorption peak near 680 nm measured by the UV-visible spectrophotometer gradually decreased. With excellent sensitivity, a wide linear range of 0.1-600 μmol/L, concentrations as low as 0.1 μmol/L can be detected with good selectivity towards H2O2. The sensing mechanism of detecting H2O2 by the change of oxygen vacancies in MoOx was revealed through characterization methods such as XPS, EPR, and DFT. In addition, the Hue, Saturation, Value (HSV) visual analysis system based on MoOx was constructed to assist in the rapid, portable, and sensitive monitoring of H2O2 in practical application scenarios. This work offers an easy-to operate, low cost, and convenience for achieving rapid colorimetric determination of H2O2 and has broad application prospects in daily life and industrial production.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

求助全文
6 GC Snakes: An Efficient and Robust Segmentation Model for Hot Forging Images.

GC Snakes ：一种高效鲁棒的热锻造图像分割模型。影响指数 : 3.847
发表时间：Jul 2024 25
来源期刊：Sensors (Basel) PMID：39123869

DOI：10.3390/s24154821
文章类型： Journal Article

机器视觉是热锻件理想的非接触测量方法，由于热锻件工作条件的多样性，图像分割在性能和鲁棒性方面一直是一个具有挑战性的问题。因此，本文针对锻造图像提出了一种高效、鲁棒的活动轮廓模型和相应的图像分割方法,通过测量锻件的几何参数，进行了验证实验，以证明分割方法的性能。具体来说,基于用于锻造图像的等效灰度表面的几何连续性，定义了三种类型的连续性参数；因此，提出了一种新的图像力和外部能量函数来形成新的主动轮廓模型，几何连续性蛇（GC蛇），这更适合于锻造图像的灰度分布特征，以鲁棒地提高主动轮廓的收敛性；此外，提出了一种GCSnakes初始控制点的生成策略，以构成一种高效，鲁棒的图像分割方法。实验结果表明，对于不同温度和大小的锻造图像，与现有的活动轮廓模型相比，提出的GCSnakes具有更好的分割性能，为热锻件的几何参数测量提供了更好的性能和效率。GCSnakes的最大定位和尺寸误差为0.5525mm和0.3868mm，分别,与Snakes模型的0.7873mm和0.6868mm的误差相比。
Machine vision is a desirable non-contact measurement method for hot forgings, as image segmentation has been a challenging issue in performance and robustness resulting from the diversity of working conditions for hot forgings. Thus, this paper proposes an efficient and robust active contour model and corresponding image segmentation approach for forging images, by which verification experiments are conducted to prove the performance of the segmentation method by measuring geometric parameters for forging parts. Specifically, three types of continuity parameters are defined based on the geometric continuity of equivalent grayscale surfaces for forging images; hence, a new image force and external energy functional are proposed to form a new active contour model, Geometric Continuity Snakes (GC Snakes), which is more percipient to the grayscale distribution characteristics of forging images to improve the convergence for active contour robustly; additionally, a generating strategy for initial control points for GC Snakes is proposed to compose an efficient and robust image segmentation approach. The experimental results show that the proposed GC Snakes has better segmentation performance compared with existing active contour models for forging images of different temperatures and sizes, which provides better performance and efficiency in geometric parameter measurement for hot forgings. The maximum positioning and dimension errors by GC Snakes are 0.5525 mm and 0.3868 mm, respectively, compared with errors of 0.7873 mm and 0.6868 mm by the Snakes model.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
7 Machine vision-based gait scan method for identifying cognitive impairment in older adults.

基于机器视觉的步态扫描方法识别老年人认知障碍。影响指数 : 5.702
发表时间：2024
来源期刊：Front Aging Neurosci PMID：39081395

DOI：10.3389/fnagi.2024.1341227
文章类型： Journal Article

■早期识别老年人的认知障碍可以减轻与年龄相关的残疾的负担。步态参数与认知衰退相关并可预测认知衰退。尽管在认知研究中已经使用了多种传感器和机器学习分析方法，需要一种深度优化的基于机器视觉的步态分析方法来识别认知衰退.
■这项研究使用了158名名为华西医院老年人步态的成年人的步行录像数据集，在简短的便携式精神状态问卷上被标记为表现。我们提出了一种新颖的识别网络，深度优化GaitPart(DO-GaitPart),基于轮廓和骨骼步态图像。应用了三个改进：在模板生成阶段使用短期时间模板生成器（STTG），以降低计算成本并最大程度地减少时间信息的损失；深度空间特征提取器（DSFE）从步态图像中提取全局和局部细粒度空间特征；以及多尺度时间聚合（MTA），一种基于注意力机制的时间建模方法，以提高步态模式的可分辨性。
■消融测试表明，DO-GaitPart的每个组件都是必不可少的。DO-GaitPart在CASIA-B数据集上的背包行走场景中表现出色，优于比较方法，它们是GaitSet，GaitPart,MT3D,3D本地,TransGait,CSTL,GLN,Gait3D数据集上的GaitGL和SMPLGait。提出的机器视觉步态特征识别方法在认知状态分类任务上实现了0.876(0.852-0.900)的接收机工作特征/曲线下面积(ROCAUC)。
■所提出的方法从步态视频数据集中很好地识别了认知衰退，使其成为认知评估中的前瞻性原型工具。
UNASSIGNED: Early identification of cognitive impairment in older adults could reduce the burden of age-related disabilities. Gait parameters are associated with and predictive of cognitive decline. Although a variety of sensors and machine learning analysis methods have been used in cognitive studies, a deep optimized machine vision-based method for analyzing gait to identify cognitive decline is needed.
UNASSIGNED: This study used a walking footage dataset of 158 adults named West China Hospital Elderly Gait, which was labelled by performance on the Short Portable Mental Status Questionnaire. We proposed a novel recognition network, Deep Optimized GaitPart (DO-GaitPart), based on silhouette and skeleton gait images. Three improvements were applied: short-term temporal template generator (STTG) in the template generation stage to decrease computational cost and minimize loss of temporal information; depth-wise spatial feature extractor (DSFE) to extract both global and local fine-grained spatial features from gait images; and multi-scale temporal aggregation (MTA), a temporal modeling method based on attention mechanism, to improve the distinguishability of gait patterns.
UNASSIGNED: An ablation test showed that each component of DO-GaitPart was essential. DO-GaitPart excels in backpack walking scene on CASIA-B dataset, outperforming comparison methods, which were GaitSet, GaitPart, MT3D, 3D Local, TransGait, CSTL, GLN, GaitGL and SMPLGait on Gait3D dataset. The proposed machine vision gait feature identification method achieved a receiver operating characteristic/area under the curve (ROCAUC) of 0.876 (0.852-0.900) on the cognitive state classification task.
UNASSIGNED: The proposed method performed well identifying cognitive decline from the gait video datasets, making it a prospective prototype tool in cognitive assessment.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
8 A method for cabbage root posture recognition based on YOLOv5s.

一种基于 YOLOv5s 的甘蓝根系姿态识别方法影响指数 : 3.776
发表时间：Jul 2024 15
来源期刊：Heliyon PMID：39071611

DOI：10.1016/j.heliyon.2024.e31868
文章类型： Journal Article

高效,无损收获白菜对于保持其风味和质量至关重要。目前白菜收获主要依靠机械化自动采摘方法。然而,大多数现有的卷心菜收获装置的一个显著缺陷是没有一个根部姿势识别系统来及时调整根部姿势，因此影响采收过程中切根的准确性。为了解决这个问题，本研究介绍了一种将深度学习与传统图像处理算法相结合的白菜根姿态识别方法。通过YOLOv5s深度学习模型对白菜的主根感兴趣区域(ROI)区域进行初步检测。随后，传统的图像处理方法，格雷厄姆算法，并采用计算最小外接矩形的方法来具体检测白菜根的倾角。这种方法有效地解决了计算由外叶遮挡引起的根倾角的困难。结果表明，该方法的准确率和召回率分别为98.7%和98.6%。分别,姿态的平均绝对误差为0.80°，平均相对误差为1.34%。将该方法作为机械收获的参考，可以有效减轻白菜的伤害率。
Efficient, non-destructive cabbage harvesting is crucial for preserving its flavor and quality. Current cabbage harvesting mainly relies on mechanized automatic picking methods. However, a notable deficiency in most existing cabbage harvesting devices is the absence of a root posture recognition system to promptly adjust the root posture, consequently impacting the accuracy of root cutting during harvesting. To address this issue, this study introduces a cabbage root posture recognition method that combines deep learning with traditional image processing algorithms. Preliminary detection of the main root Region of Interest (ROI) areas of the cabbage is carried out through the YOLOv5s deep learning model. Subsequently, traditional image processing methods, the Graham algorithm, and the method of calculating the minimum circumscribed rectangle are employed to specifically detect the inclination angle of cabbage roots. This approach effectively addresses the difficulty in calculating the inclination angle of roots caused by occlusion from outer leaves. The results demonstrate that the precision and recall of this method are 98.7 % and 98.6 %, respectively, with an average absolute error of 0.80° and an average relative error of 1.34 % in posture. Using this method as a reference for mechanical harvesting can effectively mitigate cabbage damage rates.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
9 A Lightweight and Efficient Multi-Type Defect Detection Method for Transmission Lines Based on DCP-YOLOv8.

基于 DCP - YOLOv8 的输电线路轻型高效多类型缺陷检测方法 [J]. 影响指数 : 3.847
发表时间：Jul 2024 11
来源期刊：Sensors (Basel) PMID：39065894

DOI：10.3390/s24144491
文章类型： Journal Article

目前,利用AI图像识别技术对海量电网输电线路巡检图片进行智能缺陷检测是一种高效、流行的方法。通常,缺陷检测算法模型的构建有两条技术路线:一是使用轻量级网络,这提高了效率，但它通常只能针对少数类型的缺陷，并可能降低检测精度；另一种是使用复杂的网络模型，这提高了准确性，并且可以同时识别多种类型的缺陷，但是它的计算量大，效率低。为了保持模型的高检测精度以及其轻量化结构,提出了一种轻量高效的基于DCP-YOLOv8的输电线路多类型缺陷检测方法。该方法采用可变形卷积(C2f_DCNv3)来增强缺陷特征提取能力,并设计了一种重新参数化的交叉相位特征融合结构(RCSP),将高层语义特征与低层空间特征进行优化和融合,从而提高模型识别不同尺度缺陷的能力，同时显著降低模型参数；它结合了动态检测头和可变形卷积v3的检测头（DCNv3-Dyhead），以增强特征表达能力和上下文信息的利用率，进一步提高检测精度。实验结果表明,在一个包含20条真实输电线路缺陷的数据集上,该方法将平均准确度(mAP@0.5)提高到72.2%，增长4.3%,与最轻的基线YOLOv8n模型相比；模型参数数量仅为2.8M，减少9.15%,每秒处理帧数(FPS)达到103,满足实时检测需求。在多类型缺陷检测的场景中，它有效地平衡了检测精度和性能与定量泛化。
Currently, the intelligent defect detection of massive grid transmission line inspection pictures using AI image recognition technology is an efficient and popular method. Usually, there are two technical routes for the construction of defect detection algorithm models: one is to use a lightweight network, which improves the efficiency, but it can generally only target a few types of defects and may reduce the detection accuracy; the other is to use a complex network model, which improves the accuracy, and can identify multiple types of defects at the same time, but it has a large computational volume and low efficiency. To maintain the model\'s high detection accuracy as well as its lightweight structure, this paper proposes a lightweight and efficient multi type defect detection method for transmission lines based on DCP-YOLOv8. The method employs deformable convolution (C2f_DCNv3) to enhance the defect feature extraction capability, and designs a re-parameterized cross phase feature fusion structure (RCSP) to optimize and fuse high-level semantic features with low level spatial features, thus improving the capability of the model to recognize defects at different scales while significantly reducing the model parameters; additionally, it combines the dynamic detection head and deformable convolutional v3\'s detection head (DCNv3-Dyhead) to enhance the feature expression capability and the utilization of contextual information to further improve the detection accuracy. Experimental results show that on a dataset containing 20 real transmission line defects, the method increases the average accuracy (mAP@0.5) to 72.2%, an increase of 4.3%, compared with the lightest baseline YOLOv8n model; the number of model parameters is only 2.8 M, a reduction of 9.15%, and the number of processed frames per second (FPS) reaches 103, which meets the real time detection demand. In the scenario of multi type defect detection, it effectively balances detection accuracy and performance with quantitative generalizability.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)
10 A Common Knowledge-Driven Generic Vision Inspection Framework for Adaptation to Multiple Scenarios, Tasks, and Objects.

一种通用的知识驱动的通用视觉检测框架，用于适应多种场景，任务,和对象。影响指数 : 3.847
发表时间：Jun 2024 25
来源期刊：Sensors (Basel) PMID：39000899

DOI：10.3390/s24134120
文章类型： Journal Article

工业制造模式正在经历从以产品为中心向以客户为中心的转变。由定制需求驱动，产品的复杂性和对质量的要求提高了，对传统机器视觉技术的适用性提出了挑战。广泛的研究证明了基于AI的学习和图像处理对特定对象或任务的有效性，但是很少有出版物关注集成产品的复合任务，方法的可追溯性和可改进性，以及不同场景或任务之间知识的提取和交流。为了解决这个问题,本文提出了一种常见的，知识驱动,通用视觉检查框架，目标是将产品检验标准化为信息解耦和自适应度量的过程。基于行业知识和结构化任务，将与任务相关的对象感知计划为多粒度和多模式的渐进对齐。在适当的高级策略和经验下，检查被抽象为多子模式空间组合映射和差异度量的可重构过程。最后,提出了基于历史数据的知识改进和积累策略。实验演示了为复杂产品生成检测管道并通过故障追踪和知识改进不断改进的过程。与（1.767°相比，69.802毫米）和0.883通过最先进的深度学习方法获得，生成的管道实现了从(2.771°，153.584mm)至(1.034°,52.308mm），检测率范围从0.462到0.927。通过验证其他成像方法和工业任务，我们证明了适应性的关键在于挖掘知识的内在共性，多维积累，和重新申请。
The industrial manufacturing model is undergoing a transformation from a product-centric model to a customer-centric one. Driven by customized requirements, the complexity of products and the requirements for quality have increased, which pose a challenge to the applicability of traditional machine vision technology. Extensive research demonstrates the effectiveness of AI-based learning and image processing on specific objects or tasks, but few publications focus on the composite task of the integrated product, the traceability and improvability of methods, as well as the extraction and communication of knowledge between different scenarios or tasks. To address this problem, this paper proposes a common, knowledge-driven, generic vision inspection framework, targeted for standardizing product inspection into a process of information decoupling and adaptive metrics. Task-related object perception is planned into a multi-granularity and multi-pattern progressive alignment based on industry knowledge and structured tasks. Inspection is abstracted as a reconfigurable process of multi-sub-pattern space combination mapping and difference metric under appropriate high-level strategies and experiences. Finally, strategies for knowledge improvement and accumulation based on historical data are presented. The experiment demonstrates the process of generating a detection pipeline for complex products and continuously improving it through failure tracing and knowledge improvement. Compared to the (1.767°, 69.802 mm) and 0.883 obtained by state-of-the-art deep learning methods, the generated pipeline achieves a pose estimation ranging from (2.771°, 153.584 mm) to (1.034°, 52.308 mm) and a detection rate ranging from 0.462 to 0.927. Through verification of other imaging methods and industrial tasks, we prove that the key to adaptability lies in the mining of inherent commonalities of knowledge, multi-dimensional accumulation, and reapplication.

导出

Endnote Noteexpress

更多引用

收藏

翻译标题摘要

我要上传

PDF(Pubmed)

machine vision 关注

1 Automatic recognition system for concrete cracks with support vector machine based on crack features.

2 Study on Calcination Characteristics of Diaspore-Kaolin Bauxite Based on Machine Vision.

3 A lightweight dual-attention network for tomato leaf disease identification.

4 Rapid and non-destructive identification of Panax ginseng origins using hyperspectral imaging, visible light imaging, and X-ray imaging combined with multi-source data fusion strategies.

5 A machine vision tool for multi-color H2O2 sensing by MoOx nanoparticles with oxygen vacancies.

6 GC Snakes: An Efficient and Robust Segmentation Model for Hot Forging Images.

7 Machine vision-based gait scan method for identifying cognitive impairment in older adults.

8 A method for cabbage root posture recognition based on YOLOv5s.

9 A Lightweight and Efficient Multi-Type Defect Detection Method for Transmission Lines Based on DCP-YOLOv8.

10 A Common Knowledge-Driven Generic Vision Inspection Framework for Adaptation to Multiple Scenarios, Tasks, and Objects.