Image Processing

图像处理
  • 文章类型: Journal Article
    公共安全是一个至关重要的问题,通常通过公共场所入口的安全检查来解决,让受过训练的人员或X射线扫描仪检测违禁物品。然而,很多地方,比如医院,学校,活动中心缺乏这样的资源,冒着安全漏洞的风险。即使使用X射线扫描仪或手动检查,漏洞可以被恶意意图的个人利用,带来重大安全隐患。此外,传统方法,依靠人工检查和常规图像处理技术,通常效率低下,容易出现高错误率。为了减轻这些风险,我们提出了一个实时检测模型-EnhanceNet,它使用集成到YOLOv4中的自定义扩展增强池网络(SEP-Net)。创新的SEP-Net增强了特征表示和定位精度,显著有助于模型在检测违禁物品方面的功效。我们注释了九个类的自定义数据集,并使用不同的输入大小(608和416)评估了我们的模型。608输入大小实现了74.10%的平均平均精度(mAP),检测速度为每秒22.3帧(FPS)。416输入大小显示出优越的性能,MAP为76.75%,检测速度为27.1FPS。这些证明了我们的模型是准确和有效的,使它们适合实时应用。
    Public safety is a critical concern, typically addressed through security checks at entrances of public places, involving trained officers or X-ray scanning machines to detect prohibited items. However, many places like hospitals, schools, and event centres lack such resources, risking security breaches. Even with X-ray scanners or manual checks, gaps can be exploited by individuals with malicious intent, posing significant security risks. Additionally, traditional methods, relying on manual inspections and conventional image processing techniques, are often inefficient and prone to high error rates. To mitigate these risks, we propose a real-time detection model - EnhanceNet using a customized Scale-Enhanced Pooling Network (SEP-Net) integrated into the YOLOv4. The innovative SEP-Net enhances feature representation and localization accuracy, significantly contributing to the model\'s efficacy in detecting prohibited items. We annotated a custom dataset of nine classes and evaluated our models using different input sizes (608 and 416). The 608 input size achieved a mean Average Precision (mAP) of 74.10% with a detection speed of 22.3 Frames per Second (FPS). The 416 input size showed superior performance, achieving a mAP of 76.75% and a detection speed of 27.1 FPS. These demonstrate that our models are accurate and efficient, making them suitable for real-time applications.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    肺部图像分割在计算机辅助肺部疾病诊断和治疗中起着重要作用。
    本文探讨了通过生成对抗网络进行肺部CT图像分割的方法。我们采用了各种生成对抗网络,并利用它们的图像平移功能来执行图像分割。采用生成对抗网络将原始肺部图像转换为分割图像。
    在真实的肺部图像数据集上测试了基于生成对抗网络的分割方法。实验结果表明,该方法优于最先进的方法。
    基于生成对抗网络的方法对肺部图像分割有效。
    UNASSIGNED: Lung image segmentation plays an important role in computer-aid pulmonary disease diagnosis and treatment.
    UNASSIGNED: This paper explores the lung CT image segmentation method by generative adversarial networks. We employ a variety of generative adversarial networks and used their capability of image translation to perform image segmentation. The generative adversarial network is employed to translate the original lung image into the segmented image.
    UNASSIGNED: The generative adversarial networks-based segmentation method is tested on real lung image data set. Experimental results show that the proposed method outperforms the state-of-the-art method.
    UNASSIGNED: The generative adversarial networks-based method is effective for lung image segmentation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    为了解决果园苹果自动采收的问题,我们提出了一种用于识别苹果和计算苹果直径的YOLOv5-RACF算法。该算法采用机器人操作数字系统(ROS)来控制机器人的运动系统,激光雷达映射,导航,以及机械臂的姿势和抓取操作,实现自动化苹果收获和放置。测试是在实际的果园环境中进行的。该算法模型的平均苹果检测精度(mAP@0.5)为98.748%,a(mAP@0.5:0.95)为90.02%。计算一个苹果直径的时间为0.13s,测量精度在1-3毫米的误差范围内。机器人平均需要9秒来挑选一个苹果并返回到最初的姿势。这些结果证明了该系统在实际农业环境中的效率和可靠性。
    To address the issue of automated apple harvesting in orchards, we propose a YOLOv5-RACF algorithm for identifying apples and calculating apple diameters. This algorithm employs the robot operating dystem (ROS) to control the robot\'s locomotion system, Lidar mapping, and navigation, as well as the robotic arm\'s posture and grasping operations, achieving automated apple harvesting and placement. The tests were conducted in an actual orchard environment. The algorithm model achieved an average apple detection accuracy (mAP@0.5) of 98.748% and a (mAP@0.5:0.95) of 90.02%. The time to calculate the diameter of one apple was 0.13 s, with a measurement accuracy within an error range of 1-3 mm. The robot takes an average of 9 s to pick an apple and return to the initial pose. These results demonstrate the system\'s efficiency and reliability in real agricultural environments.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    为了加强摆动电弧窄间隙焊接中焊炬水平和垂直位置的同步检测,提出了一种火炬姿态检测(TPD)方法。这种方法利用被动视觉传感来捕获沟槽侧壁上的电弧图像,采用先进的图像处理方法提取和拟合圆弧轮廓。通过圆弧轮廓拟合线确定圆弧轮廓中心点和最高点的坐标。焊炬中心位置是根据相邻焊接图像中的电弧轮廓中心的平均水平坐标计算得出的,而高度位置是从弧最高点的垂直坐标确定的。在可变和恒定坡口焊接条件下的实验验证表明,TPD方法检测焊炬中心位置的精度在0.32mm以内。这种方法消除了构造导线中心线的需要,这是以前方法的要求,从而降低了导线直线度对检测精度的影响。提出的TPD方法成功地实现了火炬中心和高度位置的同时检测,为摆动电弧窄间隙焊接的智能检测和自适应控制奠定了基础。
    To enhance the synchronous detection of the horizontal and vertical positions of the torch in swing arc narrow gap welding, a torch pose detection (TPD) method is proposed. This approach utilizes passive visual sensing to capture images of the arc on the groove sidewall, using advanced image processing methods to extract and fit the arc contour. The coordinates of the arc contour center point and the highest point are determined through the arc contour fitting line. The torch center position is calculated from the average horizontal coordinates of the arc contour centers in adjacent welding images, while the height position is determined from the vertical coordinate of the arc\'s highest point. Experimental validation in both variable and constant groove welding conditions demonstrated the TPD method\'s accuracy within 0.32 mm for detecting the torch center position. This method eliminates the need to construct the wire centerline, which was a requirement in previous approaches, thereby reducing the impact of wire straightness on detection accuracy. The proposed TPD method successfully achieves simultaneous detection of the torch center and height positions, laying the foundation for intelligent detection and adaptive control in swing arc narrow gap welding.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    自闭症谱系障碍(ASD)是一种复杂的心理综合征,其特征是在社交互动中持续存在困难。限制行为,演讲,和非语言交流。这种疾病的影响和症状的严重程度因人而异。在大多数情况下,ASD的症状出现在2至5岁,并持续到整个青春期和成年期。虽然这种疾病不能完全治愈,研究表明,早期发现这种综合征可以帮助维持儿童的行为和心理发展。专家目前正在研究各种机器学习方法,特别是卷积神经网络,加快筛选过程。卷积神经网络被认为是诊断ASD的有前途的框架。本研究采用了不同的预训练卷积神经网络,如ResNet34、ResNet50、AlexNet、MobileNetV2、VGG16和VGG19用于诊断ASD并比较它们的性能。将迁移学习应用于研究中包含的每个模型,以获得比初始模型更高的结果。所提出的ResNet50模型实现了最高的精度,92%,与其他迁移学习模型相比。所提出的方法在准确性和计算成本方面也优于最先进的模型。
    Autism spectrum disorder (ASD) is a complex psychological syndrome characterized by persistent difficulties in social interaction, restricted behaviours, speech, and nonverbal communication. The impacts of this disorder and the severity of symptoms vary from person to person. In most cases, symptoms of ASD appear at the age of 2 to 5 and continue throughout adolescence and into adulthood. While this disorder cannot be cured completely, studies have shown that early detection of this syndrome can assist in maintaining the behavioural and psychological development of children. Experts are currently studying various machine learning methods, particularly convolutional neural networks, to expedite the screening process. Convolutional neural networks are considered promising frameworks for the diagnosis of ASD. This study employs different pre-trained convolutional neural networks such as ResNet34, ResNet50, AlexNet, MobileNetV2, VGG16, and VGG19 to diagnose ASD and compared their performance. Transfer learning was applied to every model included in the study to achieve higher results than the initial models. The proposed ResNet50 model achieved the highest accuracy, 92%, compared to other transfer learning models. The proposed method also outperformed the state-of-the-art models in terms of accuracy and computational cost.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在将倾斜的柔性针插入软组织期间的偏转建模对于在经皮活检手术期间将机器人辅助的柔性针插入人体内的特定目标位置至关重要。本文提出了一种基于切削力识别的力学模型,用于预测柔性针在软组织中的挠度。与其他型号不同,该方法不需要测量组织的杨氏模量(E)和泊松比(ν),这需要复杂的硬件来获得。在模型中,针穿刺过程被离散成一系列深度均匀的穿刺步骤。针被简化为由一系列虚拟弹簧支撑的悬臂梁,组织刚度对针头变形的影响用虚拟弹簧的弹簧刚度系数表示。通过理论建模和实验参数辨识,获得弹簧刚度系数,从而对针的偏转进行建模。为了验证模型的准确性,将预测的模型结果与聚乙烯醇(PVA)凝胶样品中穿刺实验的挠度进行比较,模型预测的平均最大误差范围在0.606±0.167mm和1.005±0.174mm之间,表明该模型可以成功预测针的挠度。这项工作将有助于设计自动控制策略的针。
    The deflection modeling during the insertion of bevel-tipped flexible needles into soft tissues is crucial for robot-assisted flexible needle insertion into specific target locations within the human body during percutaneous biopsy surgery. This paper proposes a mechanical model based on cutting force identification to predict the deflection of flexible needles in soft tissues. Unlike other models, this method does not require measuring Young\'s modulus (E) and Poisson\'s ratio (ν) of tissues, which require complex hardware to obtain. In the model, the needle puncture process is discretized into a series of uniform-depth puncture steps. The needle is simplified as a cantilever beam supported by a series of virtual springs, and the influence of tissue stiffness on needle deformation is represented by the spring stiffness coefficient of the virtual spring. By theoretical modeling and experimental parameter identification of cutting force, the spring stiffness coefficients are obtained, thereby modeling the deflection of the needle. To verify the accuracy of the proposed model, the predicted model results were compared with the deflection of the puncture experiment in polyvinyl alcohol (PVA) gel samples, and the average maximum error range predicted by the model was between 0.606 ± 0.167 mm and 1.005 ± 0.174 mm, which showed that the model can successfully predict the deflection of the needle. This work will contribute to the design of automatic control strategies for needles.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    河豚毒素(TTX),一种致命的神经毒素,对人类健康构成严重威胁。可用的光谱方法受到诸如复杂程序和不充分的现场能力的限制。在这项研究中,我们提出了一种使用Fe3O4@Cu作为催化生物传感器结合SERS的方法,用于TTX检测的比色法和图像处理。整合适体会放大系统的特异性并掩盖Fe3O4@Cu的催化活性。Fe3O4@Cu在H2O2-TMB反响中的催化效力可以量化体系中TTX的浓度。因此,氧化TMB(oxTMB)导致SERS信号的产生和变化,比色法和图像处理,实现TTX的三通道定量检测。在最优条件下,已建立的SERS的检测限,比色法和图像处理分别为0.055、2.127和0.243ng/mL,分别。这种三通道生物传感器应用于实际样品,提供一个准确的,用于现场TTX检测的稳定且适应性强的替代方案。
    Tetrodotoxin (TTX), a lethal neurotoxin, poses a grave threat to human health. The available spectroscopic methods suffer from limitations such as complex procedures and inadequate on-site capabilities. In this study, we proposed a method using Fe3O4@Cu as a catalytic biosensor combined with SERS, colorimetry and image processing for TTX detection. Integrating the aptamer amplifies the specificity of the system and masks the catalytic activity of Fe3O4@Cu. The catalytic efficiency of Fe3O4@Cu in the H2O2-TMB reaction can quantify the concentration of TTX in the system. Consequently, oxidation of TMB (oxTMB) led to the generation and change of signals for SERS, colorimetry and image processing, enabling a three-channel quantitative detection of TTX. Under the optimal conditions, the detection limit of established SERS, colorimetry and image processing were 0.055, 2.127 and 0.243 ng/mL, respectively. This three-channel biosensor was applied to real samples, providing an accurate, stable and adaptable alternative for on-site TTX detection.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    在完成模拟时,实现准确的结果可能是具有挑战性的材料去除量和有限的测量范围的表面微形貌仪器。为了克服这些限制,本文提出了一种结合图像拼接和小波分解技术的高保真建模方法。通过图像特征提取,重叠特征匹配,特征融合,和拼接效果评价。在此基础上,采用小波分解方法根据检测信号各自的频率进行分离,允许建立基准面和粗糙度表面。通过泊松重建算法将点云模型转换为连续的几何模型。在案例研究中,使用ZeGagePlus光学轮廓仪收集桶精加工后的铝合金片材的四个样品图像。每个图像具有834.37μmX834.37μm的实际尺寸。随后,在物理和模拟实验之间进行了比较。结果清楚地表明,所提出的方法有可能提高30%以上的精加工模拟的准确性。得到的模型和零件的实际表面之间的误差可以控制在1μm以内。
    In finishing simulations, achieving accurate results can be challenging due to the minimal amount of material removal and the limited measurement range of surface micro-topography instruments. To overcome these limitations, a novel high-fidelity modeling method combining image mosaic and wavelet decomposition technologies is proposed in this paper. We achieve the stitching of narrow field and high pixel micro morphology images through four steps: image feature extraction, overlapped feature matching, feature fusion, and stitching effect evaluation. On this basis, the wavelet decomposition method is employed to separate detection signals based on their respective frequencies, allowing the establishment of a datum plane and a roughness surface. The point cloud model undergoes a transformation into a continuous geometric model via the Poisson reconstruction algorithm. In the case study, four sample images of an aluminum alloy sheet after barrel finishing were collected using the ZeGage Plus optical profiler. Each image has an actual size of 834.37 μm × 834.37 μm. Subsequently, a comparison was carried out between the physical and simulation experiments. The results clearly indicate that the proposed method has the potential to enhance the accuracy of the finishing simulation by over 30%. The error between the resulting model and the actual surface of the part can be controlled within 1 μm.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    阴霾天气会降低图像质量,导致图像变得模糊,对比度降低。这使得物体边缘和特征不清楚,导致较低的检测精度和可靠性。为了增强除霾效果,我们提出了一种基于编码器-解码器范式(UIDF-Net)的图像去雾和融合网络。该网络利用图像融合模块(MDL-IFM)来融合去雾图像的特征,产生更清晰的结果。此外,为了更好地提取雾霾信息,我们引入了一种雾霾编码器(Mist-Encode),可以有效地处理图像的不同频率特征,提高模型在图像去雾任务中的性能。实验结果表明,与现有算法相比,该模型在室外数据集上具有更好的去雾性能。
    Haze weather deteriorates image quality, causing images to become blurry with reduced contrast. This makes object edges and features unclear, leading to lower detection accuracy and reliability. To enhance haze removal effectiveness, we propose an image dehazing and fusion network based on the encoder-decoder paradigm (UIDF-Net). This network leverages the Image Fusion Module (MDL-IFM) to fuse the features of dehazed images, producing clearer results. Additionally, to better extract haze information, we introduce a haze encoder (Mist-Encode) that effectively processes different frequency features of images, improving the model\'s performance in image dehazing tasks. Experimental results demonstrate that the proposed model achieves superior dehazing performance compared to existing algorithms on outdoor datasets.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在粒子图像测速(PIV)实验中,在拍摄或传输粒子图像时,粒子图像中不可避免地存在背景噪声,模糊了粒子图像,减少图像的信息熵,最后使得到的流场不准确。本研究以一幅低质量原始粒子图像为研究对象,采用基于小波分解和重构的频域处理方法对粒子图像进行预处理。采用信息熵分析法对图像处理效果进行评价。结果表明,有效地提取和增强了原始粒子图像中代表粒子图像细节的有用高频粒子信息,图像背景噪声明显减弱。然后,图像的信息熵分析表明,与未经处理的原始粒子图像相比,重建的粒子图像包含更有效的粒子细节,具有更高的信息熵。基于重建的粒子图像,可以在较低的误差范围内获得更准确的流场。
    In particle image velocimetry (PIV) experiments, background noise inevitably exists in the particle images when a particle image is being captured or transmitted, which blurs the particle image, reduces the information entropy of the image, and finally makes the obtained flow field inaccurate. Taking a low-quality original particle image as the research object in this research, a frequency domain processing method based on wavelet decomposition and reconstruction was applied to perform particle image pre-processing. Information entropy analysis was used to evaluate the effect of image processing. The results showed that useful high-frequency particle information representing particle image details in the original particle image was effectively extracted and enhanced, and the image background noise was significantly weakened. Then, information entropy analysis of the image revealed that compared with the unprocessed original particle image, the reconstructed particle image contained more effective details of the particles with higher information entropy. Based on reconstructed particle images, a more accurate flow field can be obtained within a lower error range.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号