face detection

  • 文章类型: Journal Article
    处理面部特征对于识别社交伙伴(猎物,捕食者,或特定),并识别和准确解释情感表达。在人类和非人类灵长类动物中的大量研究提供了证据,促进了检测面部特征的固有机制的概念。这些机制支持独立于先前经验的面孔表示,对于社会和语言领域的后续发展至关重要。此外,面部处理缺陷是自闭症谱系障碍的可靠生物标志物,早期出现并与症状严重程度相关。面部处理,然而,不仅是人类的特权:其他物种也表现出非凡的面部检测能力。在这次审查中,我们概述了当前有关脊椎动物模型中人脸检测的文献,这些文献可能与自闭症研究有关。
    Processing facial features is crucial to identify social partners (prey, predators, or conspecifics) and recognize and accurately interpret emotional expressions. Numerous studies in both human and non-human primates provided evidence promoting the notion of inherent mechanisms for detecting facial features. These mechanisms support a representation of faces independent of prior experiences and are vital for subsequent development in social and language domains. Moreover, deficits in processing faces are a reliable biomarker of autism spectrum disorder, appearing early and correlating with symptom severity. Face processing, however, is not only a prerogative of humans: other species also show remarkable face detection abilities. In this review, we present an overview of the current literature on face detection in vertebrate models that could be relevant to the study of autism.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    使用现成算法的对象检测系统在部署在复杂场景中时存在失败的趋势。本工作描述了一种检测手术后新生儿(新生儿)面部表情的案例,作为预测和分类新生儿重症监护病房(NICU)严重疼痛的方式。我们的初始测试表明,现成的面部检测器和在成人面部上训练的机器学习算法都无法检测到NICU中新生儿的面部表情。我们通过使用USF-MNPAD-I新生儿面部数据集训练最先进的“You-Only-Look-Once”(YOLO)面部检测模型,提高了这个复杂场景的准确性。在运行时,我们训练的YOLO模型显示,与NICU护士手动疼痛评分相比,新生儿疼痛自动分类的平均平均精度(mAP)和ROC曲线下面积(AUC)的差异为8.6%。鉴于挑战,从手术后新生儿的脸上收集真相的时间和精力,在这里,我们分享用这些面部表情数据训练我们的YOLO模型的权重。这些权重可以促进进一步开发用于检测面部表情的准确策略,它可以用来预测疼痛发作的时间,结合其他感觉方式(身体运动,哭泣频率,生命体征)。对疼痛发作时间的可靠预测反过来创建了一个治疗时间窗口,其中NICU护士和提供者可以实施安全有效的策略来减轻这种脆弱患者人群的严重疼痛。
    There is a tendency for object detection systems using off-the-shelf algorithms to fail when deployed in complex scenes. The present work describes a case for detecting facial expression in post-surgical neonates (newborns) as a modality for predicting and classifying severe pain in the Neonatal Intensive Care Unit (NICU). Our initial testing showed that both an off-the-shelf face detector and a machine learning algorithm trained on adult faces failed to detect facial expression of neonates in the NICU. We improved accuracy in this complex scene by training a state-of-the-art \"You-Only-Look-Once\" (YOLO) face detection model using the USF-MNPAD-I dataset of neonate faces. At run-time our trained YOLO model showed a difference of 8.6% mean Average Precision (mAP) and 21.2% Area under the ROC Curve (AUC) for automatic classification of neonatal pain compared with manual pain scoring by NICU nurses. Given the challenges, time and effort associated with collecting ground truth from the faces of post-surgical neonates, here we share the weights from training our YOLO model with these facial expression data. These weights can facilitate the further development of accurate strategies for detecting facial expression, which can be used to predict the time to pain onset in combination with other sensory modalities (body movements, crying frequency, vital signs). Reliable predictions of time to pain onset in turn create a therapeutic window of time wherein NICU nurses and providers can implement safe and effective strategies to mitigate severe pain in this vulnerable patient population.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    个体认同在生态学和行为学中起着举足轻重的作用,特别是作为理解复杂社会结构的工具。然而,传统的识别方法通常涉及侵入性的物理标签,并且可以证明对动物具有破坏性,对研究人员来说是耗时的。近年来,深度学习在研究中的整合通过复杂任务的自动化提供了新的方法论观点。利用物体检测和识别技术越来越多地被研究人员用来实现对视频镜头的识别。这项研究是对通过深度学习开发用于日本猕猴(Macacafuscata)的人脸检测和个体识别的非侵入性工具的初步探索。这项研究的最终目标是,使用对数据集进行的识别,自动生成所研究人群的社交网络表示。当前的主要结果是有希望的:(i)创建了日本猕猴的面部检测器(Faster-RCNN模型),达到82.2%的准确率,(ii)为高岛猕猴种群创建个人识别器(YOLOv8n模型),准确率达到83%。我们还通过传统的方法创建了一个高岛人口社交网络,基于视频上的共同事件。因此,我们提供了一个基准,将根据该基准评估自动生成的网络的可靠性。这些初步结果证明了这种方法的潜力,可以为科学界提供跟踪日本猕猴个人和社会网络研究的工具。
    Individual identification plays a pivotal role in ecology and ethology, notably as a tool for complex social structures understanding. However, traditional identification methods often involve invasive physical tags and can prove both disruptive for animals and time-intensive for researchers. In recent years, the integration of deep learning in research has offered new methodological perspectives through the automatisation of complex tasks. Harnessing object detection and recognition technologies is increasingly used by researchers to achieve identification on video footage. This study represents a preliminary exploration into the development of a non-invasive tool for face detection and individual identification of Japanese macaques (Macaca fuscata) through deep learning. The ultimate goal of this research is, using identification done on the dataset, to automatically generate a social network representation of the studied population. The current main results are promising: (i) the creation of a Japanese macaques\' face detector (Faster-RCNN model), reaching an accuracy of 82.2% and (ii) the creation of an individual recogniser for the Kōjima Island macaque population (YOLOv8n model), reaching an accuracy of 83%. We also created a Kōjima population social network by traditional methods, based on co-occurrences on videos. Thus, we provide a benchmark against which the automatically generated network will be assessed for reliability. These preliminary results are a testament to the potential of this approach to provide the scientific community with a tool for tracking individuals and social network studies in Japanese macaques.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    全世界年轻人死亡的主要原因之一是车祸,大多数死亡发生在坐在前排乘客座位上的儿童身上,在事故发生时,受到安全气囊的直接撞击,这对13岁以下的儿童来说是致命的。本研究旨在通过使用儿童面部检测系统进行内部监控来提高对这种风险的认识,该系统可提醒驾驶员儿童不应坐在前排乘客座位上。
    该系统包含对收集的数据的处理,深度学习的元素,如迁移学习,微调和面部检测,以一种强大的方式识别儿童的存在,这是通过使用从头开始生成的数据集进行训练来实现的。MobileNetV2架构的使用基于良好的性能,当与此任务的Inception架构比较时,这有助于在树莓派4B上实现最终模型。
    生成的图像数据集由102个空座位组成,71名儿童(0-13岁),96名成年人(14-75岁)。从数据增加来看,成人有2,496张图像,儿童有2,310张图像。没有滑动窗口的面分类给出了98%的准确度和100%的准确度的结果。最后,使用拟议的方法,可以实时检测前排乘客座位上的儿童,每个决策和滑动窗口准则的延迟为1s,达到100%的准确度。
    尽管我们在实验环境中的100%精度有些理想化,因为传感器没有被阳光直射遮挡,也没有部分或完全被运送儿童的车辆中常见的污垢或其他碎片覆盖。本研究表明,在任何汽车上都可以在RaspberryPi4ModelB上实现强大的非侵入性分类系统,以通过深度学习方法(如DeepCNN)检测前座儿童。
    UNASSIGNED: One of the main causes of death worldwide among young people are car crashes, and most of these fatalities occur to children who are seated in the front passenger seat and who, at the time of an accident, receive a direct impact from the airbags, which is lethal for children under 13 years of age. The present study seeks to raise awareness of this risk by interior monitoring with a child face detection system that serves to alert the driver that the child should not be sitting in the front passenger seat.
    UNASSIGNED: The system incorporates processing of data collected, elements of deep learning such as transfer learning, fine-tunning and facial detection to identify the presence of children in a robust way, which was achieved by training with a dataset generated from scratch for this specific purpose. The MobileNetV2 architecture was used based on the good performance shown when compared with the Inception architecture for this task; and its low computational cost, which facilitates implementing the final model on a Raspberry Pi 4B.
    UNASSIGNED: The resulting image dataset consisted of 102 empty seats, 71 children (0-13 years), and 96 adults (14-75 years). From the data augmentation, there were 2,496 images for adults and 2,310 for children. The classification of faces without sliding window gave a result of 98% accuracy and 100% precision. Finally, using the proposed methodology, it was possible to detect children in the front passenger seat in real time, with a delay of 1 s per decision and sliding window criterion, reaching an accuracy of 100%.
    UNASSIGNED: Although our 100% accuracy in an experimental environment is somewhat idealized in that the sensor was not blocked by direct sunlight, nor was it partially or completely covered by dirt or other debris common in vehicles transporting children. The present study showed that is possible the implementation of a robust noninvasive classification system made on Raspberry Pi 4 Model B in any automobile for the detection of a child in the front seat through deep learning methods such as Deep CNN.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    基于事件的摄像机在计算机视觉中的使用是一个不断发展的研究方向。然而,尽管已有关于使用事件相机进行人脸检测的研究,大型数据集的可用性存在巨大差距,该数据集具有事件流上的面部和面部标志的注释,从而阻碍了这个方向的应用发展。在这项工作中,我们通过发布第一个大型且多样化的数据集(事件流中的面孔)来解决这个问题,该数据集的持续时间为689分钟,用于直接基于事件的相机输出中的人脸和人脸标志检测。此外,本文介绍了在我们的数据集上训练的12个模型,以预测边界框和面部标志坐标,其mAP50得分超过90%。我们还使用我们的模型使用基于事件的相机进行了实时检测的演示。
    The use of event-based cameras in computer vision is a growing research direction. However, despite the existing research on face detection using the event camera, a substantial gap persists in the availability of a large dataset featuring annotations for faces and facial landmarks on event streams, thus hampering the development of applications in this direction. In this work, we address this issue by publishing the first large and varied dataset (Faces in Event Streams) with a duration of 689 min for face and facial landmark detection in direct event-based camera outputs. In addition, this article presents 12 models trained on our dataset to predict bounding box and facial landmark coordinates with an mAP50 score of more than 90%. We also performed a demonstration of real-time detection with an event-based camera using our models.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    在本文中,提出了一种基于人脸检测和头部姿态估计的学习状态评价方法。该方法适用于计算能力较弱的移动设备,因此有必要对人脸检测和头部姿态估计网络的参数进行控制。首先,我们提出了一个鬼影和注意力模块(GA)基本人脸检测网络(GA-Face)。GA-Face通过鬼影模块减少了特征提取网络中的参数数量和计算量,并通过无参数的注意力机制将网络集中在重要特征上。我们还提出了一个轻量级的双分支(DB)头部姿态估计网络:DB-Net。最后,提出了一种学生学习状态评价算法。该算法可以根据学生的面部与屏幕之间的距离来评估学生的学习状态,以及他们的头部姿势。我们在几个标准人脸检测数据集和标准头部姿态估计数据集上验证了所提出的GA-Face和DB-Net的有效性。最后,我们验证,通过实际案例,认为所提出的在线学习状态评估方法可以有效地评估学生的注意力和专注度,and,由于其计算复杂度低,不会干扰学生的学习过程。
    In this paper, we propose a learning state evaluation method based on face detection and head pose estimation. This method is suitable for mobile devices with weak computing power, so it is necessary to control the parameter quantity of the face detection and head pose estimation network. Firstly, we propose a ghost and attention module (GA) base face detection network (GA-Face). GA-Face reduces the number of parameters and computation in the feature extraction network through the ghost module, and focuses the network on important features through a parameter-free attention mechanism. We also propose a lightweight dual-branch (DB) head pose estimation network: DB-Net. Finally, we propose a student learning state evaluation algorithm. This algorithm can evaluate the learning status of students based on the distance between their faces and the screen, as well as their head posture. We validate the effectiveness of the proposed GA-Face and DB-Net on several standard face detection datasets and standard head pose estimation datasets. Finally, we validate, through practical cases, that the proposed online learning state assessment method can effectively assess the level of student attention and concentration, and, due to its low computational complexity, will not interfere with the student\'s learning process.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    这项研究开创了一种加强在线投票系统的创新方法,利用RSA(Rivest-Shamir-Adleman)加密和解密技术实现强大的数据保护。通过高级安全层的全面合并,包括MobileFaceNet驱动的面部验证,设备指纹匹配,和多因素身份验证,该系统为网络漏洞提供了一个弹性屏障。通过利用Firebase数据库,用户信息被安全地存储和认证,肯定他们在民主进程中的关键作用。RSA加密和解密的交响乐围绕数据传输和存储编排了一个强大的堡垒,确保抵御数字威胁的安全。投票技术的这种范式转变不仅努力提升安全性,而且增强可访问性和便利性,最终有助于在线投票系统的发展,并在数字时代提高参与率并降低相关成本。
    This study pioneers an innovative approach to fortifying online voting systems, leveraging RSA (Rivest-Shamir-Adleman) encryption and decryption techniques for robust data protection. Through a comprehensive amalgamation of advanced security layers, including MobileFaceNet-driven face verification, device fingerprint matching, and multi-factor authentication, this system engenders a resilient shield against cyber vulnerabilities. By harnessing a Firebase database, user information is securely stored and authenticated, affirming their pivotal role in the democratic process. The symphony of RSA encryption and decryption orchestrates a formidable fortress around data transmission and storage, ensuring impregnable security against digital threats. This paradigm shift in voting technology strives to not only elevate security but also enhance accessibility and convenience, ultimately contributing to the evolution of online voting systems and fostering greater participation rates and reducing associated costs in the digital era.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    人类视觉系统对环境中的面部非常敏感,以至于它可以在日常物品中产生虚幻面孔的感知。越来越多的研究表明,虚幻的面孔和真实的面孔是由相似的感知和神经机制处理的,但是这种相似性是否延伸到视觉注意力尚不清楚。视觉搜索研究表明,当对象的类型变化以匹配虚幻面孔中的对象时,虚幻面孔相对于对象具有搜索优势(例如,椅子,胡椒,时钟)(Keys等人。,2021)。这里,当与属于单一类别(花朵)的对象进行比较时,我们研究了虚幻面孔相对于对象的搜索优势是否仍然存在。在三个实验中,我们比较了对虚幻面孔的视觉搜索,真实的面孔,变量对象,和统一的对象(花)。与所有其他类型的目标相比,搜索真实面孔是最好的。相比之下,寻找虚幻的面孔只会比寻找可变的对象更好,不是统一的物体。此结果显示了对虚幻面孔的有限视觉搜索优势,并表明在视觉注意力中,虚幻面孔可能不会像真实面孔那样进行处理。
    The human visual system is very sensitive to the presence of faces in the environment, so much so that it can produce the perception of illusory faces in everyday objects. Growing research suggests that illusory faces and real faces are processed by similar perceptual and neural mechanisms, but whether this similarity extends to visual attention is less clear. A visual search study showed that illusory faces have a search advantage over objects when the types of objects vary to match the objects in the illusory faces (e.g., chair, pepper, clock) (Keys et al., 2021). Here, we examine whether the search advantage for illusory faces over objects remains when compared against objects that belong to a single category (flowers). In three experiments, we compared visual search of illusory faces, real faces, variable objects, and uniform objects (flowers). Search for real faces was best compared with all other types of targets. In contrast, search for illusory faces was only better than search for variable objects, not uniform objects. This result shows a limited visual search advantage for illusory faces and suggests that illusory faces may not be processed like real faces in visual attention.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    通过整合物联网技术,智能门锁可以提供更大的便利,安全,和远程访问。本文提出了一种新颖的智能门框架,该框架结合了基于毫米波雷达和摄像头传感器的人脸检测和识别技术。拟议的框架旨在提高摄像机的准确性和某些限制所带来的安全性,如重叠和照明条件。通过集成毫米波雷达和基于摄像头的人脸检测和识别算法,该系统可以准确地检测和识别接近门的人,提供无缝和安全的访问。该框架包括四个关键组件:基于毫米波雷达的人员检测,相机准备和集成,人员识别,和门锁控制。实验表明,该框架可以用于智能家居。
    By integrating IoT technology, smart door locks can provide greater convenience, security, and remote access. This paper presents a novel framework for smart doors that combines face detection and recognition techniques based on mmWave radar and camera sensors. The proposed framework aims to improve the accuracy and some security aspects arising from some limitations of the camera, such as overlapping and lighting conditions. By integrating mmWave radar and camera-based face detection and recognition algorithms, the system can accurately detect and identify people approaching the door, providing seamless and secure access. This framework includes four key components: person detection based on mmWave radar, camera preparation and integration, person identification, and door lock control. The experiments show that the framework can be useful for a smart home.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    压力是影响当今许多人的因素,并且是造成生活质量差的许多原因的原因。出于这个原因,有必要能够确定一个人是否有压力。因此,有必要开发非侵入性的工具,无害,并且易于使用。本文介绍了一种通过在简短的Trier社会压力测试中使用机器学习自动检测热图像中感兴趣的面部区域来对人体压力进行分类的方法。感兴趣的五个区域,即鼻子,右脸颊,左脸颊,前额,还有下巴,自动检测。然后提取这些区域中每个区域的温度,并将其用作分类器的输入,特别是支持向量机,它输出三个状态:基线,强调,和放松。该提案是在25名参与者的热图像上开发和测试的,这些参与者接受了压力诱导方案,然后进行了放松技术。在测试了所开发的方法之后,准确率为95.4%,误差率为4.5%。本研究中提出的方法允许基于面部热图像对人的压力状态进行自动分类。这是一种适用于专家的创新工具。此外,由于其鲁棒性,它也适用于在线应用。
    Stress is a factor that affects many people today and is responsible for many of the causes of poor quality of life. For this reason, it is necessary to be able to determine whether a person is stressed or not. Therefore, it is necessary to develop tools that are non-invasive, innocuous, and easy to use. This paper describes a methodology for classifying stress in humans by automatically detecting facial regions of interest in thermal images using machine learning during a short Trier Social Stress Test. Five regions of interest, namely the nose, right cheek, left cheek, forehead, and chin, are automatically detected. The temperature of each of these regions is then extracted and used as input to a classifier, specifically a Support Vector Machine, which outputs three states: baseline, stressed, and relaxed. The proposal was developed and tested on thermal images of 25 participants who were subjected to a stress-inducing protocol followed by relaxation techniques. After testing the developed methodology, an accuracy of 95.4% and an error rate of 4.5% were obtained. The methodology proposed in this study allows the automatic classification of a person\'s stress state based on a thermal image of the face. This represents an innovative tool applicable to specialists. Furthermore, due to its robustness, it is also suitable for online applications.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号