Automated facial recognition

自动面部识别
  • 文章类型: Journal Article
    威廉姆斯-贝伦综合征(WBS)是一种罕见的遗传性疾病,以特殊的面部完形为特征,延迟发展,和主动脉瓣上狭窄或/和肺动脉分支狭窄。我们的目标是开发和优化准确的面部识别模型,以帮助诊断WBS,并通过使用五折交叉验证和外部测试集来评估其有效性。我们使用了135例WBS患者的954张图像,124名患有其他遗传疾病的患者,183个健康的孩子训练集包括104例WBS病例的852张图像,91例其他遗传性疾病,2017年9月至2021年12月在广东省人民医院就诊的145名健康儿童。我们通过使用EfficientNet-b3,ResNet-50,VGG-16,VGG-16BN构建了六个WBS面部识别的二元分类模型,VGG-19和VGG-19BN。迁移学习用于预先训练模型,每个模型都用可变余弦学习率进行了修改。首先通过使用五折交叉验证来评估每个模型,然后在外部测试集上进行评估。后者包含102张患有WBS的31名儿童的图像,33名患有其他遗传性疾病的儿童,38个健康的孩子为了将这些识别模型的能力与人类专家在识别WBS案例方面的能力进行比较,我们招募了两名儿科医生,一位儿科心脏病专家,和儿科遗传学家仅根据他们的面部图像来识别WBS患者。我们使用EfficientNet-b3,ResNet-50,VGG-16,VGG-16BN构建了六个面部识别模型来诊断WBS,VGG-19和VGG-19BN。基于VGG-19BN的模型在五重交叉验证方面取得了最佳性能,准确率为93.74%±3.18%,精度为94.93%±4.53%,特异性96.10%±4.30%,F1评分为91.65%±4.28%,而VGG-16BN模型达到了91.63%±5.96%的最高召回值。VGG-19BN型号在外部测试集上也取得了最佳性能,准确率为95.10%,精度100%,召回83.87%,特异性为93.42%,F1得分为91.23%。人类专家在外部测试集上的最佳性能产生了准确性值,精度,召回,特异性,F1得分为77.45%,60.53%,77.42%,83.10%,和66.67%,分别。每个人类专家的F1得分均低于EfficientNet-b3(84.21%),ResNet-50(74.51%),VGG-16(85.71%),VGG-16BN(85.71%),VGG-19(83.02%),和VGG-19BN(91.23%)型号。
    结论:结果表明,面部识别技术可用于准确诊断WBS患者。基于VGG-19BN的面部识别模型在其临床诊断中起着至关重要的作用。它们的性能可以通过扩展训练数据集的大小来提高,优化所应用的CNN架构,并用可变余弦学习率修改它们。
    背景:•WBS的面部完形,通常被描述为“小精灵,“包括宽阔的前额,眶周浮肿,扁平的鼻梁,丰满的脸颊,还有一个小下巴.•最近的研究已经证明了深度卷积神经网络作为WBS诊断工具的面部识别的潜力。
    背景:•本研究开发了六种面部识别模型,EfficientNet-b3,ResNet-50,VGG-16,VGG-16BN,VGG-19和VGG-19BN,改善WBS诊断。•VGG-19BN模型实现了最佳性能,准确率为95.10%,特异性为93.42%。基于VGG-19BN的人脸识别模型在WBS的临床诊断中起着至关重要的作用。
    Williams-Beuren syndrome (WBS) is a rare genetic disorder characterized by special facial gestalt, delayed development, and supravalvular aortic stenosis or/and stenosis of the branches of the pulmonary artery. We aim to develop and optimize accurate models of facial recognition to assist in the diagnosis of WBS, and to evaluate their effectiveness by using both five-fold cross-validation and an external test set. We used a total of 954 images from 135 patients with WBS, 124 patients suffering from other genetic disorders, and 183 healthy children. The training set comprised 852 images of 104 WBS cases, 91 cases of other genetic disorders, and 145 healthy children from September 2017 to December 2021 at the Guangdong Provincial People\'s Hospital. We constructed six binary classification models of facial recognition for WBS by using EfficientNet-b3, ResNet-50, VGG-16, VGG-16BN, VGG-19, and VGG-19BN. Transfer learning was used to pre-train the models, and each model was modified with a variable cosine learning rate. Each model was first evaluated by using five-fold cross-validation and then assessed on the external test set. The latter contained 102 images of 31 children suffering from WBS, 33 children with other genetic disorders, and 38 healthy children. To compare the capabilities of these models of recognition with those of human experts in terms of identifying cases of WBS, we recruited two pediatricians, a pediatric cardiologist, and a pediatric geneticist to identify the WBS patients based solely on their facial images. We constructed six models of facial recognition for diagnosing WBS using EfficientNet-b3, ResNet-50, VGG-16, VGG-16BN, VGG-19, and VGG-19BN. The model based on VGG-19BN achieved the best performance in terms of five-fold cross-validation, with an accuracy of 93.74% ± 3.18%, precision of 94.93% ± 4.53%, specificity of 96.10% ± 4.30%, and F1 score of 91.65% ± 4.28%, while the VGG-16BN model achieved the highest recall value of 91.63% ± 5.96%. The VGG-19BN model also achieved the best performance on the external test set, with an accuracy of 95.10%, precision of 100%, recall of 83.87%, specificity of 93.42%, and F1 score of 91.23%. The best performance by human experts on the external test set yielded values of accuracy, precision, recall, specificity, and F1 scores of 77.45%, 60.53%, 77.42%, 83.10%, and 66.67%, respectively. The F1 score of each human expert was lower than those of the EfficientNet-b3 (84.21%), ResNet-50 (74.51%), VGG-16 (85.71%), VGG-16BN (85.71%), VGG-19 (83.02%), and VGG-19BN (91.23%) models.
    CONCLUSIONS: The results showed that facial recognition technology can be used to accurately diagnose patients with WBS. Facial recognition models based on VGG-19BN can play a crucial role in its clinical diagnosis. Their performance can be improved by expanding the size of the training dataset, optimizing the CNN architectures applied, and modifying them with a variable cosine learning rate.
    BACKGROUND: • The facial gestalt of WBS, often described as \"elfin,\" includes a broad forehead, periorbital puffiness, a flat nasal bridge, full cheeks, and a small chin. • Recent studies have demonstrated the potential of deep convolutional neural networks for facial recognition as a diagnostic tool for WBS.
    BACKGROUND: • This study develops six models of facial recognition, EfficientNet-b3, ResNet-50, VGG-16, VGG-16BN, VGG-19, and VGG-19BN, to improve WBS diagnosis. • The VGG-19BN model achieved the best performance, with an accuracy of 95.10% and specificity of 93.42%. The facial recognition model based on VGG-19BN can play a crucial role in the clinical diagnosis of WBS.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    随着技术的不断进步,生命科学学科发挥着越来越重要的作用,其中人工智能在医疗领域的应用越来越受到关注。贝尔面部麻痹,以面部肌肉无力或瘫痪为特征的神经系统疾病,对患者的面部表情和咀嚼能力产生深远的影响,从而对他们的整体生活质量和心理健康造成相当大的困扰。在这项研究中,我们设计了一个面部属性识别模型专门为个人与贝尔的面部麻痹。该模型利用增强的SSD网络和科学计算对患者病情进行分级评估。通过用更高效的骨干取代VGG网络,我们提高了模型的精度,并显著降低了其计算负担。结果表明,改进后的SSD网络在光分类中的平均精度为87.9%,中度和重度面神经麻痹,并有效地对面神经麻痹患者进行分类,科学计算也提高了分类的精度。这也是本文最重要的贡献之一,为未来智能诊断和治疗以及渐进式康复的研究提供了智能手段和客观数据。
    With the continuous progress of technology, the subject of life science plays an increasingly important role, among which the application of artificial intelligence in the medical field has attracted more and more attention. Bell facial palsy, a neurological ailment characterized by facial muscle weakness or paralysis, exerts a profound impact on patients\' facial expressions and masticatory abilities, thereby inflicting considerable distress upon their overall quality of life and mental well-being. In this study, we designed a facial attribute recognition model specifically for individuals with Bell\'s facial palsy. The model utilizes an enhanced SSD network and scientific computing to perform a graded assessment of the patients\' condition. By replacing the VGG network with a more efficient backbone, we improved the model\'s accuracy and significantly reduced its computational burden. The results show that the improved SSD network has an average precision of 87.9% in the classification of light, middle and severe facial palsy, and effectively performs the classification of patients with facial palsy, where scientific calculations also increase the precision of the classification. This is also one of the most significant contributions of this article, which provides intelligent means and objective data for future research on intelligent diagnosis and treatment as well as progressive rehabilitation.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:努南综合征(NS)是一种罕见的遗传性疾病,患有这种疾病的患者表现出面部形态,其特征是前额高,超端粒,上睑下垂,内上皮褶皱,向下倾斜的睑裂,高度拱形的腭,一个圆形的鼻尖,耳朵向后旋转。面部分析技术最近已被用于识别许多遗传综合征(GS)。然而,很少有研究根据受试者的面部特征来研究NS的识别。
    目的:本研究开发了先进的模型来提高NS诊断的准确性。
    方法:本研究共纳入1,892人,包括233名NS患者,863名患有其他GSs的患者,796名健康儿童。我们为每个受试者拍摄了1到10张正面照片来建立一个数据集,然后应用多任务卷积神经网络(MTCNN)进行数据预处理,以生成具有五个关键面部标志的标准化输出。ImageNet数据集用于预训练网络,以便它可以捕获可概括的特征并最大程度地减少数据浪费。随后,我们基于VGG16、VGG19、VGG16-BN构建了七个面部识别模型,VGG19-BN,ResNet50、MobileNet-V2和挤压和激励网络(SENet)架构。评估了七个模型的识别性能,并与六个医生的识别性能进行了比较。
    结果:所有模型都表现出很高的准确性,精度,和特异性识别NS患者。VGG19-BN型号提供了最佳的整体性能,准确率为93.76%,精度为91.40%,特异性98.73%,F1得分为78.34%。VGG16-BN模型实现了0.9787的最高AUC值,而基于VGG架构的所有模型总体上都优于其他模型。六位医生的准确度得分最高,精度,特异性,F1评分为74.00%,75.00%,88.33%,和61.76%,分别。在所有指标上,每个面部识别模型的性能都优于最好的医生。
    结论:计算机辅助面部识别模型可以提高NS的诊断率。基于VGG19-BN和VGG16-BN的模型可以在临床实践中诊断NS中起重要作用。
    BACKGROUND: Noonan syndrome (NS) is a rare genetic disease, and patients who suffer from it exhibit a facial morphology that is characterized by a high forehead, hypertelorism, ptosis, inner epicanthal folds, down-slanting palpebral fissures, a highly arched palate, a round nasal tip, and posteriorly rotated ears. Facial analysis technology has recently been applied to identify many genetic syndromes (GSs). However, few studies have investigated the identification of NS based on the facial features of the subjects.
    OBJECTIVE: This study develops advanced models to enhance the accuracy of diagnosis of NS.
    METHODS: A total of 1,892 people were enrolled in this study, including 233 patients with NS, 863 patients with other GSs, and 796 healthy children. We took one to 10 frontal photos of each subject to build a dataset, and then applied the multi-task convolutional neural network (MTCNN) for data pre-processing to generate standardized outputs with five crucial facial landmarks. The ImageNet dataset was used to pre-train the network so that it could capture generalizable features and minimize data wastage. We subsequently constructed seven models for facial identification based on the VGG16, VGG19, VGG16-BN, VGG19-BN, ResNet50, MobileNet-V2, and squeeze-and-excitation network (SENet) architectures. The identification performance of seven models was evaluated and compared with that of six physicians.
    RESULTS: All models exhibited a high accuracy, precision, and specificity in recognizing NS patients. The VGG19-BN model delivered the best overall performance, with an accuracy of 93.76%, precision of 91.40%, specificity of 98.73%, and F1 score of 78.34%. The VGG16-BN model achieved the highest AUC value of 0.9787, while all models based on VGG architectures were superior to the others on the whole. The highest scores of six physicians in terms of accuracy, precision, specificity, and the F1 score were 74.00%, 75.00%, 88.33%, and 61.76%, respectively. The performance of each model of facial recognition was superior to that of the best physician on all metrics.
    CONCLUSIONS: Models of computer-assisted facial recognition can improve the rate of diagnosis of NS. The models based on VGG19-BN and VGG16-BN can play an important role in diagnosing NS in clinical practice.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    面部反欺骗(FAS)旨在保护面部识别系统免受欺骗攻击,广泛应用于访问控制等场景,电子支付,和安全监控系统。人脸反欺骗需要整合局部细节和全局语义信息。现有的基于CNN的方法依赖于小步幅或基于图像块的特征提取结构,难以有效捕获空间和跨层特征相关性。同时,基于变压器的方法在提取区分性详细特征方面存在局限性。为了解决上述问题,我们介绍了一个基于多阶段CNN-Transformer的框架,它通过卷积层提取局部特征,并通过自注意提取长距离特征关系。基于此,我们提出了一种跨注意力多阶段特征融合,利用语义高阶段特征在低阶段特征中查询任务相关特征,进行进一步的跨阶段特征融合。为了加强对地方特征的区分,以进行细微的差异,我们设计了按像素的材料分类监督,并在模型的中间层中添加了一个辅助分支。此外,为了解决现有近红外数据集中单一采集环境和采集设备稀缺的局限性,我们创建了一个大规模的近红外人脸反欺骗数据集,包含380k张1040个身份的照片。所提出的方法可以在OULU-NPU和我们提出的近红外数据集上实现最先进的状态,只有1.3GFlops和3.2M参数数,验证了该方法的有效性。
    Face Anti-Spoofing (FAS) seeks to protect face recognition systems from spoofing attacks, which is applied extensively in scenarios such as access control, electronic payment, and security surveillance systems. Face anti-spoofing requires the integration of local details and global semantic information. Existing CNN-based methods rely on small stride or image patch-based feature extraction structures, which struggle to capture spatial and cross-layer feature correlations effectively. Meanwhile, Transformer-based methods have limitations in extracting discriminative detailed features. To address the aforementioned issues, we introduce a multi-stage CNN-Transformer-based framework, which extracts local features through the convolutional layer and long-distance feature relationships via self-attention. Based on this, we proposed a cross-attention multi-stage feature fusion, employing semantically high-stage features to query task-relevant features in low-stage features for further cross-stage feature fusion. To enhance the discrimination of local features for subtle differences, we design pixel-wise material classification supervision and add a auxiliary branch in the intermediate layers of the model. Moreover, to address the limitations of a single acquisition environment and scarcity of acquisition devices in the existing Near-Infrared dataset, we create a large-scale Near-Infrared Face Anti-Spoofing dataset with 380k pictures of 1040 identities. The proposed method could achieve the state-of-the-art in OULU-NPU and our proposed Near-Infrared dataset at just 1.3GFlops and 3.2M parameter numbers, which demonstrate the effective of the proposed method.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    人脸识别系统已经广泛应用于人们日常生活中的各种场景。人脸识别系统的识别率和速度一直是研究人员关注的两个关键技术因素。许多优秀的识别算法实现了高识别率或良好的识别速度。然而,需要更多的研究来开发能够有效平衡这两个指标的算法。在这项研究中,将改进的粒子群优化算法引入到基于图像特征补偿技术的人脸识别算法中。这使得系统在实现高识别率的同时提高了识别效率,旨在在这两个方面取得平衡。该方法为图像特征补偿技术在人脸识别系统中的应用提供了新的视角。它通过在保持令人满意的识别率的同时尽可能降低识别速度,有助于实现人脸识别技术更广泛的应用。最终,这导致改进的用户体验。
    Face recognition systems have been widely applied in various scenarios in people\'s daily lives. The recognition rate and speed of face recognition systems have always been the two key technical factors that researchers focus on. Many excellent recognition algorithms achieve high recognition rates or good recognition speeds. However, more research is needed to develop algorithms that can effectively balance these two indicators. In this study, we introduce an improved particle swarm optimization algorithm into a face recognition algorithm based on image feature compensation techniques. This allows the system to achieve high recognition rates while simultaneously enhancing the recognition efficiency, aiming to strike a balance between the two aspects. This approach provides a new perspective for the application of image feature compensation techniques in face recognition systems. It helps achieve a broader range of applications for face recognition technology by reducing the recognition speed as much as possible while maintaining a satisfactory recognition rate. Ultimately, this leads to an improved user experience.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    背景:由人工智能(AI)提供动力的年龄预测可以用作评估年轻化手术美容效果的客观技术。现有的年龄估计模型是在以高加索种族为主要参考的公共数据集上训练的,因此,它们对于中国患者的临床应用是不切实际的。
    方法:开发和选择适合中国患者接受复壮治疗的年龄估计模型,我们从作者医院的1821名患者中获得了10529张图像的人脸数据库,并选择了两种具有代表性的年龄估计算法进行模型训练.比较并分析了这两种面部年龄预测因子的预测精度和计算逻辑的可解释性。
    结果:传统支持向量机学习模型的平均绝对误差(MAE)为10.185年;绝对误差≤6年的比例为35.90%,≤12年的比例为68.50%。基于VGG-16框架的深度学习模型的MAE为3.011年;绝对误差≤6年的比例为90.20%,100%≤12年。与深度学习相比,传统的机器学习模型具有更清晰的计算逻辑,这使他们能够为临床医生提供更具体的治疗建议。
    结论:实验结果表明,深度学习在预测中国化妆品患者的年龄方面超过了传统的机器学习。尽管传统的机器学习模型比深度学习模型具有更好的可解释性。深度学习对临床定量评价更为准确。了解深度学习准确预测背后的决策逻辑对于更深入的临床应用至关重要。需要进一步探索。
    BACKGROUND: Age prediction powered by artificial intelligence (AI) can be used as an objective technique to assess the cosmetic effect of rejuvenation surgery. Existing age-estimation models are trained on public datasets with the Caucasian race as the main reference, thus they are impractical for clinical application in Chinese patients.
    METHODS: To develop and select an age-estimation model appropriate for Chinese patients receiving rejuvenation treatment, we obtained a face database of 10 529 images from 1821 patients from the author\'s hospital and selected two representative age-estimation algorithms for the model training. The prediction accuracies and the interpretability of calculation logic of these two facial age predictors were compared and analyzed.
    RESULTS: The mean absolute error (MAE) of a traditional support vector machine-learning model was 10.185 years; the proportion of absolute error ≤6 years was 35.90% and 68.50% ≤12 years. The MAE of a deep-learning model based on the VGG-16 framework was 3.011 years; the proportion of absolute error ≤6 years was 90.20% and 100% ≤12 years. Compared with deep learning, traditional machine-learning models have clearer computational logic, which allows them to give clinicians more specific treatment recommendations.
    CONCLUSIONS: Experimental results show that deep-learning exceeds traditional machine learning in the prediction of Chinese cosmetic patients\' age. Although traditional machine learning model has better interpretability than deep-learning model, deep-learning is more accurate for clinical quantitative evaluation. Knowing the decision-making logic behind the accurate prediction of deep-learning is crucial for deeper clinical application, and requires further exploration.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    面部近似(FA)提供了一种有希望的方法来生成死者的可能的面部外观。它有助于探索推动祖先人类解剖学变化的进化力量,并能引起公众的注意。尽管最近在改善FA方法的性能方面取得了进展,对面部骨骼和软组织形态学之间详细定量颅面关系的有限理解可能会阻碍其准确性,因此需要主观经验和艺术诠释。在这项研究中,我们利用几何形态计量学,基于平均面部软组织厚度深度(FSTDs)以及鼻部和口腔的硬组织和软组织之间的协变量,探索了人群之间的颅面关系.此外,我们提出了一种计算机化的方法来分配学习的颅面关系,以生成可能的智人的面部外观,减少人为干预。近似人脸与实际人脸之间的相似性比较较小(平均Procrustes距离为0.0258,平均欧几里得距离为1.79mm),并且通过人脸池测试的识别率更高(91.67%)表明平均密集的FSTD有助于提高近似人脸的准确性。偏最小二乘(PLS)分析结果表明,鼻腔和口腔硬组织分别对其软组织产生影响。然而,相对较弱的RV相关性(<0.4)和更大的近似误差表明,我们需要对来自骨结构的近似鼻部和口腔软组织形状的准确性保持谨慎.总的来说,所提出的方法可以促进颅面关系的调查,并有可能提高近似面部的可靠性,以便在法医学的许多应用中使用。考古学,和人类学。
    Facial approximation (FA) provides a promising means of generating the possible facial appearance of a deceased person. It facilitates exploration of the evolutionary forces driving anatomical changes in ancestral humans and can capture public attention. Despite the recent progress made toward improving the performance of FA methods, a limited understanding of detailed quantitative craniofacial relationships between facial bone and soft tissue morphology may hinder their accuracy, and hence subjective experience and artistic interpretation are required. In this study, we explored craniofacial relationships among human populations based upon average facial soft tissue thickness depths (FSTDs) and covariations between hard and soft tissues of the nose and mouth using geometric morphometrics. Furthermore, we proposed a computerized method to assign the learned craniofacial relationships to generate a probable facial appearance of Homo sapiens, reducing human intervention. A smaller resemblance comparison (an average Procrustes distance was 0.0258 and an average Euclidean distance was 1.79 mm) between approximated and actual faces and a greater recognition rate (91.67%) tested by a face pool indicated that average dense FSTDs contributed to raising the accuracy of approximated faces. Results of partial least squares (PLS) analysis showed that nasal and oral hard tissues have an effect on their soft tissues separately. However, relatively weaker RV correlations (<0.4) and greater approximation errors suggested that we need to be cautious about the accuracy of the approximated nose and mouth soft tissue shapes from bony structures. Overall, the proposed method can facilitate investigations of craniofacial relationships and potentially improve the reliability of the approximated faces for use in numerous applications in forensic science, archaeology, and anthropology.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

  • 文章类型: Journal Article
    目的:阻塞性睡眠呼吸暂停(OSA)的诊断依赖于耗时且复杂的程序,这些程序并不总是容易获得,并且可能会延迟诊断。随着人工智能的广泛使用,我们推测简单的临床信息和基于面部照片的影像识别相结合可能是筛查OSA的有用工具.
    方法:我们连续招募怀疑患有OSA的受试者,他们接受了睡眠检查和拍照。通过自动识别标记了二维面部照片中的68个点。建立了具有面部特征和基本临床信息的优化模型,并进行了十倍交叉验证。接收器工作特性曲线下面积(AUC)表示使用睡眠监测作为参考标准的模型性能。
    结果:共有653名受试者(77.2%为男性,55.3%OSA)进行了分析。CATBOOST是最适合OSA分类的算法,具有一定的敏感性,特异性,准确度,AUC分别为0.75、0.66、0.71和0.76(P<0.05),这比STOP-Bang问卷更好,NoSAS分数,和Epworth规模。睡眠伴侣目睹的呼吸暂停是最强大的变量,其次是体重指数,颈围,面部参数,和高血压。对于频繁仰卧睡眠呼吸暂停的患者,该模型的性能变得更加稳健,灵敏度为0.94。
    结论:研究结果表明,从二维正面照片中提取的颅面特征,尤其是在下颌段,有可能成为中国人群OSA的预测因子。机器学习衍生的自动识别可以促进OSA的自助筛查,无辐射,和可重复的方式。
    The diagnosis of obstructive sleep apnea (OSA) relies on time-consuming and complicated procedures which are not always readily available and may delay diagnosis. With the widespread use of artificial intelligence, we presumed that the combination of simple clinical information and imaging recognition based on facial photos may be a useful tool to screen for OSA.
    We recruited consecutive subjects suspected of OSA who had received sleep examination and photographing. Sixty-eight points from 2-dimensional facial photos were labelled by automated identification. An optimized model with facial features and basic clinical information was established and tenfold cross-validation was performed. Area under the receiver operating characteristic curve (AUC) indicated the model\'s performance using sleep monitoring as the reference standard.
    A total of 653 subjects (77.2% males, 55.3% OSA) were analyzed. CATBOOST was the most suitable algorithm for OSA classification with a sensitivity, specificity, accuracy, and AUC of 0.75, 0.66, 0.71, and 0.76 respectively (P < 0.05), which was better than STOP-Bang questionnaire, NoSAS scores, and Epworth scale. Witnessed apnea by sleep partner was the most powerful variable, followed by body mass index, neck circumference, facial parameters, and hypertension. The model\'s performance became more robust with a sensitivity of 0.94, for patients with frequent supine sleep apnea.
    The findings suggest that craniofacial features extracted from 2-dimensional frontal photos, especially in the mandibular segment, have the potential to become predictors of OSA in the Chinese population. Machine learning-derived automatic recognition may facilitate the self-help screening for OSA in a quick, radiation-free, and repeatable manner.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    深度假货的滥用,一种新兴的换脸技术,引起人们对视觉内容真实性和错误信息传播的严重担忧。为了减轻深假货带来的威胁,已经部署了大量以数据为中心的探测器。然而,这些方法的性能很容易因deepfakes的降级而缺陷。为了提高退化deepfake检测的性能,我们创造性地探索了在特征空间中的恢复方法,以保留用于检测的伪影,而不是直接在图像域中。在本文中,我们提出了一种方法,即DF-UDetector,通过对退化图像进行建模并将提取的特征转换为高质量水平,以防止退化深度伪造。具体而言,整个模型由三个关键部分组成:图像特征提取器,用于捕获图像特征,特征转换模块,用于将退化特征映射为更高质量,和鉴别器,以确定特征图是否具有足够的高质量。在多个视频数据集上进行的大量实验表明,我们提出的模型的性能与最先进的模型相比甚至更好。此外,在野外检测深度假货时,DF-UDetector的性能较小。
    The abuse of deepfakes, a rising face swap technique, causes severe concerns about the authenticity of visual content and the dissemination of misinformation. To alleviate the threats imposed by deepfakes, a vast body of data-centric detectors has been deployed. However, the performance of these methods can be easily defected by degradations on deepfakes. To improve the performance of degradation deepfake detection, we creatively explore the recovery method in the feature space to preserve the artifacts for detection instead of directly in the image domain. In this paper, we propose a method, namely DF-UDetector, against degradation deepfakes by modeling the degraded images and transforming the extracted features to a high-quality level. To be specific, the whole model consists of three key components: an image feature extractor to capture image features, a feature transforming module to map the degradation features into a higher quality, and a discriminator to determine whether the feature map is of high quality enough. Extensive experiments on multiple video datasets show that our proposed model performs comparably or even better than state-of-the-art counterparts. Moreover, DF-UDetector outperforms by a small margin when detecting deepfakes in the wild.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    本研究调查了中国公众对广泛采用(并经常被指控滥用)面部识别技术的看法。通过主题建模和对151,654条微博帖子的社交网络分析,我们研究了关于面部识别技术的公民讨论的“内容维度”和“演员维度”。我们的研究结果表明,在中国的网络空间中,对这种生物数据收集技术的商业使用的社会关注和怀疑正在上升。尽管该州被采纳,监督,面部识别技术的监管被广泛授予。此外,虽然我们的发现说明了在关于面部识别技术的公开辩论中公开和平等的程度,它们还表明中国政府在上述辩论中成为重要的“对话者”,工业界和学术界的话语参与在很大程度上被边缘化了。根据结果,我们建议,对中国科学公共领域形成的进一步调查应该放在中国中央计划数字经济愿景的更广泛背景下。
    The present study investigates the Chinese public\'s perception toward the widely adopted (and often accused of misuse) technology of face recognition. Through topic modeling and a social network analysis of 151,654 Weibo posts, we examine the \"content dimension\" and the \"actor dimension\" of civic discussions on facial recognition technology. Our results demonstrate that there is rising social concern and skepticism directed at the commercial use of this biodata-collected technology in China\'s cyberspace, despite the state\'s adoption, supervision, and regulation of facial recognition technology being broadly granted. Moreover, while our findings illustrate an extent of openness and equality within the public debates on facial recognition technology, they also show the Chinese government becoming an important \"interlocutor\" within the said debates, with discursive engagement from industry and academia largely marginalized. Drawing on the results, we suggest that further investigation into the formation of China\'s scientific public sphere should be located within the broader context of China\'s vision of a centrally planned digital economy.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

公众号