Landmark-based

  • 文章类型: Journal Article
    头部姿态估计是计算机视觉中的重要任务之一,预测图像中头部的欧拉角。近年来,用于头部姿态估计的基于CNN的方法已经取得了优异的性能。他们的训练依赖于RGB图像,提供来自RGBD相机的面部标志或深度图像。然而,标记面部标志对于RGB图像中的大角度头部姿势是复杂的,和RGBD摄像机不适合户外场景。针对RGB图像中的头部姿态,提出了一种简单有效的标注方法。新颖性方法使用3D虚拟人头部来模拟RGB图像中的头部姿势。欧拉角可以根据3D虚拟头部的坐标变化来计算。然后,我们使用我们的注释方法创建一个数据集:2DHeadPose数据集,其中包含一组丰富的属性,尺寸,和角度。最后,我们提出高斯标签平滑来抑制注释噪声并反映类间关系。使用高斯标签平滑建立基线方法。实验证明,我们的标注方法,数据集,和高斯标签平滑非常有效。我们的基线方法超越了目前最先进的方法。注释工具,数据集,和源代码公开在https://github.com/youngnuaa/2DHeadPose。
    Head pose estimation is one of the essential tasks in computer vision, which predicts the Euler angles of the head in an image. In recent years, CNN-based methods for head pose estimation have achieved excellent performance. Their training relies on RGB images providing facial landmarks or depth images from RGBD cameras. However, labeling facial landmarks is complex for large angular head poses in RGB images, and RGBD cameras are unsuitable for outdoor scenes. We propose a simple and effective annotation method for the head pose in RGB images. The novelty method uses a 3D virtual human head to simulate the head pose in the RGB image. The Euler angle can be calculated from the change in coordinates of the 3D virtual head. We then create a dataset using our annotation method: 2DHeadPose dataset, which contains a rich set of attributes, dimensions, and angles. Finally, we propose Gaussian label smoothing to suppress annotation noises and reflect inter-class relationships. A baseline approach is established using Gaussian label smoothing. Experiments demonstrate that our annotation method, datasets, and Gaussian label smoothing are very effective. Our baseline approach surpasses most current state-of-the-art methods. The annotation tool, dataset, and source code are publicly available at https://github.com/youngnuaa/2DHeadPose.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

    求助全文

  • 文章类型: Journal Article
    本文提出了一种基于能量最小化和非线性弹性正则化的图像配准数值算法。显示了将基因表达数据配准到二维神经解剖小鼠图谱的应用。我们应用非线性弹性正则化以允许更大,更平滑的变形,并进一步对地标点距离实施最优性约束,以实现更好的特征匹配。为了克服由于位移矢量场的导数中的非线性而导致的非线性弹性泛函最小化的困难,我们引入一个矩阵变量来逼近雅可比矩阵,并求解简化的欧拉-拉格朗日方程。通过与使用线性正则化的图像配准的比较,实验结果表明,所提出的非线性弹性模型还需要较少的数值校正,例如二值图像配准的回归步骤,它呈现更好的地面真相,并产生更大的互信息;最重要的是,与具有双调和正则化的配准模型相比,基因表达数据与相应的小鼠图谱之间的界标点距离和L2差异度量较小。
    This paper proposes a numerical algorithm for image registration using energy minimization and nonlinear elasticity regularization. Application to the registration of gene expression data to a neuroanatomical mouse atlas in two dimensions is shown. We apply a nonlinear elasticity regularization to allow larger and smoother deformations, and further enforce optimality constraints on the landmark points distance for better feature matching. To overcome the difficulty of minimizing the nonlinear elasticity functional due to the nonlinearity in the derivatives of the displacement vector field, we introduce a matrix variable to approximate the Jacobian matrix and solve for the simplified Euler-Lagrange equations. By comparison with image registration using linear regularization, experimental results show that the proposed nonlinear elasticity model also needs fewer numerical corrections such as regridding steps for binary image registration, it renders better ground truth, and produces larger mutual information; most importantly, the landmark points distance and L2 dissimilarity measure between the gene expression data and corresponding mouse atlas are smaller compared with the registration model with biharmonic regularization.
    导出

    更多引用

    收藏

    翻译标题摘要

    我要上传

       PDF(Pubmed)

公众号