
1996scarlet / Dense Head Pose Estimation

License: MIT
[ECCV 2020] Reimplementation of 3DDFAv2, including face mesh, head pose, landmarks, and more.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives to or similar to Dense Head Pose Estimation

FacialLandmark TF
Facial landmarks training by tensorflow. Use for face alignment and so on.
Stars: ✭ 22 (-62.71%)
Mutual labels:  face-alignment
Peppa pig face engine
A simple face detect and alignment method, which is easy and stable.
Stars: ✭ 329 (+457.63%)
Mutual labels:  face-alignment
Face Alignment
🔥 2D and 3D Face alignment library build using pytorch
Stars: ✭ 5,417 (+9081.36%)
Mutual labels:  face-alignment
retinaface
RetinaFace: Deep Face Detection Library for Python
Stars: ✭ 242 (+310.17%)
Mutual labels:  face-alignment
3ddfa
The PyTorch improved version of TPAMI 2017 paper: Face Alignment in Full Pose Range: A 3D Total Solution.
Stars: ✭ 3,154 (+5245.76%)
Mutual labels:  face-alignment
Pytorch face landmark
Fast and accurate face landmark detection library using PyTorch; Support 68-point semi-frontal and 39-point profile landmark detection; Support both coordinate-based and heatmap-based inference; Up to 100 FPS landmark inference speed with SOTA face detector on CPU.
Stars: ✭ 388 (+557.63%)
Mutual labels:  face-alignment
DeepVTB
🌌 OpenVTuber (virtual idol sharing project): an application of real-time face and gaze analysis via deep neural networks.
Stars: ✭ 32 (-45.76%)
Mutual labels:  face-alignment
Lab
[CVPR 2018] Look at Boundary: A Boundary-Aware Face Alignment Algorithm
Stars: ✭ 956 (+1520.34%)
Mutual labels:  face-alignment
High Performance Face Recognition
🔥🔥Several High-Performance Models for Unconstrained/Large-Scale/Low-Shot Face Recognition🔥🔥
Stars: ✭ 309 (+423.73%)
Mutual labels:  face-alignment
Faceswap
3D face swapping implemented in Python
Stars: ✭ 508 (+761.02%)
Mutual labels:  face-alignment
iqiyi-vid-challenge
Code for IQIYI-VID(IQIYI Video Person Identification) Challenge Implemented in Python and MXNet
Stars: ✭ 45 (-23.73%)
Mutual labels:  face-alignment
Dest
🐼 One Millisecond Deformable Shape Tracking Library (DEST)
Stars: ✭ 276 (+367.8%)
Mutual labels:  face-alignment
Face Pose Net
Estimate 3D face pose (6DoF) or 11 parameters of 3x4 projection matrix by a Convolutional Neural Network
Stars: ✭ 464 (+686.44%)
Mutual labels:  face-alignment
Laser-Eye
Gaze Estimation via Deep Neural Networks
Stars: ✭ 79 (+33.9%)
Mutual labels:  face-alignment
Hrnet Facial Landmark Detection
This is an official implementation of facial landmark detection for our TPAMI paper "Deep High-Resolution Representation Learning for Visual Recognition". https://arxiv.org/abs/1908.07919
Stars: ✭ 647 (+996.61%)
Mutual labels:  face-alignment
enhanced-ssh-mxnet
The MXNet Implementation of Enhanced SSH (ESSH) for Face Detection and Alignment
Stars: ✭ 54 (-8.47%)
Mutual labels:  face-alignment
Openvtuber
A virtual idol sharing project: eye and facial landmark detection based on a monocular RGB camera, applied to real-time 3D facial capture and model driving.
Stars: ✭ 365 (+518.64%)
Mutual labels:  face-alignment
Facekit
Implementations of PCN (an accurate real-time rotation-invariant face detector) and other face-related algorithms
Stars: ✭ 1,028 (+1642.37%)
Mutual labels:  face-alignment
2d And 3d Face Alignment
This repository implements a demo of the networks described in "How far are we from solving the 2D & 3D Face Alignment problem? (and a dataset of 230,000 3D facial landmarks)" paper.
Stars: ✭ 772 (+1208.47%)
Mutual labels:  face-alignment
Deepalignmentnetwork
A deep neural network for face alignment
Stars: ✭ 480 (+713.56%)
Mutual labels:  face-alignment

Stable Head Pose Estimation and Landmark Regression via 3D Dense Face Reconstruction

FaceReconstructionDemo


Reimplementation of Towards Fast, Accurate and Stable 3D Dense Face Alignment (ECCV 2020) on the TensorFlow Lite framework, providing face mesh, head pose, landmarks, and more.

  • CPU real-time face detection, alignment, and reconstruction pipeline.
  • Lightweight render library, 5x faster (3ms vs 15ms) than the Sim3DR tools.
  • Camera matrix and dense/sparse landmarks prediction via a single network.
  • Generate facial parameters for robust head pose and expression estimation.

Setup

Basic Requirements

  • Python 3.6+
  • pip3 install -r requirements.txt

Render for Dense Face

  • GCC 6.0+
  • bash build_render.sh
  • (Caution) For Windows users, please refer to this tutorial for more details.

3D Facial Landmarks

In this project, we perform dense face reconstruction by regressing 3DMM parameters. The regression target is simplified to a camera matrix (C, shape 3x4), appearance parameters (S, shape 1x40), and expression variables (E, shape 1x10), 62 dimensions in total.

The sparse or dense facial landmarks can be estimated by applying these parameters to a predefined 3D model, such as BFM. More specifically, the following formula describes how to generate a face through parameters:

F(S, E) = U + W_S · S + W_E · E

where U and the basis matrices W_S and W_E come from the pre-defined face model; combining them linearly with the parameters generates the sparse or dense face. Finally, we need to integrate the posture information into the result:

V = R · F(S, E) + T

where R (shape 3x3) and T (shape 3x1) denote the rotation and translation matrices, respectively, which are extracted from the camera matrix C.
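
As a concrete illustration of this parameterization, here is a minimal numpy sketch of splitting a 62-dimensional prediction and applying it to a generic 3DMM basis. The tensor names (u_mean, w_shp, w_exp), their shapes, and the parameter ordering are assumptions made for illustration, not the project's actual API.

    import numpy as np

    def reconstruct_face(params, u_mean, w_shp, w_exp):
        # Assumed model tensors: u_mean (N*3,) mean face, w_shp (N*3, 40) appearance
        # basis, w_exp (N*3, 10) expression basis; params is the 62-dim prediction.
        C = params[:12].reshape(3, 4)      # camera matrix (3x4)
        S = params[12:52].reshape(40, 1)   # appearance parameters (40)
        E = params[52:62].reshape(10, 1)   # expression variables (10)

        # F = U + W_S * S + W_E * E: linear combination of the model bases.
        F = (u_mean.reshape(-1, 1) + w_shp @ S + w_exp @ E).reshape(-1, 3)

        # V = R * F + T: integrate the posture information from the camera matrix.
        R, T = C[:, :3], C[:, 3:]
        return F @ R.T + T.T               # posed vertices, shape (N, 3)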

Sparse

sparse demo

Since we have reconstructed the entire face, 3D face alignment can be achieved by selecting the landmarks at the corresponding positions. See [TPAMI 2017] Face Alignment in Full Pose Range: A 3D Total Solution for more details.
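
For intuition, picking a sparse landmark set out of the reconstructed vertices is a simple index lookup; keypoint_indices below is a hypothetical placeholder for the 68-point index list that ships with the pre-defined face model.

    # V: posed dense vertices of shape (N, 3), e.g. from reconstruct_face() above.
    # keypoint_indices: hypothetical (68,) integer array mapping the 68-point
    # annotation scheme onto the dense mesh (the real indices come with the model).
    def sparse_landmarks(V, keypoint_indices):
        return V[keypoint_indices]         # (68, 3) 3D facial landmarks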

Compared with first detecting 2D landmarks and then estimating depth, directly fitting the 3DMM to solve 3D face alignment not only yields more accurate results for large poses but also has a clear speed advantage.

We provide a demonstration script that can generate 68 landmarks based on the reconstructed face. Run the following command to view the real-time 3D face alignment results:

python3 demo_video.py -m sparse -f <your-video-path>

Dense

dense demo

Currently, our method supports up to 38,365 landmarks. We draw every 6th landmark for a clearer illustration. Run the demonstration script in dense mode for real-time dense facial landmark localization:

python3 demo_video.py -m dense -f <your-video-path>

Face Reconstruction

Our network is multi-task: it directly regresses 3DMM params from a single face image for reconstruction, and it also estimates the head pose by predicting R and T. During training, the predicted R supervises the params regression branch to generate a refined face mesh. Meanwhile, the landmarks calculated from the params are passed through cv2.solvePnP and provided as labels to the pose estimation branch.

Theoretically speaking, the head pose estimation task in our model is weakly supervised, since only a small amount of labeled data is required to bootstrap the training process. In detail, the loss function at the initial stage can be described as follows:

Init Pose Loss

After the initialization stage, the ground truth can be replaced by the predictions of the other branches. The loss function therefore transforms into the following equation in the later stage of training:

Pose Loss

In general, dynamically fitting the 3D model during training avoids the inaccurate head pose estimates that a coarse predefined model would otherwise cause.
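
A minimal sketch of how such pose pseudo-labels could be generated with OpenCV, assuming the 3D landmarks come from the reconstruction branch and the 2D points are their image projections; the pinhole intrinsics below are a rough assumption, not calibrated values.

    import cv2
    import numpy as np

    def pose_pseudo_label(landmarks_3d, landmarks_2d, image_size):
        # Rough pinhole intrinsics: focal length ~ image width, principal point at center.
        w, h = image_size
        K = np.array([[w, 0, w / 2],
                      [0, w, h / 2],
                      [0, 0, 1]], dtype=np.float64)
        dist = np.zeros(4)                          # assume no lens distortion

        # Fit the rotation/translation that project the 3D landmarks onto the 2D ones.
        ok, rvec, tvec = cv2.solvePnP(landmarks_3d.astype(np.float64),
                                      landmarks_2d.astype(np.float64), K, dist)
        R, _ = cv2.Rodrigues(rvec)                  # 3x3 rotation used as a pseudo-label
        return R, tvec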

Head Pose

pose demo

Traditional head pose estimation approaches, such as Appearance Template Models, Detector Arrays, and Manifold Embedding, have been extensively studied. However, it was not until recent years that deep-learning-based methods improved the prediction accuracy enough to meet practical needs.

Given a set of predefined 3D facial landmarks and the corresponding 2D image projections, the SolvePnP tool can be used to calculate the rotation matrix. However, the mean 3D face model it relies on usually introduces an intrinsic error during fitting. Meanwhile, the additional landmark extraction component is also cumbersome.

Therefore, we designed a network branch that directly regresses the 6DoF parameters from the face image. The predictions include the 3DoF rotation R (pitch, yaw, roll) and the 3DoF translation T (x, y, z). Compared with the landmark-based method, directly regressing the camera matrix is more robust and stable, and it significantly reduces the network training cost. Run the demonstration script in pose mode to view the real-time head pose estimation results:

python3 demo_video.py -m pose -f <your-video-path>
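
If pitch/yaw/roll angles are needed instead of the raw 3x3 matrix, one common ZYX Euler decomposition looks like the sketch below; the axis convention (and therefore which angle is labeled pitch, yaw, or roll) is an assumption and may differ from the one used in this repository.

    import numpy as np

    def rotation_to_euler(R):
        # Decompose a 3x3 rotation matrix assuming R = Rz(yaw) @ Ry(pitch) @ Rx(roll).
        sy = np.hypot(R[0, 0], R[1, 0])
        if sy > 1e-6:
            roll = np.arctan2(R[2, 1], R[2, 2])
            pitch = np.arctan2(-R[2, 0], sy)
            yaw = np.arctan2(R[1, 0], R[0, 0])
        else:                                       # gimbal lock: pitch near +/-90 degrees
            roll = np.arctan2(-R[1, 2], R[1, 1])
            pitch = np.arctan2(-R[2, 0], sy)
            yaw = 0.0
        return np.degrees([pitch, yaw, roll])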

Expression

Expression

Coarse expression estimation can be achieved by linearly combining the predefined expression bases in BFM. In our model, regressing E is one of the tasks of the params prediction branch. Naturally, the accuracy of the linear combination grows with its dimension.

Clipping the parameters accelerates training, but it also reduces reconstruction accuracy, especially for details such as the eyes and mouth. More specifically, E has a greater impact on facial details than S when emotion is involved. We therefore choose 10 dimensions as a tradeoff between speed and accuracy; the training data can be found here for refinement.
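
As a toy example of this linearity, two expression parameter vectors can simply be blended and the mesh regenerated; the 52:62 slice follows the illustrative parameter split sketched earlier, and reconstruct_face() is that same hypothetical helper, not part of the project's API.

    import numpy as np

    def blend_expressions(params_a, params_b, t):
        # Interpolate only the 10 expression variables E; keep camera C and
        # appearance S from params_a (indices follow the illustrative split above).
        out = np.array(params_a, dtype=np.float64, copy=True)
        out[52:62] = (1.0 - t) * params_a[52:62] + t * params_b[52:62]
        return out

    # e.g. vertices = reconstruct_face(blend_expressions(p0, p1, 0.5), u_mean, w_shp, w_exp)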

In addition, we provide a simple facial expression rendering script. Run the following command for illustration:

python3 demo_image.py <your-image-path>

Mesh

mesh demo

Given the predefined BFM and the predicted 3DMM parameters, the dense 3D facial landmarks can be easily calculated. On this basis, through the index mapping between the morphable triangle vertices and the dense landmarks defined in BFM, the renderer can plot these geometries with depth information for a mesh preview. Run the demonstration script in mesh mode for real-time face reconstruction:

python3 demo_video.py -m mesh -f <your-video-path>
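
To make the depth-aware plotting step concrete, here is a minimal numpy sketch that gathers triangles from the dense vertices and orders them back-to-front, painter's-algorithm style; the repository's actual renderer is a compiled library and is implemented differently.

    import numpy as np

    def sort_triangles_by_depth(vertices, triangles):
        # vertices:  (N, 3) posed dense vertices
        # triangles: (M, 3) integer indices into `vertices`, from the BFM topology
        tri_verts = vertices[triangles]             # (M, 3, 3) per-triangle corners
        depth = tri_verts[:, :, 2].mean(axis=1)     # mean z as a depth proxy
        return triangles[np.argsort(depth)]         # draw far-to-near for correct overlap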

Benchmark

Our network can directly output the camera matrix and sparse or dense landmarks. Compared with the model in the original paper with the same backbone, the additional parameters yielded by the pose regression branch do not significantly affect the inference speed, so the model remains CPU real-time.

Scheme       THREAD=1    THREAD=2    THREAD=4
Inference    7.79 ms     6.88 ms     5.83 ms

In addition, since most of the operations are wrapped inside the model, the time consumption of pre-processing and post-processing is significantly reduced. Meanwhile, the optimized lightweight renderer is 5x faster (3 ms vs 15 ms) than the Sim3DR tools. These measures reduce the latency of the entire pipeline.

Stage             Preprocess    Inference    Postprocess    Render
Each face cost    0.23 ms       7.79 ms      0.39 ms        3.92 ms
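
To reproduce this kind of per-stage breakdown on your own hardware, a minimal timing harness could look like the sketch below; preprocess and frame are hypothetical stand-ins for a pipeline stage and its input, not names exported by this repository.

    import time

    def time_stage(fn, *args, repeats=100):
        # Average wall-clock time of one pipeline stage, in milliseconds.
        start = time.perf_counter()
        for _ in range(repeats):
            out = fn(*args)
        elapsed_ms = (time.perf_counter() - start) * 1000.0 / repeats
        return out, elapsed_ms

    # e.g. _, ms = time_stage(preprocess, frame); print(f"Preprocess: {ms:.2f} ms")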

Run the following command for speed benchmark:

python3 video_speed_benchmark.py <your-video-path>

Citation

@inproceedings{guo2020towards,
    title={Towards Fast, Accurate and Stable 3D Dense Face Alignment},
    author={Guo, Jianzhu and Zhu, Xiangyu and Yang, Yang and Yang, Fan and Lei, Zhen and Li, Stan Z},
    booktitle={Proceedings of the European Conference on Computer Vision (ECCV)},
    year={2020}
}