Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

vitoralbiero / Img2pose

Licence: other

The official PyTorch implementation of img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation - CVPR 2021

Programming Languages

python

139335 projects - #7 most used programming language

Labels

face-detection face-alignment

Projects that are alternatives of or similar to Img2pose

Facepapercollection

A collection of face related papers

Stars: ✭ 241 (-2.43%)

Mutual labels: face-detection, face-alignment

Facekit

Implementations of PCN (an accurate real-time rotation-invariant face detector) and other face-related algorithms

Stars: ✭ 1,028 (+316.19%)

Mutual labels: face-detection, face-alignment

iqiyi-vid-challenge

Code for IQIYI-VID(IQIYI Video Person Identification) Challenge Implemented in Python and MXNet

Stars: ✭ 45 (-81.78%)

Mutual labels: face-detection, face-alignment

DeepVTB

🌌 OpenVTuber-虚拟アイドル共享计划 An application of real-time face and gaze analyzation via deep nerual networks.

Stars: ✭ 32 (-87.04%)

Mutual labels: face-detection, face-alignment

Tenginekit

TengineKit - Free, Fast, Easy, Real-Time Face Detection & Face Landmarks & Face Attributes & Hand Detection & Hand Landmarks & Body Detection & Body Landmarks & Iris Landmarks & Yolov5 SDK On Mobile.

Stars: ✭ 2,103 (+751.42%)

Mutual labels: face-detection, face-alignment

enhanced-ssh-mxnet

The MXNet Implementation of Enhanced SSH (ESSH) for Face Detection and Alignment

Stars: ✭ 54 (-78.14%)

Mutual labels: face-detection, face-alignment

Face Alignment

🔥 2D and 3D Face alignment library build using pytorch

Stars: ✭ 5,417 (+2093.12%)

Mutual labels: face-alignment, face-detection

Openvtuber

虚拟爱抖露(アイドル)共享计划, 是基于单目RGB摄像头的人眼与人脸特征点检测算法, 在实时3D面部捕捉以及模型驱动领域的应用.

Stars: ✭ 365 (+47.77%)

Mutual labels: face-detection, face-alignment

Pyseeta

python api for SeetaFaceEngine(https://github.com/seetaface/SeetaFaceEngine.git)

Stars: ✭ 93 (-62.35%)

Mutual labels: face-detection, face-alignment

Insightface

State-of-the-art 2D and 3D Face Analysis Project

Stars: ✭ 10,886 (+4307.29%)

Mutual labels: face-detection, face-alignment

Face-alignment-Trees

This is the C++ implement of the paper: Face Detection, Pose Estimation, and Landmark Localization in the Wild

Stars: ✭ 17 (-93.12%)

Mutual labels: face-detection, face-alignment

Face.evolve.pytorch

🔥🔥High-Performance Face Recognition Library on PaddlePaddle & PyTorch🔥🔥

Stars: ✭ 2,719 (+1000.81%)

Mutual labels: face-detection, face-alignment

Face

I have published my face related codes in this repository

Stars: ✭ 53 (-78.54%)

Mutual labels: face-detection, face-alignment

retinaface

RetinaFace: Deep Face Detection Library for Python

Stars: ✭ 242 (-2.02%)

Mutual labels: face-detection, face-alignment

Awesome Face recognition

papers about Face Detection; Face Alignment; Face Recognition && Face Identification && Face Verification && Face Representation; Face Reconstruction; Face Tracking; Face Super-Resolution && Face Deblurring; Face Generation && Face Synthesis; Face Transfer; Face Anti-Spoofing; Face Retrieval;

Stars: ✭ 3,220 (+1203.64%)

Mutual labels: face-detection, face-alignment

Retinadetector

基于RetinaFace的目标检测方法，适用于人脸、缺陷、小目标、行人等

Stars: ✭ 73 (-70.45%)

Mutual labels: face-detection, face-alignment

Deep Face Recognition

One-shot Learning and deep face recognition notebooks and workshop materials

Stars: ✭ 147 (-40.49%)

Mutual labels: face-detection, face-alignment

Face Dataset

Face related datasets

Stars: ✭ 204 (-17.41%)

Mutual labels: face-detection, face-alignment

Face toolbox keras

A collection of deep learning frameworks ported to Keras for face analysis.

Stars: ✭ 202 (-18.22%)

Mutual labels: face-detection

Wear A Mask

😷 An SPA that uses only the front-end to perform deep-learning-based facial landmark detection on images and automatically adds breathing mask stickers.

Stars: ✭ 226 (-8.5%)

Mutual labels: face-detection

View All Similar Projects ➔

img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation

Paper accepted to the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021

Figure 1: We estimate the 6DoF rigid transformation of a 3D face (rendered in silver), aligning it with even the tiniest faces, without face detection or facial landmark localization. Our estimated 3D face locations are rendered by descending distances from the camera, for coherent visualization.

TL;DR

This repository provides a novel method for six degrees of fredoom (6DoF) detection on multiple faces without the need of prior face detection. After prediction, one can visualize the detections (as show in the figure above), customize projected bounding boxes, or crop and align each face for further processing. See details below.

Paper details
- Abstract
- Citation
Installation
Training
Testing
Output customization
Align faces
Resources
License

Paper details

Vítor Albiero, Xingyu Chen, Xi Yin, Guan Pang, Tal Hassner, "img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation," CVPR, 2021, arXiv:2012.07791

Abstract

We propose real-time, six degrees of freedom (6DoF), 3D face pose estimation without face detection or landmark localization. We observe that estimating the 6DoF rigid transformation of a face is a simpler problem than facial landmark detection, often used for 3D face alignment. In addition, 6DoF offers more information than face bounding box labels. We leverage these observations to make multiple contributions: (a) We describe an easily trained, efficient, Faster R-CNN--based model which regresses 6DoF pose for all faces in the photo, without preliminary face detection. (b) We explain how pose is converted and kept consistent between the input photo and arbitrary crops created while training and evaluating our model. (c) Finally, we show how face poses can replace detection bounding box training labels. Tests on AFLW2000-3D and BIWI show that our method runs at real-time and outperforms state of the art (SotA) face pose estimators. Remarkably, our method also surpasses SotA models of comparable complexity on the WIDER FACE detection benchmark, despite not been optimized on bounding box labels.

Citation

If you use any part of our code or data, please cite our paper.

@inproceedings{albiero2021img2pose,
  title={img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation},
  author={Albiero, Vítor and Chen, Xingyu and Yin, Xi and Pang, Guan and Hassner, Tal},
  booktitle={CVPR},
  year={2021},
  url={https://arxiv.org/abs/2012.07791},
}

Installation

Install dependecies with Python 3.

pip install -r requirements.txt

Install the renderer, which is used to visualize predictions. The renderer implementation is forked from here.

cd Sim3DR
sh build_sim3dr.sh

Training

Prepare WIDER FACE dataset

First, download our annotations as instructed in Annotations.

Download WIDER FACE dataset and extract to datasets/WIDER_Face.

Then, to create the train and validation files (LMDB), run the following scripts.

python3 convert_json_list_to_lmdb.py
--json_list ./annotations/WIDER_train_annotations.txt
--dataset_path ./datasets/WIDER_Face/WIDER_train/images/
--dest ./datasets/lmdb/
-—train

This first script will generate a LMDB dataset, which contains the training images along with annotations. It will also output a pose mean and std deviation files, which will be used for training and testing.

python3 convert_json_list_to_lmdb.py 
--json_list ./annotations/WIDER_val_annotations.txt 
--dataset_path ./datasets/WIDER_Face/WIDER_val/images/ 
--dest ./datasets/lmdb

This second script will create a LMDB containing the validation images along with annotations.

Train

Once the LMDB train/val files are created, to start training simple run the script below.

CUDA_VISIBLE_DEVICES=0 python3 train.py
--pose_mean ./datasets/lmdb/WIDER_train_annotations_pose_mean.npy
--pose_stddev ./datasets/lmdb/WIDER_train_annotations_pose_stddev.npy
--workspace ./workspace/
--train_source ./datasets/lmdb/WIDER_train_annotations.lmdb
--val_source ./datasets/lmdb/WIDER_val_annotations.lmdb
--prefix trial_1
--batch_size 2
--lr_plateau
--early_stop
--random_flip
--random_crop
--max_size 1400

For now, only single GPU training is tested. Distributed training is partially implemented, PRs welcome.

Training on your own dataset

If your dataset has facial landmarks and bounding boxes already annotated, store them into JSON files following the same format as in the WIDER FACE annotations.

If not, run the script below to annotate your dataset. You will need a detector and import it inside the script.

python3 utils/annotate_dataset.py 
--image_list list_of_images.txt 
--output_path ./annotations/dataset_name

After the dataset is annotated, create a list pointing to the JSON files there were saved. Then, follow the steps in Prepare WIDER FACE dataset replacing the WIDER annotations with your own dataset annotations. Once the LMDB and pose files are created, follow the steps in Train replacing the WIDER LMDB and pose files with your dataset own files.

Testing

To evaluate with the pretrained model, download the model from Model Zoo, and extract it to the main folder. It will create a folder called models, which contains the model weights and the pose mean and std dev that was used for training.

If evaluating with own trained model, change the pose mean and standard deviation to the ones trained with.

Visualizing trained model

To visualize a trained model on the WIDER FACE validation set run the notebook visualize_trained_model_predictions.

WIDER FACE dataset evaluation

If you haven't done already, download the WIDER FACE dataset and extract to datasets/WIDER_Face.

python3 evaluation/evaluate_wider.py 
--dataset_path datasets/WIDER_Face/WIDER_val/images/
--dataset_list datasets/WIDER_Face/wider_face_split/wider_face_val_bbx_gt.txt
--pretrained_path models/img2pose_v1.pth
--output_path results/WIDER_FACE/Val/

To check mAP and plot curves, download the eval tools and point to results/WIDER_FACE/Val.

AFLW2000-3D dataset evaluation

Download the AFLW2000-3D dataset and unzip to datasets/AFLW2000.

Run the notebook aflw_2000_3d_evaluation.

BIWI dataset evaluation

Download the BIWI dataset and unzip to datasets/BIWI.

Run the notebook biwi_evaluation.

Testing on your own images

Run the notebook test_own_images.

Output customization

For every face detected, the model outputs by default:

Pose: r_x, r_y, r_z, t_x, t_y, t_z
Projected bounding boxes: left, top, right, bottom
Face scores: 0 to 1

Since the projected bounding box without expansion ends at the start of the forehead, we provide a way of expanding the forehead invidually, along with default x and y expansion.

To customize the size of the projected bounding boxes, when creating the model change any of the bounding box expansion variables as shown below (a complete example can be seen at visualize_trained_model_predictions).

# how much to expand in width
bbox_x_factor = 1.1
# how much to expand in height
bbox_y_factor = 1.1
# how much to expand in the forehead
expand_forehead = 0.3

img2pose_model = img2poseModel(
    ...,    
    bbox_x_factor=bbox_x_factor,
    bbox_y_factor=bbox_y_factor,
    expand_forehead=expand_forehead,
)

Align faces

To align the detected faces, call the function bellow passing the reference points, the image with the faces to align, and the poses outputted by img2pose. The function will return a list with PIL images containing one aligned face per give pose.

from utils.pose_operations import align_faces

# load reference points
threed_points = np.load("pose_references/reference_3d_5_points_trans.npy")

aligned_faces = align_faces(threed_points, img, poses)

Resources

Model Zoo

Annotations

Data Zoo

License

Check license for license details.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 247

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (1) 🔗

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

vitoralbiero / Img2pose

Programming Languages

Labels

Projects that are alternatives of or similar to Img2pose

img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation

Paper accepted to the IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2021

TL;DR

Table of contents

Paper details

Abstract

Citation

Installation

Training

Prepare WIDER FACE dataset

Train

Training on your own dataset

Testing

Visualizing trained model

WIDER FACE dataset evaluation

AFLW2000-3D dataset evaluation

BIWI dataset evaluation

Testing on your own images

Output customization

Align faces

Resources

License