Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → anapt → 3D-facial-reconstruction

anapt / 3D-facial-reconstruction

Licence: GPL-3.0 License

3D facial reconstruction, expression recognition and transfer from monocular RGB images with a deep convolutional auto-encoding neural network

Programming Languages

139335 projects - #7 most used programming language

Labels

neural-network expression-recognition 3d-reconstruction rgb-images auto-encoding

Projects that are alternatives of or similar to 3D-facial-reconstruction

SkeletonBridgeRecon

The code for CVPR2019 Oral paper "A Skeleton-bridged Deep Learning Approach for Generating Meshes of Complex Topologies from Single RGB Images"

Stars: ✭ 72 (+453.85%)

Mutual labels: 3d-reconstruction

CalibrationWizard

[ICCV'19] Calibration Wizard: A Guidance System for Camera Calibration Based on Modelling Geometric and Corner Uncertainty

Stars: ✭ 80 (+515.38%)

Mutual labels: 3d-reconstruction

J3DReconstruction

Windows下基于openMVG+openMVS的三维重建解决方案以及基于Qt的可视化桌面平台

Stars: ✭ 59 (+353.85%)

Mutual labels: 3d-reconstruction

Vacancy: A Voxel Carving implementation in C++

Stars: ✭ 40 (+207.69%)

Mutual labels: 3d-reconstruction

Stanford University CS231A: Computer Vision, From 3D Reconstruction to Recognition HomeWork Answer

Stars: ✭ 27 (+107.69%)

Mutual labels: 3d-reconstruction

JetScan : GPU accelerated portable RGB-D reconstruction system

Stars: ✭ 77 (+492.31%)

Mutual labels: 3d-reconstruction

Demonstration of different algorithms and operations on faces. Join the Discord channel for discussion.

Stars: ✭ 226 (+1638.46%)

Mutual labels: 3d-reconstruction

Code for "NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video", CVPR 2021 oral

Stars: ✭ 812 (+6146.15%)

Mutual labels: 3d-reconstruction

Code for "Learning Object-Compositional Neural Radiance Field for Editable Scene Rendering", ICCV 2021

Stars: ✭ 135 (+938.46%)

Mutual labels: 3d-reconstruction

Computer-Vision

Cool Vision projects

Stars: ✭ 51 (+292.31%)

Mutual labels: 3d-reconstruction

SLAM - Simultaneous localization and mapping using OpenCV and NumPy.

Stars: ✭ 80 (+515.38%)

Mutual labels: 3d-reconstruction

Structured-Light-Depth-Acquisition

Matlab Implementation of a 3D Reconstruction algorithm

Stars: ✭ 48 (+269.23%)

Mutual labels: 3d-reconstruction

Code for CVPR 2018 paper --- Texture Mapping for 3D Reconstruction with RGB-D Sensor

Stars: ✭ 104 (+700%)

Mutual labels: 3d-reconstruction

Computer-Vision-and-Photogrammetry

Course at University of Wroclaw - full 3D reconstruction from images pipeline

Stars: ✭ 35 (+169.23%)

Mutual labels: 3d-reconstruction

learning-topology-synthetic-data

Tensorflow implementation of Learning Topology from Synthetic Data for Unsupervised Depth Completion (RAL 2021 & ICRA 2021)

Stars: ✭ 22 (+69.23%)

Mutual labels: 3d-reconstruction

Code for 3D Reconstruction of Novel Object Shapes from Single Images paper

Stars: ✭ 92 (+607.69%)

Mutual labels: 3d-reconstruction

[ISPRS Journal of Photogrammetry and Remote Sensing, 2022] ResDepth: A Deep Residual Prior For 3D Reconstruction From High-resolution Satellite Images

Stars: ✭ 30 (+130.77%)

Mutual labels: 3d-reconstruction

Silhouette-Guided-3D

PyTorch Code of our WACV2020 paper: Silhouette Guided Point Cloud Reconstruction beyond Occlusion

Stars: ✭ 36 (+176.92%)

Mutual labels: 3d-reconstruction

adareg-monodispnet

Repository for Bilateral Cyclic Constraint and Adaptive Regularization for Unsupervised Monocular Depth Prediction (CVPR2019)

Stars: ✭ 22 (+69.23%)

Mutual labels: 3d-reconstruction

handobjectconsist

[cvpr 20] Demo, training and evaluation code for joint hand-object pose estimation in sparsely annotated videos

Stars: ✭ 100 (+669.23%)

Mutual labels: 3d-reconstruction

View All Similar Projects ➔

Thesis

3D facial reconstruction, expression recognition and transfer from monocular RGB images with a deep convolutional auto-encoding neural network

Abstract

The present work implements an automatic system for coding and reconstructing 3D faces from low resolution RGB images by utilizing machine learning algorithms. Given a 3D morphable model, different faces are represented as a vector of variables ("code vector") which describe the shape, expression and color of the face. The multiplication of these parameter vectors with the PCA bases provided by the morphable model results in the 3D coordinates of the reconstructed face. As part of this work, an algorithm for the creation of two-dimensional synthetic faces solely from the information captured by the code vector was developed. The synthetic faces were used to train the neural network that acts as the encoding phase of the auto-encoding system and bootstrapping techniques were used to generalize the network to real-world facial images.

The outcome of this work is not only proof of the potential of 3D facial reconstruction from RGB images, but also the ability to exploit the 3D face by changing its expression, color or lighting. In the context of said exploitation, a neural network was implemented to identify the facial expression from the information encoded in the code vector. The problem tackled by the present work has until now been solved by the use of iterative algorithms based on the linear combination of existing prototype samples, which require a large amount of data from three-dimensional scans. Here, an attempt is made to solve this problem purely with machine learning and synthetic data.

Results

Autoencoding network for 3D reconstruction

Below are the results of the 3D reconstruction auto-encoding network:

A comparison between the results of Xception, ResNet50 and InceptionV3 architectures. Image (i.) shows the original face and images (ii.) - (iv.) depict the reconstructions with Xception, ResNet50 and InceptionV3, respectively.

A comparison between the reconstructions of ResNet50 at 4 different stages of the training. Image (i.) shows the original face, image (ii.) shows the initial reconstruction by a ResNet50 encoder and images (iii.) - (iv.) show the reconstructions after each bootstrapping iteration.

Facial expression recognition network

Below are the results of the 2-hidden-layer expression classification network.

The network can classify between 7 different expressions, namely anger, disgust, fear, happiness, neutral, sadness and surprise. The images below depict the base expressions that were used as a reference when creating the synthetic dataset.
The matrix (CM) shows the accuracy of the network on 700 real faces from The MUG Facial Expression Database. The network was trained on synthetic data.

Dependencies

pip install -r requirements.txt

Data Dependencies

The following files have to be downloaded and placed on the ./DATASET directory

The Basel Face Model : a simple viewer of the model is available here
dlib's Landmark Predictor : shape_predictor_68_face_landmarks.dat
OpenCV's Frontal Face Detector : haarcascade_frontalface_default.xml

For validation on real faces:

The MUG Facial Expression Database

How images are pre-processed for reconstructions

Bugs & Request

Please report bugs and request features using the Issue Tracker.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 13

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (0) 🔗