
raymon-tian / Hourglass Facekeypoints Detection

face keypoint detection based on Stacked Hourglass


Face Keypoint Detection Based on Hourglass

Related Information

Scores and Demo

  • Kaggle username: raymon

  • Score screenshots

    Kaggle score

    partial scores

  • Demo

    demo images

Related Work

Face keypoint detection is a subtask of human joint (pose) detection, so the large body of existing work on human pose estimation can be applied to it. Existing pose-estimation methods fall into two categories according to the number of people in the image:

1) Single-person skeleton extraction. The representative work is Stacked Hourglass. It first builds an hourglass module, which extracts feature maps at different scales in a recursive fashion: small-scale feature maps are upsampled to the resolution of the larger ones, fused with them by addition, and fed into the next stage of the recursion. To further improve accuracy, several hourglass modules are then stacked with intermediate supervision, yielding the Stacked Hourglass network, and a loss is applied to the output of each hourglass.

2) Multi-person skeleton extraction, which roughly splits into two families. a) Detection-based: deploy a human detector, then run single-person analysis on each detected person; Mask R-CNN is the typical representative. b) Graph-model-based: instead of detecting people, first detect every joint of every person in the image, then group the detected joints into individual bodies; the various works differ mainly in how this grouping is done. One representative, CMU_pose, uses vector fields to assemble joints into limbs, and limb by limb recovers the complete pose; another, Associative Embedding, predicts an embedding value for each detected joint that indicates which group (i.e. which person) the joint belongs to.

For face keypoint detection one can simply regress the coordinate values directly, or, following the human-pose work above, construct some other learning target from which the coordinates are recovered indirectly. This solution constructs heatmaps as the learning target.

Since the dataset for this task is not complex, the prediction is made with a single hourglass module, lightly adapted to the task.

Method

  • Learning target

    A 2D Gaussian function is used to construct the learning target (heatmap). The ground-truth position of a keypoint is taken as the center, so the center receives the highest score and the score decays with distance from it. Formally, with (x0, y0) the ground-truth location,

    H(x, y) = exp(-((x - x0)^2 + (y - y0)^2) / (2 * sigma^2))
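The heatmap construction above can be sketched as follows (a minimal NumPy illustration; the function name and the sigma value are assumptions, not taken from the repository):

```python
import numpy as np

def gaussian_heatmap(cx, cy, size=96, sigma=2.0):
    """Build a 2D-Gaussian heatmap of shape (size, size) centered on the
    ground-truth keypoint (cx, cy): score 1.0 at the center, decaying with
    distance. sigma is an assumed value."""
    xs = np.arange(size)            # column (x) coordinates
    ys = np.arange(size)[:, None]   # row (y) coordinates, broadcast
    return np.exp(-((xs - cx) ** 2 + (ys - cy) ** 2) / (2 * sigma ** 2))
```

Stacking one such map per keypoint yields a (15, 96, 96) target tensor matching the network output described below.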

  • Visualized learning targets

    demo images

  • Network structure: the image is first convolved without changing its spatial size to obtain feature maps with a suitable number of channels; these feature maps are fed into an hourglass module; finally a 1x1 convolution linearly maps the features to 15 output maps, i.e. the 15 predicted heatmaps, one per keypoint type. The design never changes the input resolution, so it is a pixel-wise fully convolutional network (FCN).

    hourglass model diagram

    hourglass model

    network structure diagram of this work

    network structure
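The structure above can be sketched in PyTorch. This is a hedged, heavily simplified one-level illustration; the class names, channel count, and layer choices are assumptions, not the repository's actual model:

```python
import torch
import torch.nn as nn

class MiniHourglass(nn.Module):
    """One-level hourglass: process a downsampled branch, upsample it back,
    and add it to a same-scale skip branch."""
    def __init__(self, ch):
        super().__init__()
        self.skip = nn.Conv2d(ch, ch, 3, padding=1)
        self.down = nn.Sequential(
            nn.MaxPool2d(2),
            nn.Conv2d(ch, ch, 3, padding=1),
            nn.ReLU(),
        )
        self.up = nn.Upsample(scale_factor=2, mode="nearest")

    def forward(self, x):
        return self.skip(x) + self.up(self.down(x))

class KeypointNet(nn.Module):
    """Stem conv (spatial size unchanged) -> hourglass -> 1x1 conv head
    producing one heatmap per keypoint type."""
    def __init__(self, n_keypoints=15, ch=64):
        super().__init__()
        self.stem = nn.Sequential(nn.Conv2d(1, ch, 3, padding=1), nn.ReLU())
        self.hg = MiniHourglass(ch)
        self.head = nn.Conv2d(ch, n_keypoints, 1)  # linear 1x1 mapping

    def forward(self, x):
        return self.head(self.hg(self.stem(x)))
```

As in the text, every layer preserves the 96x96 resolution, so a (1, 96, 96) input yields 15 heatmaps of shape (96, 96).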

  • Inference

    • Normalize the image: divide pixel values by 255 to map them into [0, 1];
    • Reshape the network input to (1, 96, 96);
    • Forward pass: feed the input through the network to obtain the 15 predicted heatmaps, an output of shape (15, 96, 96);
    • Apply a max operator: the location of the maximum value in each heatmap is the prediction for that keypoint type.
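The decoding step above can be sketched as follows (NumPy; `decode_heatmaps` is an illustrative name, not taken from the repository):

```python
import numpy as np

def decode_heatmaps(heatmaps):
    """Return a (x, y) coordinate per keypoint: the argmax location of each
    predicted heatmap. heatmaps: array of shape (15, 96, 96)."""
    coords = []
    for hm in heatmaps:
        y, x = np.unravel_index(hm.argmax(), hm.shape)  # row = y, col = x
        coords.append((int(x), int(y)))
    return coords
```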

Experiments

  • PyTorch implementation

  • Training

    1. First trained the model on the fully annotated data only (2,140 images) with SGD, lr = 0.001. After 10 epochs the loss stopped decreasing (even after changing the lr), yet the model did not work; the loss simply failed to drop.
    2. Switched to Adam with lr = 0.001; the loss dropped rapidly and converged after 10 epochs, the model worked, and the submission scored 2.92.
    3. Trained on the full dataset, masking the unannotated keypoints so they contribute nothing to the loss. The loss kept decreasing and converged after 10 epochs; the submission scored 1.84.
    4. Lowered the lr to 0.0005 and continued training; the loss decreased further, and after 10 more epochs the submission scored 1.8086.
    5. Continued training; the score stayed essentially flat, perhaps even slightly worse.
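The masking idea in step 3 can be sketched as follows (a NumPy illustration of the principle; the actual implementation uses PyTorch and may differ):

```python
import numpy as np

def masked_mse(pred, target, mask):
    """MSE over predicted heatmaps where unannotated keypoints are masked
    out so they contribute nothing to the loss.

    pred, target: (K, H, W) heatmaps; mask: (K,) with 1.0 for annotated
    keypoints and 0.0 for missing ones."""
    m = mask[:, None, None]                      # broadcast mask over H, W
    n = m.sum() * pred.shape[1] * pred.shape[2]  # number of unmasked pixels
    return float((((pred - target) ** 2) * m).sum() / max(n, 1))
```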
  • Log excerpt

    [ Epoch 00800 -> 00000 / 7049 ] loss : 0.000668076856527 loss_coor : 0.0844968110323 max : 1.22951054573 min : 0.0
    [ Epoch 00800 -> 00072 / 7049 ] loss : 0.000570345146116 loss_coor :  0.082247518003 max : 1.27145636082 min : 0.0
    [ Epoch 00800 -> 00144 / 7049 ] loss : 0.000559106993023 loss_coor :  0.083879455924 max : 1.15733075142 min : 0.0
    [ Epoch 00800 -> 00216 / 7049 ] loss : 0.000504959840328 loss_coor :   0.08310367167 max : 1.18976962566 min : 0.0
    [ Epoch 00800 -> 00288 / 7049 ] loss : 0.000570140429772 loss_coor : 0.0833380296826 max : 1.2717602253 min : 0.0
    [ Epoch 00800 -> 00360 / 7049 ] loss : 0.000506064505316 loss_coor : 0.0804231092334 max : 1.17254185677 min : 0.0
    [ Epoch 00800 -> 00432 / 7049 ] loss : 0.000406593986554 loss_coor : 0.0836449936032 max : 1.20352518559 min : 0.0
    [ Epoch 00800 -> 00504 / 7049 ] loss : 0.000405297701946 loss_coor : 0.0817726328969 max : 1.23924446106 min : 0.0
    [ Epoch 00800 -> 00576 / 7049 ] loss : 0.000487966550281 loss_coor : 0.0837124586105 max : 1.16643023491 min : 0.0
    [ Epoch 00800 -> 00648 / 7049 ] loss : 0.000363715342246 loss_coor : 0.0824847668409 max : 1.16725647449 min : 0.0
    [ Epoch 00800 -> 00720 / 7049 ] loss : 0.000432570668636 loss_coor : 0.0835245028138 max : 1.13535535336 min : 0.0
    [ Epoch 00800 -> 00792 / 7049 ] loss : 0.00042586523341 loss_coor : 0.0825248062611 max : 1.13426482677 min : 0.0
    [ Epoch 00800 -> 00864 / 7049 ] loss : 0.000455046625575 loss_coor : 0.0808562189341 max : 1.09106147289 min : 0.0
    [ Epoch 00800 -> 00936 / 7049 ] loss : 0.000385391729651 loss_coor :   0.08440952003 max : 1.09624230862 min : 0.0
    [ Epoch 00800 -> 01008 / 7049 ] loss : 0.00059827347286 loss_coor : 0.0826571434736 max : 1.15060770512 min : 0.0
    [ Epoch 00800 -> 01080 / 7049 ] loss : 0.000396907416871 loss_coor : 0.0815715491772 max : 1.13137769699 min : 0.0
    [ Epoch 00800 -> 01152 / 7049 ] loss : 0.000362811610103 loss_coor : 0.0865842476487 max : 1.08385503292 min : 0.0
    [ Epoch 00800 -> 01224 / 7049 ] loss : 0.000393233931391 loss_coor : 0.0852425917983 max : 1.11058795452 min : 0.0
    [ Epoch 00800 -> 01296 / 7049 ] loss : 0.000454480032204 loss_coor : 0.0859401002526 max : 1.10379803181 min : 0.0
    [ Epoch 00800 -> 01368 / 7049 ] loss : 0.000389240361983 loss_coor : 0.0848914980888 max : 1.0531847477 min : 0.0
    [ Epoch 00800 -> 01440 / 7049 ] loss : 0.000810728233773 loss_coor : 0.0820694491267 max : 1.0755417347 min : 0.0
    [ Epoch 00800 -> 01512 / 7049 ] loss : 0.000508533208631 loss_coor : 0.0811709463596 max : 1.11292123795 min : 0.0
    [ Epoch 00800 -> 01584 / 7049 ] loss : 0.000415266724303 loss_coor : 0.0858198255301 max : 1.05843913555 min : 0.0
    [ Epoch 00800 -> 01656 / 7049 ] loss : 0.000354411109583 loss_coor : 0.0838029235601 max : 1.08815252781 min : 0.0
    

References
