All Projects → ymao1993 → HumanRecognition

ymao1993 / HumanRecognition

Licence: MIT license
Person Recognition System on PIPA dataset

Programming Languages

python
139335 projects - #7 most used programming language
C++
36643 projects - #6 most used programming language
HTML
75241 projects
Jupyter Notebook
11667 projects
shell
77523 projects
javascript
184084 projects - #8 most used programming language

Projects that are alternatives of or similar to HumanRecognition

Music Dance Video Synthesis
(ACM MM 20 Oral) PyTorch implementation of Self-supervised Dance Video Synthesis Conditioned on Music
Stars: ✭ 150 (+435.71%)
Mutual labels:  multimedia
Swiftffmpeg
A Swift wrapper for the FFmpeg API
Stars: ✭ 243 (+767.86%)
Mutual labels:  multimedia
RSS-to-Telegram-Bot
A Telegram RSS bot that cares about your reading experience
Stars: ✭ 482 (+1621.43%)
Mutual labels:  multimedia
Xbmc
Kodi is an award-winning free and open source home theater/media center software and entertainment hub for digital media. With its beautiful interface and powerful skinning engine, it's available for Android, BSD, Linux, macOS, iOS and Windows.
Stars: ✭ 13,175 (+46953.57%)
Mutual labels:  multimedia
Mpv
🎥 Command line video player
Stars: ✭ 17,018 (+60678.57%)
Mutual labels:  multimedia
Vlckit
libvlc bindings for OS X, iOS and tvOS in Objective-C, official mirror
Stars: ✭ 249 (+789.29%)
Mutual labels:  multimedia
Vditor
♏ 一款浏览器端的 Markdown 编辑器。
Stars: ✭ 1,742 (+6121.43%)
Mutual labels:  multimedia
smk
SMK - Simple multimedia kit - C++ WebAssembly
Stars: ✭ 89 (+217.86%)
Mutual labels:  multimedia
Movian
An advanced media center
Stars: ✭ 235 (+739.29%)
Mutual labels:  multimedia
vlc
VLC media player - All pull requests are ignored, please follow https://wiki.videolan.org/Sending_Patches_VLC/
Stars: ✭ 10,341 (+36832.14%)
Mutual labels:  multimedia
Video Thumbnail Generator
📷 Generate thumbnail sprites from videos.
Stars: ✭ 190 (+578.57%)
Mutual labels:  multimedia
Csfml
Official binding of SFML for C
Stars: ✭ 211 (+653.57%)
Mutual labels:  multimedia
nano-dlna
A minimal UPnP/DLNA media streamer
Stars: ✭ 146 (+421.43%)
Mutual labels:  multimedia
Haruna
Open source video player built with Qt/QML and libmpv.
Stars: ✭ 147 (+425%)
Mutual labels:  multimedia
awesome-puredata
A list of Pure Data libraries, abstractions, projects and presentations
Stars: ✭ 36 (+28.57%)
Mutual labels:  multimedia
Ffmpeg Video Player
An FFmpeg and SDL Tutorial.
Stars: ✭ 149 (+432.14%)
Mutual labels:  multimedia
Python Mpv
Python interface to the awesome mpv media player
Stars: ✭ 245 (+775%)
Mutual labels:  multimedia
conan-sfml
[OBSOLETE] The recipe is now in https://github.com/bincrafters/community
Stars: ✭ 13 (-53.57%)
Mutual labels:  multimedia
emacs-application-framework
EAF, an extensible framework that revolutionizes the graphical capabilities of Emacs
Stars: ✭ 2,454 (+8664.29%)
Mutual labels:  multimedia
fileplayer.github.io
FilePlayer (http://fileplayer.github.io) is another HTML5 demo. It's an open source web media player. It plays audio and video, including immersive 360° videos directly in your browser. (⚠️ the FilePlayer's development is currently off because we are working on @gridsound)
Stars: ✭ 42 (+50%)
Mutual labels:  multimedia

HumanRecognition

The system is develped to perform person recognition task on PIPA dataset, the detailed description of the approach of this system can be found here.

Performance (Last Updated: 5/19/2017)

Config Accuracy
Face 62.18%
Head 63.19%
Upper-body 67.44%
Full-body 58.96%
All modality fused 82.31%
All modality fused + MRF refining 86.18%

Usage

Preparation

  • get dataset

     ./scripts/get_PIPA.sh
    
  • get models

     ./scripts/get_models.sh
    
  • get features (if you don't wish to extract them)

     ./scripts/get_features.sh
    

Face Feature

We use FaceNet for face feature extraction. FaceNet is a CNN trained to directly optimize the embedding itself.

  • test face feature extractor

     python pyHumanRecog/face_feature_extractor_test.py
    

Head Feature

  • train head feature extractor (feel free to experiment with different batch size)

     python pyHumanRecog/head_feature_extractor_train.py --batch_size 32
    
  • test head feature extractor

     python pyHumanRecog/head_feature_extractor_test.py --batch_size 32
    

Body Feature

  • train body feature extractor (feel free to experiment with different batch size)

     python pyHumanRecog/body_feature_extractor_train.py --batch_size 32
    
  • test body feature extractor

     python pyHumanRecog/body_feature_extractor_test.py --batch_size 32
    

Upper-body Feature

  • train upper-body feature extractor (feel free to experiment with different batch size)

     python pyHumanRecog/upper_body_feature_extractor_train.py --batch_size 32
    
  • test upper-body feature extractor

     python pyHumanRecog/upper_body_feature_extractor_test.py --batch_size 32
    

Pose estimation

We use CPM for pose estimation. The estimated CPM pose will mainly be used for image warping.

  • CPM pose estimation

     python pyHumanRecog/extract_pose.py <img_dump_folder> <pose_dump_folder>
    

    <image_dump_folder>: folder to dump CPM pose visualization images

    <pose_dump_folder>: folder to dump CPM pose positions

Evaluation

For performance evaluation, Please first modify performance_test_config.py (within pyHumanRecog folder) to specify the features you wish to use and their corresponding weights. Then execute the following command.

python pyHumanRecog/performance_test.py

To perform MRF optimization (which incorporates the photo-level cooccurrence and mutual exclusive pattern into the final prediction), Set refine_with_photo_level_context = True in HumanRecog/performance_test_config.py.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].