okankop / Mff Pytorch

Licence: other
Motion Fused Frames implementation in PyTorch, code and pretrained models.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Mff Pytorch

People Counter Python
Create a smart video application using the Intel Distribution of OpenVINO toolkit. The toolkit uses models and inference to run single-class object detection.
Stars: ✭ 62 (-46.55%)
Mutual labels:  pretrained-models
Gen Efficientnet Pytorch
Pretrained EfficientNet, EfficientNet-Lite, MixNet, MobileNetV3 / V2, MNASNet A1 and B1, FBNet, Single-Path NAS
Stars: ✭ 1,275 (+999.14%)
Mutual labels:  pretrained-models
Ghostnet
CV backbones including GhostNet, TinyNet and TNT, developed by Huawei Noah's Ark Lab.
Stars: ✭ 1,744 (+1403.45%)
Mutual labels:  pretrained-models
Sotawhat
Returns the latest research results by crawling arXiv papers and summarizing abstracts. Helps you stay afloat with so many new papers every day.
Stars: ✭ 1,181 (+918.1%)
Mutual labels:  arxiv
Cnn Paper2
🎨🎨 Deep learning convolutional neural network tutorials: image recognition, object detection, semantic segmentation, instance segmentation, face recognition, neural style transfer, GANs, and more 🎨🎨 https://dataxujing.github.io/CNN-paper2/
Stars: ✭ 77 (-33.62%)
Mutual labels:  arxiv
Lipreading Densenet3d
DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.06990
Stars: ✭ 91 (-21.55%)
Mutual labels:  arxiv
Gpt2 Ml
GPT2 for Multiple Languages, including pretrained models; multilingual GPT2 support with a 1.5-billion-parameter Chinese pretrained model.
Stars: ✭ 1,066 (+818.97%)
Mutual labels:  pretrained-models
Lipnet Pytorch
The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)
Stars: ✭ 104 (-10.34%)
Mutual labels:  arxiv
Awesome Vln
A curated list of research papers in Vision-Language Navigation (VLN)
Stars: ✭ 86 (-25.86%)
Mutual labels:  arxiv
Paper Reading
Notes on reading deep learning papers and hands-on data warehouse practice: more engineering-savvy than algorithm researchers, more algorithm-savvy than engineers.
Stars: ✭ 101 (-12.93%)
Mutual labels:  arxiv
Casual Digressions
💤 Old repository of notes on machine learning papers.
Stars: ✭ 73 (-37.07%)
Mutual labels:  arxiv
Dialogue Understanding
This repository contains PyTorch implementation for the baseline models from the paper Utterance-level Dialogue Understanding: An Empirical Study
Stars: ✭ 77 (-33.62%)
Mutual labels:  pretrained-models
Hbonet
[ICCV 2019] Harmonious Bottleneck on Two Orthogonal Dimensions
Stars: ✭ 94 (-18.97%)
Mutual labels:  pretrained-models
Farm
🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
Stars: ✭ 1,140 (+882.76%)
Mutual labels:  pretrained-models
Dimenet
DimeNet and DimeNet++ models, as proposed in "Directional Message Passing for Molecular Graphs" (ICLR 2020) and "Fast and Uncertainty-Aware Directional Message Passing for Non-Equilibrium Molecules" (NeurIPS-W 2020)
Stars: ✭ 106 (-8.62%)
Mutual labels:  pretrained-models
Pytorch Image Models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
Stars: ✭ 15,232 (+13031.03%)
Mutual labels:  pretrained-models
Nfnets pytorch
Pre-trained NFNets with 99% of the accuracy of the official paper "High-Performance Large-Scale Image Recognition Without Normalization".
Stars: ✭ 85 (-26.72%)
Mutual labels:  pretrained-models
Reproducible Image Denoising State Of The Art
Collection of popular and reproducible image denoising works.
Stars: ✭ 1,776 (+1431.03%)
Mutual labels:  arxiv
Transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Stars: ✭ 55,742 (+47953.45%)
Mutual labels:  pretrained-models
Covid Twitter Bert
Pretrained BERT model for analysing COVID-19 Twitter data
Stars: ✭ 101 (-12.93%)
Mutual labels:  pretrained-models

Motion Fused Frames (MFFs)

PyTorch implementation of the paper "Motion Fused Frames: Data Level Fusion Strategy for Hand Gesture Recognition".

- Update: The code has been updated for PyTorch 1.5.0 and CUDA 10.2

Installation

  • Clone the repo with the following command:
git clone https://github.com/okankop/MFF-pytorch.git
  • Set up a virtual environment and install the requirements:
conda create -n MFF python=3.7.4
conda activate MFF
pip install -r requirements.txt
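
A quick way to confirm that the environment matches the versions mentioned in the update note above (PyTorch 1.5.0, CUDA 10.2) is a short check like the one below. This snippet is illustrative and not part of the repository:

# Illustrative environment check (not shipped with the repository).
import torch

print("PyTorch version:", torch.__version__)       # expected: 1.5.0
print("CUDA available:", torch.cuda.is_available())
print("CUDA version:", torch.version.cuda)          # expected: 10.2
if torch.cuda.is_available():
    print("GPU count:", torch.cuda.device_count())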

Dataset Preparation

Download the Jester dataset, the NVIDIA Dynamic Hand Gesture dataset, or the ChaLearn LAP IsoGD dataset. Decompress them into the same folder and use process_dataset.py to generate the index files for the train, val, and test splits. Properly set up the train, validation, and category meta files in datasets_video.py. Finally, use the flow_computation directory to calculate the optical flow images with the Brox method.

Assume the structure of data directories is the following:

~/MFF-pytorch/
   datasets/
      jester/
         rgb/
            .../ (directories of video samples)
                .../ (jpg color frames)
         flow/
            u/
               .../ (directories of video samples)
                  .../ (jpg optical-flow-u frames)
            v/
               .../ (directories of video samples)
                  .../ (jpg optical-flow-v frames)
   model/
      .../ (saved models for the last checkpoint and best model)
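
Before generating the index files and training, it can help to verify that every RGB sample has matching optical-flow directories. The snippet below is a minimal sanity-check sketch, not part of the repository; it assumes the jester layout shown above:

# Minimal layout sanity check (illustrative sketch, not shipped with the repository).
import os

root = os.path.expanduser("~/MFF-pytorch/datasets/jester")
rgb_samples    = set(os.listdir(os.path.join(root, "rgb")))
flow_u_samples = set(os.listdir(os.path.join(root, "flow", "u")))
flow_v_samples = set(os.listdir(os.path.join(root, "flow", "v")))

missing_flow = rgb_samples - (flow_u_samples & flow_v_samples)
print(f"{len(rgb_samples)} RGB samples, {len(missing_flow)} without matching u/v flow")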

Running the Code

The following are some example commands for training under different scenarios:

  • Train 4-segment network with 3 flow, 1 color frames (4-MFFs-3f1c architecture)
python main.py jester RGBFlow --arch BNInception --num_segments 4 \
--consensus_type MLP --num_motion 3  --batch-size 32
  • Train resuming the last checkpoint (4-MFFs-3f1c architecture)
python main.py jester RGBFlow --resume=<path-to-last-checkpoint> --arch BNInception \
--consensus_type MLP --num_segments 4 --num_motion 3  --batch-size 32
  • The command to test trained models (4-MFFs-3f1c architecture). Pretrained models are under pretrained_models.
python test_models.py jester RGBFlow pretrained_models/MFF_jester_RGBFlow_BNInception_segment4_3f1c_best.pth.tar --arch BNInception --consensus_type MLP --test_crops 1 --num_motion 3 --test_segments 4
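
For intuition, "4-MFFs-3f1c" means 4 segments, each fusing 3 optical-flow frames (u and v channels) with 1 color frame at the data level, giving 9 input channels per segment. The sketch below only illustrates the shape of such a fused input; the actual sampling and loading logic lives in the repository's data loader:

# Illustrative shape of one 4-MFFs-3f1c input sample (not the repository's loader).
import torch

num_segments, num_motion = 4, 3               # 4 segments, 3 flow frames per segment
H, W = 224, 224
segments = []
for _ in range(num_segments):
    flow = torch.rand(2 * num_motion, H, W)   # 3 flow frames x (u, v) = 6 channels
    rgb = torch.rand(3, H, W)                 # 1 color frame = 3 channels
    segments.append(torch.cat([flow, rgb], dim=0))  # data-level fusion: 9 channels
sample = torch.stack(segments)                # shape: (4, 9, 224, 224)
print(sample.shape)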

All available GPUs are used for training by default. To train on only a subset of the GPUs, set CUDA_VISIBLE_DEVICES=...

Citation

If you use this code or pre-trained models, please cite the following:

@InProceedings{Kopuklu_2018_CVPR_Workshops,
  author    = {Kopuklu, Okan and Kose, Neslihan and Rigoll, Gerhard},
  title     = {Motion Fused Frames: Data Level Fusion Strategy for Hand Gesture Recognition},
  booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
  month     = {June},
  year      = {2018}
}

Acknowledgement

This project is built on top of the TSN-pytorch codebase; we thank Yuanjun Xiong for releasing it. We also thank Bolei Zhou for the inspirational work Temporal Segment Networks, from which we imported process_dataset.py into our project.
