[NeurIPS'21] Unified tracking framework with a single appearance model. It supports Single Object Tracking (SOT), Video Object Segmentation (VOS), Multi-Object Tracking (MOT), Multi-Object Tracking and Segmentation (MOTS), Pose Tracking, Video Instance Segmentation (VIS), and class-agnostic MOT (e.g. TAO dataset).

Stars: ✭ 293 (+285.53%)

Mutual labels: video-object-segmentation

Awesome-low-level-vision-resources

A curated list of resources for Low-level Vision Tasks

Stars: ✭ 35 (-53.95%)

Mutual labels: cvpr2021

DeFMO

[CVPR 2021] DeFMO: Deblurring and Shape Recovery of Fast Moving Objects

Stars: ✭ 144 (+89.47%)

Mutual labels: cvpr2021

anchor-diff-VOS

Anchor Diffusion for Unsupervised Video Object Segmentation

Stars: ✭ 113 (+48.68%)

Mutual labels: video-object-segmentation

One-Shot-Face-Swapping-on-Megapixels

One Shot Face Swapping on Megapixels.

Stars: ✭ 260 (+242.11%)

Mutual labels: cvpr2021

Cvpr2021 Papers With Code

CVPR 2021 论文和开源项目合集

Stars: ✭ 7,138 (+9292.11%)

Mutual labels: cvpr2021

Domain-Consensus-Clustering

[CVPR2021] Domain Consensus Clustering for Universal Domain Adaptation

Stars: ✭ 85 (+11.84%)

Mutual labels: cvpr2021

LBYLNet

[CVPR2021] Look before you leap: learning landmark features for one-stage visual grounding.

Stars: ✭ 46 (-39.47%)

Mutual labels: cvpr2021

SiamMaskCpp

C++ Implementation of SiamMask

Stars: ✭ 92 (+21.05%)

Mutual labels: video-object-segmentation

Siammask

[CVPR2019] Fast Online Object Tracking and Segmentation: A Unifying Approach

Stars: ✭ 3,205 (+4117.11%)

Mutual labels: video-object-segmentation

Scan2Cap

[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

Stars: ✭ 81 (+6.58%)

Mutual labels: cvpr2021

GIS-RAmap

Pytorch implementation of CVPR2021 oral paper (best paper candidate), "Guided Interactive Video Object Segmentation Using Reliability-Based Attention Maps"

Stars: ✭ 36 (-52.63%)

Mutual labels: video-object-segmentation

cfvqa

[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias

Stars: ✭ 96 (+26.32%)

Mutual labels: cvpr2021

Cvpr2021 Paper Code Interpretation

cvpr2021/cvpr2020/cvpr2019/cvpr2018/cvpr2017 论文/代码/解读/直播合集，极市团队整理

Stars: ✭ 8,075 (+10525%)

Mutual labels: cvpr2021

HandMesh

No description or website provided.

Stars: ✭ 258 (+239.47%)

Mutual labels: cvpr2021

GraphMemVOS

Video Object Segmentation with Episodic Graph Memory Networks (ECCV2020 spotlight)

Stars: ✭ 92 (+21.05%)

Mutual labels: video-object-segmentation

SGGpoint

[CVPR 2021] Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph Analysis (official pytorch implementation)

Stars: ✭ 41 (-46.05%)

Mutual labels: cvpr2021

soft-intro-vae-pytorch

[CVPR 2021 Oral] Official PyTorch implementation of Soft-IntroVAE from the paper "Soft-IntroVAE: Analyzing and Improving Introspective Variational Autoencoders"

Stars: ✭ 170 (+123.68%)

Mutual labels: cvpr2021

View All Similar Projects ➔

RMNet

This repository contains the source code for the paper Efficient Regional Memory Network for Video Object Segmentation.

Cite this work

@inproceedings{xie2021efficient,
  title={Efficient Regional Memory Network for Video Object Segmentation},
  author={Xie, Haozhe and 
          Yao, Hongxun and 
          Zhou, Shangchen and 
          Zhang, Shengping and 
          Sun, Wenxiu},
  booktitle={CVPR},
  year={2021}
}

Datasets

We use the ECSSD, COCO, PASCAL VOC, MSRA10K, DAVIS, and YouTube-VOS datasets in our experiments, which are available below:

Pretrained Models

The pretrained models for DAVIS and YouTube-VOS are available as follows:

RMNet for DAVIS (202 MB)
RMNet for YouTube-VOS (202 MB)

Prerequisites

Clone the Code Repository

git clone https://github.com/hzxie/RMNet.git

Install Python Denpendencies

cd RMNet
pip install -r requirements.txt

Build PyTorch Extensions

NOTE: PyTorch >= 1.4, CUDA >= 9.0 and GCC >= 4.9 are required.

RMNET_HOME=`pwd`

cd $RMNET_HOME/extensions/reg_att_map_generator
python setup.py install --user

cd $RMNET_HOME/extensions/flow_affine_transformation
python setup.py install --user

Precompute the Optical Flow

For the DAVIS dataset, the optical flows are computed by FlowNet2-CSS with the model pretrained on FlyingThings3D.
For the YouTube-VOS dataset, the optical flows are computed by RAFT with the model pretrained on Sintel.

Update Settings in `config.py`

You need to update the file path of the datasets:

__C.DATASETS                                     = edict()
__C.DATASETS.DAVIS                               = edict()
__C.DATASETS.DAVIS.INDEXING_FILE_PATH            = './datasets/DAVIS.json'
__C.DATASETS.DAVIS.IMG_FILE_PATH                 = '/path/to/Datasets/DAVIS/JPEGImages/480p/%s/%05d.jpg'
__C.DATASETS.DAVIS.ANNOTATION_FILE_PATH          = '/path/to/Datasets/DAVIS/Annotations/480p/%s/%05d.png'
__C.DATASETS.DAVIS.OPTICAL_FLOW_FILE_PATH        = '/path/to/Datasets/DAVIS/OpticalFlows/480p/%s/%05d.flo'
__C.DATASETS.YOUTUBE_VOS                         = edict()
__C.DATASETS.YOUTUBE_VOS.INDEXING_FILE_PATH      = '/path/to/Datasets/YouTubeVOS/%s/meta.json'
__C.DATASETS.YOUTUBE_VOS.IMG_FILE_PATH           = '/path/to/Datasets/YouTubeVOS/%s/JPEGImages/%s/%s.jpg'
__C.DATASETS.YOUTUBE_VOS.ANNOTATION_FILE_PATH    = '/path/to/Datasets/YouTubeVOS/%s/Annotations/%s/%s.png'
__C.DATASETS.YOUTUBE_VOS.OPTICAL_FLOW_FILE_PATH  = '/path/to/Datasets/YouTubeVOS/%s/OpticalFlows/%s/%s.flo'
__C.DATASETS.PASCAL_VOC                          = edict()
__C.DATASETS.PASCAL_VOC.INDEXING_FILE_PATH       = '/path/to/Datasets/voc2012/trainval.txt'
__C.DATASETS.PASCAL_VOC.IMG_FILE_PATH            = '/path/to/Datasets/voc2012/images/%s.jpg'
__C.DATASETS.PASCAL_VOC.ANNOTATION_FILE_PATH     = '/path/to/Datasets/voc2012/masks/%s.png'
__C.DATASETS.ECSSD                               = edict()
__C.DATASETS.ECSSD.N_IMAGES                      = 1000
__C.DATASETS.ECSSD.IMG_FILE_PATH                 = '/path/to/Datasets/ecssd/images/%s.jpg'
__C.DATASETS.ECSSD.ANNOTATION_FILE_PATH          = '/path/to/Datasets/ecssd/masks/%s.png'
__C.DATASETS.MSRA10K                             = edict()
__C.DATASETS.MSRA10K.INDEXING_FILE_PATH          = './datasets/msra10k.txt'
__C.DATASETS.MSRA10K.IMG_FILE_PATH               = '/path/to/Datasets/msra10k/images/%s.jpg'
__C.DATASETS.MSRA10K.ANNOTATION_FILE_PATH        = '/path/to/Datasets/msra10k/masks/%s.png'
__C.DATASETS.MSCOCO                              = edict()
__C.DATASETS.MSCOCO.INDEXING_FILE_PATH           = './datasets/mscoco.txt'
__C.DATASETS.MSCOCO.IMG_FILE_PATH                = '/path/to/Datasets/coco2017/images/train2017/%s.jpg'
__C.DATASETS.MSCOCO.ANNOTATION_FILE_PATH         = '/path/to/Datasets/coco2017/masks/train2017/%s.png'
__C.DATASETS.ADE20K                              = edict()
__C.DATASETS.ADE20K.INDEXING_FILE_PATH           = './datasets/ade20k.txt'
__C.DATASETS.ADE20K.IMG_FILE_PATH                = '/path/to/Datasets/ADE20K_2016_07_26/images/training/%s.jpg'
__C.DATASETS.ADE20K.ANNOTATION_FILE_PATH         = '/path/to/Datasets/ADE20K_2016_07_26/images/training/%s_seg.png'

# Dataset Options: DAVIS, DAVIS_FRAMES, YOUTUBE_VOS, ECSSD, MSCOCO, PASCAL_VOC, MSRA10K, ADE20K
__C.DATASET.TRAIN_DATASET                        = ['ECSSD', 'PASCAL_VOC', 'MSRA10K', 'MSCOCO']  # Pretrain
__C.DATASET.TRAIN_DATASET                        = ['YOUTUBE_VOS', 'DAVISx5']                    # Fine-tune
__C.DATASET.TEST_DATASET                         = 'DAVIS'

# Network Options: RMNet, TinyFlowNet
__C.TRAIN.NETWORK                                = 'RMNet'

Get Started

To train RMNet, you can simply use the following command:

python3 runner.py

To test RMNet, you can use the following command:

python3 runner.py --test --weights=/path/to/pretrained/model.pth

License

This project is open sourced under MIT license.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

hzxie / RMNet

Programming Languages

Labels

Projects that are alternatives of or similar to RMNet

RMNet

Cite this work

Datasets

Pretrained Models

Prerequisites

Clone the Code Repository

Install Python Denpendencies

Build PyTorch Extensions

Precompute the Optical Flow

Update Settings in `config.py`

Get Started

License

Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

hzxie / RMNet

Programming Languages

Labels

Projects that are alternatives of or similar to RMNet

RMNet

Cite this work

Datasets

Pretrained Models

Prerequisites

Clone the Code Repository

Install Python Denpendencies

Build PyTorch Extensions

Precompute the Optical Flow

Update Settings in config.py

Get Started

License

Update Settings in `config.py`