MegviiDetection / Video_analyst
Licence: mit
A series of basic algorithms that are useful for video understanding, including Single Object Tracking (SOT), Video Object Segmentation (VOS) and so on.
Stars: ✭ 550
Programming Languages
python
139335 projects - #7 most used programming language
Labels
Projects that are alternatives of or similar to Video analyst
Alfred
alfred-py: A deep learning utility library for **human**, more detail about the usage of lib to: https://zhuanlan.zhihu.com/p/341446046
Stars: ✭ 460 (-16.36%)
Mutual labels: segmentation
Trackit
[ECCV'20] Ocean: Object-aware Anchor-Free Tracking
Stars: ✭ 424 (-22.91%)
Mutual labels: segmentation
Jetson Inference
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
Stars: ✭ 5,191 (+843.82%)
Mutual labels: segmentation
Cascadepsp
[CVPR2020] CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement
Stars: ✭ 407 (-26%)
Mutual labels: segmentation
Superpoint graph
Large-scale Point Cloud Semantic Segmentation with Superpoint Graphs
Stars: ✭ 533 (-3.09%)
Mutual labels: segmentation
Simpleitk
SimpleITK: a layer built on top of the Insight Toolkit (ITK), intended to simplify and facilitate ITK's use in rapid prototyping, education and interpreted languages.
Stars: ✭ 458 (-16.73%)
Mutual labels: segmentation
Face segmentation
Deep face segmentation in extremely hard conditions
Stars: ✭ 510 (-7.27%)
Mutual labels: segmentation
Pose2seg
Code for the paper "Pose2Seg: Detection Free Human Instance Segmentation" @ CVPR2019.
Stars: ✭ 423 (-23.09%)
Mutual labels: segmentation
Dipy
DIPY is the paragon 3D/4D+ imaging library in Python. Contains generic methods for spatial normalization, signal processing, machine learning, statistical analysis and visualization of medical images. Additionally, it contains specialized methods for computational anatomy including diffusion, perfusion and structural imaging.
Stars: ✭ 417 (-24.18%)
Mutual labels: segmentation
Lidar Bonnetal
Semantic and Instance Segmentation of LiDAR point clouds for autonomous driving
Stars: ✭ 465 (-15.45%)
Mutual labels: segmentation
Medicalzoopytorch
A pytorch-based deep learning framework for multi-modal 2D/3D medical image segmentation
Stars: ✭ 546 (-0.73%)
Mutual labels: segmentation
Fishnet
Implementation code of the paper: FishNet: A Versatile Backbone for Image, Region, and Pixel Level Prediction, NeurIPS 2018
Stars: ✭ 533 (-3.09%)
Mutual labels: segmentation
Multi Human Parsing
🔥🔥Official Repository for Multi-Human-Parsing (MHP)🔥🔥
Stars: ✭ 507 (-7.82%)
Mutual labels: segmentation
Video Analyst
This is the implementation of a series of basic algorithms which is useful for video understanding, including Single Object Tracking (SOT), Video Object Segmentation (VOS), etc.
Current implementation list:
SOT Quick start
Setup
Please refer to SETUP.md, SOT_SETUP.md
Demo
SOT video demo
# demo with web camera
python3 ./demo/main/video/sot_video.py --config 'experiments/siamfcpp/test/vot/siamfcpp_alexnet.yaml' --device cuda --video "webcam"
# demo with video file, and dump result into video file (optional)
python3 ./demo/main/video/sot_video.py --config 'experiments/siamfcpp/test/vot/siamfcpp_alexnet.yaml' --device cuda --video $video_dir/demo.mp4 --output $dump_path/result.mp4
# demo with extracted image files, and dump result into image files (optional)
python3 ./demo/main/video/sot_video.py --config 'experiments/siamfcpp/test/vot/siamfcpp_alexnet.yaml' --device cuda --video $video_dir/*.jpg --output $dump_dir
Test
Please refer to SOT_TEST.md for detail.
Training
Please refer to SOT_TRAINING.md for detail.
Repository structure (in progress)
project_root/
├── experiments # experiment configurations, in yaml format
├── main
│ ├── train.py # trainng entry point
│ └── test.py # test entry point
├── video_analyst
│ ├── data # modules related to data
│ │ ├── dataset # data fetcher of each individual dataset
│ │ ├── sampler # data sampler, including inner-dataset and intra-dataset sampling procedure
│ │ ├── dataloader.py # data loading procedure
│ │ └── transformer # data augmentation
│ ├── engine # procedure controller, including traiing control / hp&model loading
│ │ ├── monitor # monitor for tasks during training, including visualization / logging / benchmarking
│ │ ├── trainer.py # train a epoch
│ │ ├── tester.py # test a model on a benchmark
│ ├── model # model builder
│ │ ├── backbone # backbone network builder
│ │ ├── common_opr # shared operator (e.g. cross-correlation)
│ │ ├── task_model # holistic model builder
│ │ ├── task_head # head network builder
│ │ └── loss # loss builder
│ ├── pipeline # pipeline builder (tracking / vos)
│ │ ├── segmenter # segmenter builder for vos
│ │ ├── tracker # tracker builder for tracking
│ │ └── utils # pipeline utils
│ ├── config # configuration manager
│ ├── evaluation # benchmark
│ ├── optim # optimization-related module (learning rate, gradient clipping, etc.)
│ │ ├── optimizer # optimizer
│ │ ├── scheduler # learning rate scheduler
│ │ └── grad_modifier # gradient-related operation (parameter freezing)
│ └── utils # useful tools
└── README.md
docs
For detail, please refer to markdown files under docs.
SOT
- SOT_SETUP.md: instructions for setting-up
- SOT_MODEL_ZOO.md: description of released sot models
- SOT_TRAINING.md: details related to training
- SOT_TEST.md: details related to test
VOS
- VOS_SETUP.md: instructions for setting-up
- VOS_MODEL_ZOO.md: description of released sot models
- VOS_TRAINING.md: details related to training
- VOS_TEST.md: details related to training
DEVELOP
- DEVELOP.md: description of project design (registry, configuration tree, etc.)
- PIPELINE_API.md: description for pipeline API
- FORMATTING_INSTRUCTION: instruction for code formatting (yapf/isort/flake/etc.)
TODO
[] refine code stype and test cases
Acknowledgement
- video_analyst/evaluation/vot_benchmark and other related code have been borrowed from PySOT
- video_analyst/evaluation/got_benchmark and other related code have been borrowed from got-toolkit
- detectron2
- fvcore
- pytracking
- DROL
References
@inproceedings{xu2020siamfc++,
title={SiamFC++: Towards Robust and Accurate Visual Tracking with Target Estimation Guidelines.},
author={Xu, Yinda and Wang, Zeyu and Li, Zuoxin and Yuan, Ye and Yu, Gang},
booktitle={AAAI},
pages={12549--12556},
year={2020}
}
@inproceedings{chen2020state,
title={State-Aware Tracker for Real-Time Video Object Segmentation},
author={Chen, Xi and Li, Zuoxin and Yuan, Ye and Yu, Gang and Shen, Jianxin and Qi, Donglian},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={9384--9393},
year={2020}
}
Contact
Maintainer (sorted by family name):
- Xi Chen@XavierCHEN34
- Zuoxin Li@lzx1413
- Zeyu Wang@JWarlock
- Yinda Xu@MARMOTatZJU
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].