All Projects → cap-ntu → Video To Retail Platform

cap-ntu / Video To Retail Platform

Licence: apache-2.0
An intelligent multimodal-learning based system for video, product and ads analysis. Based on the system, people can build a lot of downstream applications such as product recommendation, video retrieval, etc.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Video To Retail Platform

Androidtensorflowmachinelearningexample
Android TensorFlow MachineLearning Example (Building TensorFlow for Android)
Stars: ✭ 1,369 (+1167.59%)
Mutual labels:  deep-neural-networks
Faceaging By Cyclegan
Stars: ✭ 105 (-2.78%)
Mutual labels:  deep-neural-networks
Video2description
Video to Text: Generates description in natural language for given video (Video Captioning)
Stars: ✭ 107 (-0.93%)
Mutual labels:  deep-neural-networks
Sdr Densenet Pytorch
Stochastic Delta Rule implemented in Pytorch on DenseNet
Stars: ✭ 102 (-5.56%)
Mutual labels:  deep-neural-networks
Intro To Deep Learning
A collection of materials to help you learn about deep learning
Stars: ✭ 103 (-4.63%)
Mutual labels:  deep-neural-networks
Jlm
A fast LSTM Language Model for large vocabulary language like Japanese and Chinese
Stars: ✭ 105 (-2.78%)
Mutual labels:  deep-neural-networks
Mxnet Finetuner
An all-in-one Deep Learning toolkit for image classification to fine-tuning pretrained models using MXNet.
Stars: ✭ 100 (-7.41%)
Mutual labels:  deep-neural-networks
Deep learning object detection
A paper list of object detection using deep learning.
Stars: ✭ 10,334 (+9468.52%)
Mutual labels:  deep-neural-networks
Tensorflow2.0 Examples
🙄 Difficult algorithm, Simple code.
Stars: ✭ 1,397 (+1193.52%)
Mutual labels:  deep-neural-networks
Ict
Code for reproducing ICT ( published in IJCAI 2019)
Stars: ✭ 107 (-0.93%)
Mutual labels:  deep-neural-networks
Models
DLTK Model Zoo
Stars: ✭ 101 (-6.48%)
Mutual labels:  deep-neural-networks
Pytorchnlpbook
Code and data accompanying Natural Language Processing with PyTorch published by O'Reilly Media https://nlproc.info
Stars: ✭ 1,390 (+1187.04%)
Mutual labels:  deep-neural-networks
Opentpod
Open Toolkit for Painless Object Detection
Stars: ✭ 106 (-1.85%)
Mutual labels:  deep-neural-networks
Vosk Api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+1156.48%)
Mutual labels:  deep-neural-networks
Selfdrivingcar
A collection of all projects pertaining to different layers in the SDC software stack
Stars: ✭ 107 (-0.93%)
Mutual labels:  deep-neural-networks
Top Deep Learning
Top 200 deep learning Github repositories sorted by the number of stars.
Stars: ✭ 1,365 (+1163.89%)
Mutual labels:  deep-neural-networks
Planematch
[ECCV'18 Oral] PlaneMatch: Patch Coplanarity Prediction for Robust RGB-D Reconstruction
Stars: ✭ 105 (-2.78%)
Mutual labels:  deep-neural-networks
Faceswap
Deepfakes Software For All
Stars: ✭ 39,911 (+36854.63%)
Mutual labels:  deep-neural-networks
Neural Doodle
Turn your two-bit doodles into fine artworks with deep neural networks, generate seamless textures from photos, transfer style from one image to another, perform example-based upscaling, but wait... there's more! (An implementation of Semantic Style Transfer.)
Stars: ✭ 9,680 (+8862.96%)
Mutual labels:  deep-neural-networks
Ssd Pytorch
SSD: Single Shot MultiBox Detector pytorch implementation focusing on simplicity
Stars: ✭ 107 (-0.93%)
Mutual labels:  deep-neural-networks

Hysia Video to Retail Platform

* This project is supported by Cloud Application and Platform Lab led by Prof. Yonggang Wen

codebeat badge

An intelligent multimodal-learning based system for video, product and ads analysis. You can build various downstream applications with the system, such as product recommendation, video retrieval. Several examples are provided.

The system is under active development currently. You are welcome to create a issue, pull request here. We will credit them into our next version.

hysia-block-diagram

ShowcaseFeaturesSetup EnvironmentConfigurationDemoCitation

News

  • (2020-08) The work has been accepted as an open-source competation paper at ACMMM2020!
  • (2020-05) The docker image has been updated
  • (2020-05) You can easily bind your model to our system

Showcase

👉 Full list of showcase.

  1. Upload video and process it by selecting different models

    select-models

  2. Display video processing result

    display-analytic-result

  3. Search scene by image and text

    search-result

  4. Insert product advertisement and display insertion result

    view-ads

Features

  • Multimodal learning-based video analysis:
    • Scene / Object / Face detection and recognition
    • Multimodality data pre-processing
    • Results align and store
  • Downstream applications:
    • Intelligent ads insertion
    • Content-product match
  • Visualized testbed
    • Visualize multimodality results
    • Can be installed separately

Setup Environment

1. Download Data

👉 For ❌ no Google Drive access.

# Make sure this script is run from project root
bash scripts/download-data.sh

2. Installation

👉 Install with Docker 🐳

docker pull hysia/hysia:v2o

Configuration

Change decoder and model server running devices at device_placement.yml:

decoder: CPU
visual_model_server: CUDA:1
audio_model_server: CUDA:2
feature_model_server: CUDA:3
product_search_server: CUDA:2
scene_search_server: CUDA:3

Device value format: cpu, cuda or cuda:<int>.

Demo

Run with docker 🐳

docker run --rm \
  --gpus all -d -p 8000:8000 \
  -v ${PWD}/server/config/device_placement.yml:/content/server/config/device_placement.yml \
  hysia/hysia:v2o

Then you can go to http://localhost:8000. Use username: admin and password: admin to login.

Some Useful Tools

  • Large dataset preprocessing
  • Video/audio decoding
  • Model profiling
  • Multimodality data testbed

Contributing

You are welcome to contribute to Hysia! Please refer to here to get start.

Paper Citation

If you use Hysia in your work, we would be very grateful if you cite

@inproceedings{10.1145/3394171.3414536,
    author = {Zhang, Huaizheng and Li, Yuanming and Ai, Qiming and Luo, Yong and Wen, Yonggang and Jin, Yichao and Ta, Nguyen Binh Duong},
    title = {Hysia: Serving DNN-Based Video-to-Retail Applications in Cloud},
    year = {2020},
    booktitle = {Proceedings of the 28th ACM International Conference on Multimedia},
    pages = {4457–4460},
}

About Us

Maintainers

  • Huaizheng Zhang
  • Yuanming Li
  • Qiming Ai

Previous Contributors

  • Shengsheng Zhou
  • Wenbo Jiang (Now, Shopee)
  • Ziyuan Liu (Now, Tencent)
  • Yongjie Wang (Now, NTU PhD)
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].