Top 10 multi-modal open source projects

Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch

✭ 3,661

python deep-learning artificial-intelligence attention-mechanism transformers multi-modal text-to-image

Open Source Routing Engine for OpenStreetMap

✭ 1,794

C++ CMake python shell openstreetmap dijkstra routing-engine directions routing astar traveling-salesman isochrones multi-modal tiled

Multi-Modal-Transformer

This repository collects multi-modal transformer architectures, including image, video, image-language, and video-language transformers, along with related datasets. It also collects useful tutorials and tools in these domains.

✭ 61

multi-modal image-transformer vision-transformer video-language efficiency-transformer video-transformer mlp-mixer transformer-readling-list multi-modal-cvpr2021

iPerceive

OASIS

Official implementation of the paper "You Only Need Adversarial Supervision for Semantic Image Synthesis" (ICLR 2021)

✭ 232

python shell machine-learning computer-vision deep-learning pytorch gan image-generation multi-modal generative-adversarial-networks oasis image-to-image-translation bcai semantic-image-synthesis iclr2021 label-to-image-translation

nemar

[CVPR2020] Unsupervised Multi-Modal Image Registration via Geometry Preserving Image-to-Image Translation

✭ 120

python shell matlab deep-learning cnn pytorch multi-modal image-registration affine-transformation stn image-to-image-translation multimodal deformable-transformation multi-modal-learning cvpr2020 registartion multimodal-image-registration

EGSC-IT

TensorFlow implementation of the ICLR 2019 paper "Exemplar Guided Unsupervised Image-to-Image Translation with Semantic Consistency"

✭ 29

python shell tensorflow multi-modal image-translation mode-collapse semantic-consistency

TRAR-VQA

[ICCV 2021] TRAR: Routing the Attention Spans in Transformers for Visual Question Answering -- Official Implementation

✭ 49

python visualization pytorch transformer attention official multi-modal clevr visual-question-answering vision-and-language dynamic-network multi-modality multi-modal-learning multi-scale-features vqav2 iccv2021 local-and-global

MMTOD

Multi-modal Thermal Object Detector

✭ 38

python Cuda c Jupyter Notebook C++ matlab shell pytorch faster-rcnn multi-modal borrow-from-anywhere

skill-sample-nodejs-berry-bash

Demonstrates the use of interactive render template directives through multi-modal screen design.

✭ 22

javascript alexa multi-modal nodejs-sdk-v2
