All Projects → LAVT-pytorch → Similar Projects or Alternatives

41 Open source projects that are alternatives of or similar to LAVT-pytorch

MVGL
TCyb 2018: Graph learning for multiview clustering
Stars: ✭ 26 (+62.5%)
Mutual labels:  multimodal
NER-Multimodal-pytorch
Pytorch Implementation of "Adaptive Co-attention Network for Named Entity Recognition in Tweets" (AAAI 2018)
Stars: ✭ 42 (+162.5%)
Mutual labels:  multimodal
HugsVision
HugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision
Stars: ✭ 154 (+862.5%)
Mutual labels:  state-of-the-art
docarray
The data structure for unstructured data
Stars: ✭ 561 (+3406.25%)
Mutual labels:  multimodal
hardware-attacks-state-of-the-art
Microarchitectural exploitation and other hardware attacks.
Stars: ✭ 29 (+81.25%)
Mutual labels:  state-of-the-art
Diverse-Structure-Inpainting
CVPR 2021: "Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE"
Stars: ✭ 131 (+718.75%)
Mutual labels:  multimodal
Recommender-Systems-with-Collaborative-Filtering-and-Deep-Learning-Techniques
Implemented User Based and Item based Recommendation System along with state of the art Deep Learning Techniques
Stars: ✭ 41 (+156.25%)
Mutual labels:  state-of-the-art
CompareModels TRECQA
Compare six baseline deep learning models on TrecQA
Stars: ✭ 61 (+281.25%)
Mutual labels:  state-of-the-art
WearableSensorData
This repository provides the codes and data used in our paper "Human Activity Recognition Based on Wearable Sensor Data: A Standardization of the State-of-the-Art", where we implement and evaluate several state-of-the-art approaches, ranging from handcrafted-based methods to convolutional neural networks.
Stars: ✭ 65 (+306.25%)
Mutual labels:  state-of-the-art
delving-deeper-into-the-decoder-for-video-captioning
Source code for Delving Deeper into the Decoder for Video Captioning
Stars: ✭ 36 (+125%)
Mutual labels:  state-of-the-art
RSTNet
RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words (CVPR 2021)
Stars: ✭ 71 (+343.75%)
Mutual labels:  multimodal
best AI papers 2021
A curated list of the latest breakthroughs in AI (in 2021) by release date with a clear video explanation, link to a more in-depth article, and code.
Stars: ✭ 2,740 (+17025%)
Mutual labels:  state-of-the-art
nemar
[CVPR2020] Unsupervised Multi-Modal Image Registration via Geometry Preserving Image-to-Image Translation
Stars: ✭ 120 (+650%)
Mutual labels:  multimodal
Deep-multimodal-subspace-clustering-networks
Tensorflow implementation of "Deep Multimodal Subspace Clustering Networks"
Stars: ✭ 62 (+287.5%)
Mutual labels:  multimodal
berserker
Berserker - BERt chineSE woRd toKenizER
Stars: ✭ 17 (+6.25%)
Mutual labels:  state-of-the-art
iMIX
A framework for Multimodal Intelligence research from Inspur HSSLAB.
Stars: ✭ 21 (+31.25%)
Mutual labels:  multimodal
strollr2d icassp2017
Image Denoising Codes using STROLLR learning, the Matlab implementation of the paper in ICASSP2017
Stars: ✭ 22 (+37.5%)
Mutual labels:  state-of-the-art
lipnet
LipNet with gluon
Stars: ✭ 16 (+0%)
Mutual labels:  multimodal
SCINet
Forecast time series and stock prices with SCINet
Stars: ✭ 28 (+75%)
Mutual labels:  state-of-the-art
img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Stars: ✭ 1,173 (+7231.25%)
Mutual labels:  multimodal
fairytale
encode.ru community archiver
Stars: ✭ 29 (+81.25%)
Mutual labels:  state-of-the-art
gakg
GAKG is a multimodal Geoscience Academic Knowledge Graph (GAKG) framework by fusing papers' illustrations, text, and bibliometric data.
Stars: ✭ 21 (+31.25%)
Mutual labels:  multimodal
mix-stage
Official Repository for the paper Style Transfer for Co-Speech Gesture Animation: A Multi-Speaker Conditional-Mixture Approach published in ECCV 2020 (https://arxiv.org/abs/2007.12553)
Stars: ✭ 22 (+37.5%)
Mutual labels:  multimodal
ViNet
ViNet Pushing the limits of Visual Modality for Audio Visual Saliency Prediction
Stars: ✭ 36 (+125%)
Mutual labels:  state-of-the-art
salt iccv2017
SALT (iccv2017) based Video Denoising Codes, Matlab implementation
Stars: ✭ 26 (+62.5%)
Mutual labels:  state-of-the-art
MinkLocMultimodal
MinkLoc++: Lidar and Monocular Image Fusion for Place Recognition
Stars: ✭ 65 (+306.25%)
Mutual labels:  multimodal
pytorch-multimodal sarcasm detection
It is the implementation of paper "Multi-Modal Sarcasm Detection in Twitter with Hierarchical Fusion Model"
Stars: ✭ 3 (-81.25%)
Mutual labels:  multimodal
Kaleido-BERT
(CVPR2021) Kaleido-BERT: Vision-Language Pre-training on Fashion Domain.
Stars: ✭ 252 (+1475%)
Mutual labels:  multimodal
pykale
Knowledge-Aware machine LEarning (KALE): accessible machine learning from multiple sources for interdisciplinary research, part of the 🔥PyTorch ecosystem
Stars: ✭ 381 (+2281.25%)
Mutual labels:  multimodal
Fengshenbang-LM
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
Stars: ✭ 1,813 (+11231.25%)
Mutual labels:  multimodal
VideoNavQA
An alternative EQA paradigm and informative benchmark + models (BMVC 2019, ViGIL 2019 spotlight)
Stars: ✭ 22 (+37.5%)
Mutual labels:  multimodal
tsflex
Flexible time series feature extraction & processing
Stars: ✭ 252 (+1475%)
Mutual labels:  multimodal
StateArts
Intellij plugin that creates state machine diagram from state machine
Stars: ✭ 87 (+443.75%)
Mutual labels:  state-of-the-art
Best ai paper 2020
A curated list of the latest breakthroughs in AI by release date with a clear video explanation, link to a more in-depth article, and code
Stars: ✭ 2,140 (+13275%)
Mutual labels:  state-of-the-art
Reproducible Image Denoising State Of The Art
Collection of popular and reproducible image denoising works.
Stars: ✭ 1,776 (+11000%)
Mutual labels:  state-of-the-art
Hetu
A high-performance distributed deep learning system targeting large-scale and automated distributed training.
Stars: ✭ 78 (+387.5%)
Mutual labels:  state-of-the-art
BeatNet
This repository contains the implementation of the AI-based "BeatNet" Joint beat, downbeat, tempo, and meter tracking system using CRNN and particle filtering. 2021's state-of-the-art online model - (ISMIR 2021).
Stars: ✭ 56 (+250%)
Mutual labels:  state-of-the-art
Mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Stars: ✭ 4,713 (+29356.25%)
Mutual labels:  multimodal
clip-guided-diffusion
A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.
Stars: ✭ 260 (+1525%)
Mutual labels:  multimodal
Modality-Transferable-MER
Modality-Transferable-MER, multimodal emotion recognition model with zero-shot and few-shot abilities.
Stars: ✭ 36 (+125%)
Mutual labels:  multimodal
slp
Utils and modules for Speech Language and Multimodal processing using pytorch and pytorch lightning
Stars: ✭ 17 (+6.25%)
Mutual labels:  multimodal
1-41 of 41 similar projects