MVGL
TCyb 2018: Graph learning for multiview clustering
Stars: ✭ 26 (+62.5%)
NER-Multimodal-pytorch
PyTorch implementation of "Adaptive Co-attention Network for Named Entity Recognition in Tweets" (AAAI 2018)
Stars: ✭ 42 (+162.5%)
HugsVision
HugsVision is an easy-to-use Hugging Face wrapper for state-of-the-art computer vision
Stars: ✭ 154 (+862.5%)
docarray
The data structure for unstructured data
Stars: ✭ 561 (+3406.25%)
WearableSensorData
This repository provides the code and data used in our paper "Human Activity Recognition Based on Wearable Sensor Data: A Standardization of the State-of-the-Art", where we implement and evaluate several state-of-the-art approaches, ranging from handcrafted-feature methods to convolutional neural networks.
Stars: ✭ 65 (+306.25%)
RSTNet
RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words (CVPR 2021)
Stars: ✭ 71 (+343.75%)
best AI papers 2021
A curated list of the latest breakthroughs in AI (in 2021) by release date with a clear video explanation, link to a more in-depth article, and code.
Stars: ✭ 2,740 (+17025%)
nemar
[CVPR2020] Unsupervised Multi-Modal Image Registration via Geometry Preserving Image-to-Image Translation
Stars: ✭ 120 (+650%)
berserker
Berserker - BERt chineSE woRd toKenizER
Stars: ✭ 17 (+6.25%)
iMIX
A framework for Multimodal Intelligence research from Inspur HSSLAB.
Stars: ✭ 21 (+31.25%)
strollr2d icassp2017
Image denoising code using STROLLR learning, the MATLAB implementation of the ICASSP 2017 paper
Stars: ✭ 22 (+37.5%)
lipnet
LipNet with Gluon
Stars: ✭ 16 (+0%)
SCINet
Forecast time series and stock prices with SCINet
Stars: ✭ 28 (+75%)
img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize and package 100M URLs in 20h on one machine.
Stars: ✭ 1,173 (+7231.25%)
fairytale
encode.ru community archiver
Stars: ✭ 29 (+81.25%)
gakg
GAKG is a multimodal Geoscience Academic Knowledge Graph (GAKG) framework built by fusing papers' illustrations, text, and bibliometric data.
Stars: ✭ 21 (+31.25%)
mix-stage
Official repository for the paper "Style Transfer for Co-Speech Gesture Animation: A Multi-Speaker Conditional-Mixture Approach", published in ECCV 2020 (https://arxiv.org/abs/2007.12553)
Stars: ✭ 22 (+37.5%)
ViNet
ViNet: Pushing the limits of Visual Modality for Audio-Visual Saliency Prediction
Stars: ✭ 36 (+125%)
salt iccv2017
SALT (ICCV 2017) based video denoising code, MATLAB implementation
Stars: ✭ 26 (+62.5%)
MinkLocMultimodal
MinkLoc++: Lidar and Monocular Image Fusion for Place Recognition
Stars: ✭ 65 (+306.25%)
Kaleido-BERT
(CVPR2021) Kaleido-BERT: Vision-Language Pre-training on Fashion Domain.
Stars: ✭ 252 (+1475%)
pykale
Knowledge-Aware machine LEarning (KALE): accessible machine learning from multiple sources for interdisciplinary research, part of the 🔥PyTorch ecosystem
Stars: ✭ 381 (+2281.25%)
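Fengshenbang-LM
Fengshenbang-LM (封神榜大模型) is an open-source large-model ecosystem led by the Cognitive Computing and Natural Language Research Center at the IDEA Research Institute, intended to serve as infrastructure for Chinese AIGC and cognitive intelligence.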
Stars: ✭ 1,813 (+11231.25%)
VideoNavQA
An alternative EQA paradigm and informative benchmark + models (BMVC 2019, ViGIL 2019 spotlight)
Stars: ✭ 22 (+37.5%)
tsflex
Flexible time series feature extraction & processing
Stars: ✭ 252 (+1475%)
StateArts
IntelliJ plugin that creates a state machine diagram from a state machine
Stars: ✭ 87 (+443.75%)
Best ai paper 2020
A curated list of the latest breakthroughs in AI by release date with a clear video explanation, link to a more in-depth article, and code
Stars: ✭ 2,140 (+13275%)
Hetu
A high-performance distributed deep learning system targeting large-scale and automated distributed training.
Stars: ✭ 78 (+387.5%)
BeatNet
This repository contains the implementation of the AI-based "BeatNet" joint beat, downbeat, tempo, and meter tracking system using CRNN and particle filtering. 2021's state-of-the-art online model (ISMIR 2021).
Stars: ✭ 56 (+250%)
Mmf
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Stars: ✭ 4,713 (+29356.25%)
clip-guided-diffusion
A CLI tool/Python module for generating images from text using guided diffusion and CLIP from OpenAI.
Stars: ✭ 260 (+1525%)
Modality-Transferable-MER
Modality-Transferable-MER, a multimodal emotion recognition model with zero-shot and few-shot abilities.
Stars: ✭ 36 (+125%)
slp
Utils and modules for Speech, Language and Multimodal processing using PyTorch and PyTorch Lightning
Stars: ✭ 17 (+6.25%)