slpUtils and modules for Speech Language and Multimodal processing using pytorch and pytorch lightning
Stars: ✭ 17 (+466.67%)
Mutual labels: multimodal
tsflexFlexible time series feature extraction & processing
Stars: ✭ 252 (+8300%)
Mutual labels: multimodal
RSTNetRSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words (CVPR 2021)
Stars: ✭ 71 (+2266.67%)
Mutual labels: multimodal
MVGLTCyb 2018: Graph learning for multiview clustering
Stars: ✭ 26 (+766.67%)
Mutual labels: multimodal
clip-guided-diffusionA CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.
Stars: ✭ 260 (+8566.67%)
Mutual labels: multimodal
lipnetLipNet with gluon
Stars: ✭ 16 (+433.33%)
Mutual labels: multimodal
pykaleKnowledge-Aware machine LEarning (KALE): accessible machine learning from multiple sources for interdisciplinary research, part of the 🔥PyTorch ecosystem
Stars: ✭ 381 (+12600%)
Mutual labels: multimodal
LAVT-pytorchLAVT: Language-Aware Vision Transformer for Referring Image Segmentation
Stars: ✭ 16 (+433.33%)
Mutual labels: multimodal
ArSarcasmThis repository contains the Arabic sarcasm dataset (ArSarcasm)
Stars: ✭ 18 (+500%)
Mutual labels: sarcasm-detection
Diverse-Structure-InpaintingCVPR 2021: "Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE"
Stars: ✭ 131 (+4266.67%)
Mutual labels: multimodal
NER-Multimodal-pytorchPytorch Implementation of "Adaptive Co-attention Network for Named Entity Recognition in Tweets" (AAAI 2018)
Stars: ✭ 42 (+1300%)
Mutual labels: multimodal
MmfA modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Stars: ✭ 4,713 (+157000%)
Mutual labels: multimodal
nemar[CVPR2020] Unsupervised Multi-Modal Image Registration via Geometry Preserving Image-to-Image Translation
Stars: ✭ 120 (+3900%)
Mutual labels: multimodal
VideoNavQAAn alternative EQA paradigm and informative benchmark + models (BMVC 2019, ViGIL 2019 spotlight)
Stars: ✭ 22 (+633.33%)
Mutual labels: multimodal
iMIXA framework for Multimodal Intelligence research from Inspur HSSLAB.
Stars: ✭ 21 (+600%)
Mutual labels: multimodal
Modality-Transferable-MERModality-Transferable-MER, multimodal emotion recognition model with zero-shot and few-shot abilities.
Stars: ✭ 36 (+1100%)
Mutual labels: multimodal
Kaleido-BERT(CVPR2021) Kaleido-BERT: Vision-Language Pre-training on Fashion Domain.
Stars: ✭ 252 (+8300%)
Mutual labels: multimodal
Fengshenbang-LMFengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
Stars: ✭ 1,813 (+60333.33%)
Mutual labels: multimodal