All Projects → MIA → Similar Projects or Alternatives

80 Open source projects that are alternatives of or similar to MIA

gis (go image server) go 实现的图片服务，实现基本的上传，下载，存储，按比例裁剪等功能

Stars: ✭ 108 (+89.47%)

Mutual labels: image-captioning

Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks

Stars: ✭ 57 (+0%)

Mutual labels: image-captioning

Referring Expression Object Segmentation with Caption-Aware Consistency, BMVC 2019

Stars: ✭ 30 (-47.37%)

Mutual labels: vision-and-language

Image Caption Generator

[DEPRECATED] A Neural Network based generative model for captioning images using Tensorflow

Stars: ✭ 141 (+147.37%)

Mutual labels: image-captioning

Self Critical.pytorch

Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.

Stars: ✭ 716 (+1156.14%)

Mutual labels: image-captioning

udacity-cvnd-projects

My solutions to the projects assigned for the Udacity Computer Vision Nanodegree

Stars: ✭ 36 (-36.84%)

Mutual labels: image-captioning

Transformer image caption

Image Captioning based on Bottom-Up and Top-Down Attention model

Stars: ✭ 94 (+64.91%)

Mutual labels: image-captioning

Official Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware Prediction"

Stars: ✭ 52 (-8.77%)

Mutual labels: vision-and-language

An implementation of the NAACL 2018 paper "Punny Captions: Witty Wordplay in Image Descriptions".

Stars: ✭ 31 (-45.61%)

Mutual labels: image-captioning

Meshed Memory Transformer

Meshed-Memory Transformer for Image Captioning. CVPR 2020

Stars: ✭ 230 (+303.51%)

Mutual labels: image-captioning

Image Captioning

Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]

Stars: ✭ 171 (+200%)

Mutual labels: image-captioning

[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations

Stars: ✭ 323 (+466.67%)

Mutual labels: image-captioning

X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)

Stars: ✭ 283 (+396.49%)

Mutual labels: vision-and-language

A Pytorch Tutorial To Image Captioning

Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning

Stars: ✭ 1,867 (+3175.44%)

Mutual labels: image-captioning

A framework for Multimodal Intelligence research from Inspur HSSLAB.

Stars: ✭ 21 (-63.16%)

Mutual labels: vision-and-language

Medical Report Generation

A pytorch implementation of On the Automatic Generation of Medical Imaging Reports.

Stars: ✭ 100 (+75.44%)

Mutual labels: image-captioning

Image-Captioning-with-Beam-Search

Generating image captions using Xception Network and Beam Search in Keras

Stars: ✭ 18 (-68.42%)

Mutual labels: image-captioning

Image Text Papers

Image Caption and Text to Image papers.

Stars: ✭ 71 (+24.56%)

Mutual labels: image-captioning

pix2code-pytorch

PyTorch implementation of pix2code. 🔥

Stars: ✭ 24 (-57.89%)

Mutual labels: image-captioning

Image captioning

generate captions for images using a CNN-RNN model that is trained on the Microsoft Common Objects in COntext (MS COCO) dataset

Stars: ✭ 51 (-10.53%)

Mutual labels: image-captioning

Code for paper "Attention on Attention for Image Captioning". ICCV 2019

Stars: ✭ 242 (+324.56%)

Mutual labels: image-captioning

Neural Image Captioning

Implementation of Neural Image Captioning model using Keras with Theano backend

Stars: ✭ 12 (-78.95%)

Mutual labels: image-captioning

A pytorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning.

Stars: ✭ 28 (-50.88%)

Mutual labels: image-captioning

An open-source tool for sequence learning in NLP built on TensorFlow.

Stars: ✭ 400 (+601.75%)

Mutual labels: image-captioning

Image To Image Search

A reverse image search engine powered by elastic search and tensorflow

Stars: ✭ 200 (+250.88%)

Mutual labels: image-captioning

Fairseq Image Captioning

Transformer-based image captioning extension for pytorch/fairseq

Stars: ✭ 180 (+215.79%)

Mutual labels: image-captioning

Complete Assignments for CS231n: Convolutional Neural Networks for Visual Recognition

Stars: ✭ 317 (+456.14%)

Mutual labels: image-captioning

CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks

Stars: ✭ 105 (+84.21%)

Mutual labels: vision-and-language

Show Adapt And Tell

Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017

Stars: ✭ 146 (+156.14%)

Mutual labels: image-captioning

A length-controllable and non-autoregressive image captioning model.

Stars: ✭ 50 (-12.28%)

Mutual labels: image-captioning

Image Caption Generator

A neural network to generate captions for an image using CNN and RNN with BEAM Search.

Stars: ✭ 126 (+121.05%)

Mutual labels: image-captioning

A PyTorch implementation of VIOLET

Stars: ✭ 119 (+108.77%)

Mutual labels: vision-and-language

Computer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection

Stars: ✭ 116 (+103.51%)

Mutual labels: image-captioning

clip playground

An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities

Stars: ✭ 80 (+40.35%)

Mutual labels: vision-and-language

Video2description

Video to Text: Generates description in natural language for given video (Video Captioning)

Stars: ✭ 107 (+87.72%)

Mutual labels: image-captioning

Image Captioning Using Transformer

Stars: ✭ 206 (+261.4%)

Mutual labels: image-captioning

CVPR 2018 - Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present

Stars: ✭ 94 (+64.91%)

Mutual labels: image-captioning

[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)

Stars: ✭ 41 (-28.07%)

Mutual labels: vision-and-language

Automatic Image Captioning

Generating Captions for images using Deep Learning

Stars: ✭ 84 (+47.37%)

Mutual labels: image-captioning

CS231n Assignments Solutions - Spring 2020

Stars: ✭ 48 (-15.79%)

Mutual labels: image-captioning

Simple Swift class to provide all the configurations you need to create custom camera view in your app

Stars: ✭ 1,130 (+1882.46%)

Mutual labels: image-captioning

Official PyTorch code for the ICIP 2021 paper 'Syntactically Guided Generative Embeddings For Zero Shot Skeleton Action Recognition'

Stars: ✭ 14 (-75.44%)

Mutual labels: vision-and-language

Image Captioning

Image Captioning: Implementing the Neural Image Caption Generator with python

Stars: ✭ 52 (-8.77%)

Mutual labels: image-captioning

Show Control And Tell

Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019

Stars: ✭ 243 (+326.32%)

Mutual labels: image-captioning

Bottom Up Attention

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome

Stars: ✭ 989 (+1635.09%)

Mutual labels: image-captioning

wikiHow paper list

A paper list of research conducted based on wikiHow

Stars: ✭ 25 (-56.14%)

Mutual labels: vision-and-language

Tensorflow implementation of paper: A Hierarchical Approach for Generating Descriptive Image Paragraphs

Stars: ✭ 15 (-73.68%)

Mutual labels: image-captioning

Caption generator

A modular library built on top of Keras and TensorFlow to generate a caption in natural language for any input image.

Stars: ✭ 243 (+326.32%)

Mutual labels: image-captioning

Show Attend And Tell

TensorFlow Implementation of "Show, Attend and Tell"

Stars: ✭ 869 (+1424.56%)

Mutual labels: image-captioning

Show and Tell : A Neural Image Caption Generator

Stars: ✭ 74 (+29.82%)

Mutual labels: image-captioning

Official Pytorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain

Stars: ✭ 448 (+685.96%)

Mutual labels: image-captioning

ML data annotations made super easy for teams. Just upload data, add your team and build training/evaluation dataset in hours.

Stars: ✭ 200 (+250.88%)

Mutual labels: image-captioning

Oscar and VinVL

Stars: ✭ 396 (+594.74%)

Mutual labels: image-captioning

[ICCV 2021] TRAR: Routing the Attention Spans in Transformers for Visual Question Answering -- Official Implementation

Stars: ✭ 49 (-14.04%)

Mutual labels: vision-and-language

Image Captions Generation with Spatial and Channel-wise Attention

Stars: ✭ 198 (+247.37%)

Mutual labels: image-captioning

Twitter bot for generating photo descriptions (alt text)

Stars: ✭ 21 (-63.16%)

Mutual labels: image-captioning

Image-Captioining

The objective is to process by generating textual description from an image – based on the objects and actions in the image. Using generative models so that it creates novel sentences. Pipeline type models uses two separate learning process, one for language modelling and other for image recognition. It first identifies objects in image and prov…

Stars: ✭ 20 (-64.91%)

Mutual labels: image-captioning

stanford-cs231n-assignments-2020

This repository contains my solutions to the assignments for Stanford's CS231n "Convolutional Neural Networks for Visual Recognition" (Spring 2020).

Stars: ✭ 84 (+47.37%)

Mutual labels: vision-and-language

This repo includes all the projects I have finished in the Udacity Nanodegree programs

Stars: ✭ 57 (+0%)

Mutual labels: image-captioning

Up Down Captioner

Automatic image captioning model based on Caffe, using features from bottom-up attention.

Stars: ✭ 195 (+242.11%)

Mutual labels: image-captioning

1-60 of 80 similar projects