Gis
gis (go image server): an image service implemented in Go that provides basic upload, download, storage, and proportional cropping.
Stars: ✭ 108 (+89.47%)
Coco Cn
Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks
Stars: ✭ 57 (+0%)
lang2seg
Referring Expression Object Segmentation with Caption-Aware Consistency, BMVC 2019
Stars: ✭ 30 (-47.37%)
Image Caption Generator
[DEPRECATED] A neural-network-based generative model for captioning images using TensorFlow
Stars: ✭ 141 (+147.37%)
Self Critical.pytorch
Unofficial PyTorch implementation of Self-critical Sequence Training for Image Captioning, and others.
Stars: ✭ 716 (+1156.14%)
udacity-cvnd-projects
My solutions to the projects assigned for the Udacity Computer Vision Nanodegree
Stars: ✭ 36 (-36.84%)
CBP
Official TensorFlow implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware Prediction"
Stars: ✭ 52 (-8.77%)
Punny captions
An implementation of the NAACL 2018 paper "Punny Captions: Witty Wordplay in Image Descriptions".
Stars: ✭ 31 (-45.61%)
Image Captioning
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
Stars: ✭ 171 (+200%)
Virtex
[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations
Stars: ✭ 323 (+466.67%)
X-VLM
X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)
Stars: ✭ 283 (+396.49%)
iMIX
A framework for Multimodal Intelligence research from Inspur HSSLAB.
Stars: ✭ 21 (-63.16%)
Medical Report Generation
A PyTorch implementation of "On the Automatic Generation of Medical Imaging Reports".
Stars: ✭ 100 (+75.44%)
Image captioning
Generate captions for images using a CNN-RNN model trained on the Microsoft Common Objects in COntext (MS COCO) dataset.
Stars: ✭ 51 (-10.53%)
Aoanet
Code for the paper "Attention on Attention for Image Captioning", ICCV 2019
Stars: ✭ 242 (+324.56%)
Neural Image Captioning
Implementation of a Neural Image Captioning model using Keras with the Theano backend
Stars: ✭ 12 (-78.95%)
BUTD model
A PyTorch implementation of "Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering" for image captioning.
Stars: ✭ 28 (-50.88%)
Neuralmonkey
An open-source tool for sequence learning in NLP built on TensorFlow.
Stars: ✭ 400 (+601.75%)
Image To Image Search
A reverse image search engine powered by Elasticsearch and TensorFlow
Stars: ✭ 200 (+250.88%)
Cs231
Complete Assignments for CS231n: Convolutional Neural Networks for Visual Recognition
Stars: ✭ 317 (+456.14%)
calvin
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
Stars: ✭ 105 (+84.21%)
Show Adapt And Tell
Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" (ICCV 2017)
Stars: ✭ 146 (+156.14%)
LaBERT
A length-controllable and non-autoregressive image captioning model.
Stars: ✭ 50 (-12.28%)
Image Caption Generator
A neural network that generates captions for an image using a CNN and an RNN with beam search.
Stars: ✭ 126 (+121.05%)
pytorch violet
A PyTorch implementation of VIOLET
Stars: ✭ 119 (+108.77%)
Sightseq
Computer vision tools for fairseq, containing PyTorch implementations of text recognition and object detection
Stars: ✭ 116 (+103.51%)
clip playground
An ever-growing playground of notebooks showcasing CLIP's impressive zero-shot capabilities
Stars: ✭ 80 (+40.35%)
Video2description
Video to Text: generates a natural-language description for a given video (Video Captioning)
Stars: ✭ 107 (+87.72%)
catr
Image Captioning Using Transformer
Stars: ✭ 206 (+261.4%)
Arnet
CVPR 2018 - Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present
Stars: ✭ 94 (+64.91%)
VidSitu
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
Stars: ✭ 41 (-28.07%)
CS231n
CS231n Assignments Solutions - Spring 2020
Stars: ✭ 48 (-15.79%)
Cameramanager
Simple Swift class that provides all the configuration you need to create a custom camera view in your app
Stars: ✭ 1,130 (+1882.46%)
synse-zsl
Official PyTorch code for the ICIP 2021 paper 'Syntactically Guided Generative Embeddings For Zero Shot Skeleton Action Recognition'
Stars: ✭ 14 (-75.44%)
Image Captioning
Image Captioning: Implementing the Neural Image Caption Generator with Python
Stars: ✭ 52 (-8.77%)
Show Control And Tell
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
Stars: ✭ 243 (+326.32%)
Bottom Up Attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Stars: ✭ 989 (+1635.09%)
wikiHow paper list
A paper list of research conducted based on wikiHow
Stars: ✭ 25 (-56.14%)
Im2p
TensorFlow implementation of the paper "A Hierarchical Approach for Generating Descriptive Image Paragraphs"
Stars: ✭ 15 (-73.68%)
Caption generator
A modular library built on top of Keras and TensorFlow to generate a caption in natural language for any input image.
Stars: ✭ 243 (+326.32%)
Show and Tell
Show and Tell: A Neural Image Caption Generator
Stars: ✭ 74 (+29.82%)
Omninet
Official PyTorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain
Stars: ✭ 448 (+685.96%)
Dataturks
ML data annotation made super easy for teams. Just upload data, add your team, and build training/evaluation datasets in hours.
Stars: ✭ 200 (+250.88%)
Oscar
Oscar and VinVL
Stars: ✭ 396 (+594.74%)
TRAR-VQA
[ICCV 2021] TRAR: Routing the Attention Spans in Transformers for Visual Question Answering -- Official Implementation
Stars: ✭ 49 (-14.04%)
Sca Cnn.cvpr17
Image Captions Generation with Spatial and Channel-wise Attention
Stars: ✭ 198 (+247.37%)
gramtion
Twitter bot for generating photo descriptions (alt text)
Stars: ✭ 21 (-63.16%)
Image-Captioining
The objective is to generate a textual description of an image based on the objects and actions in it, using generative models so that novel sentences are created. Pipeline-type models use two separate learning processes, one for language modelling and the other for image recognition. It first identifies objects in image and prov…
Stars: ✭ 20 (-64.91%)
stanford-cs231n-assignments-2020
This repository contains my solutions to the assignments for Stanford's CS231n "Convolutional Neural Networks for Visual Recognition" (Spring 2020).
Stars: ✭ 84 (+47.37%)
Udacity
This repo includes all the projects I have finished in the Udacity Nanodegree programs
Stars: ✭ 57 (+0%)
Up Down Captioner
Automatic image captioning model based on Caffe, using features from bottom-up attention.
Stars: ✭ 195 (+242.11%)