Show-Attend-and-TellA PyTorch implementation of the paper Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Stars: ✭ 58 (-82.04%)
super-gradientsEasily train or fine-tune SOTA computer vision models with one open source training library
Stars: ✭ 429 (+32.82%)
CS231nMy solutions for Assignments of CS231n: Convolutional Neural Networks for Visual Recognition
Stars: ✭ 30 (-90.71%)
Awesome-CaptioningA curated list of Multimodal Captioning related research(including image captioning, video captioning, and text captioning)
Stars: ✭ 56 (-82.66%)
LaBERTA length-controllable and non-autoregressive image captioning model.
Stars: ✭ 50 (-84.52%)
HugsVisionHugsVision is a easy to use huggingface wrapper for state-of-the-art computer vision
Stars: ✭ 154 (-52.32%)
Tacotron2-PyTorchYet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.
Stars: ✭ 118 (-63.47%)
Bert SquadSQuAD Question Answering Using BERT, PyTorch
Stars: ✭ 256 (-20.74%)
open clipAn open source implementation of CLIP.
Stars: ✭ 1,534 (+374.92%)
Image-CaptionUsing LSTM or Transformer to solve Image Captioning in Pytorch
Stars: ✭ 36 (-88.85%)
EntityEntitySeg Toolbox: Towards Open-World and High-Quality Image Segmentation
Stars: ✭ 313 (-3.1%)
stylenetA pytorch implemention of "StyleNet: Generating Attractive Visual Captions with Styles"
Stars: ✭ 58 (-82.04%)
text classifierTensorflow2.3的文本分类项目,支持各种分类模型,支持相关tricks。
Stars: ✭ 135 (-58.2%)
Tensorflow Model Zoo.torchInceptionV3, InceptionV4, Inception-Resnet pretrained models for Torch7 and PyTorch
Stars: ✭ 280 (-13.31%)
Pytorch-NLUPytorch-NLU,一个中文文本分类、序列标注工具包,支持中文长文本、短文本的多类、多标签分类任务,支持中文命名实体识别、词性标注、分词等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech ta…
Stars: ✭ 151 (-53.25%)
ObjectNetPyTorch implementation of "Pyramid Scene Parsing Network".
Stars: ✭ 15 (-95.36%)
gramtionTwitter bot for generating photo descriptions (alt text)
Stars: ✭ 21 (-93.5%)
Open3d MlAn extension of Open3D to address 3D Machine Learning tasks
Stars: ✭ 284 (-12.07%)
Image-CaptioiningThe objective is to process by generating textual description from an image – based on the objects and actions in the image. Using generative models so that it creates novel sentences. Pipeline type models uses two separate learning process, one for language modelling and other for image recognition. It first identifies objects in image and prov…
Stars: ✭ 20 (-93.81%)
Machine-LearningThe projects I do in Machine Learning with PyTorch, keras, Tensorflow, scikit learn and Python.
Stars: ✭ 54 (-83.28%)
BIRADS classifierHigh-resolution breast cancer screening with multi-view deep convolutional neural networks
Stars: ✭ 122 (-62.23%)
roberta-wwm-base-distillthis is roberta wwm base distilled model which was distilled from roberta wwm by roberta wwm large
Stars: ✭ 61 (-81.11%)
regnet.pytorchPyTorch-style and human-readable RegNet with a spectrum of pre-trained models
Stars: ✭ 50 (-84.52%)
Mobilenetv3.pytorch74.3% MobileNetV3-Large and 67.2% MobileNetV3-Small model on ImageNet
Stars: ✭ 283 (-12.38%)
pigalleryPiGallery: AI-powered Self-hosted Secure Multi-user Image Gallery and Detailed Image analysis using Machine Learning, EXIF Parsing and Geo Tagging
Stars: ✭ 35 (-89.16%)
AdaptivePytorch Implementation of Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning
Stars: ✭ 97 (-69.97%)
syntaxdotNeural syntax annotator, supporting sequence labeling, lemmatization, and dependency parsing.
Stars: ✭ 32 (-90.09%)
MIACode for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)
Stars: ✭ 57 (-82.35%)
Gluon FaceAn unofficial Gluon FR Toolkit for face recognition. https://gluon-face.readthedocs.io
Stars: ✭ 264 (-18.27%)
concurrent-video-analytic-pipeline-optimization-sample-lCreate a concurrent video analysis pipeline featuring multistream face and human pose detection, vehicle attribute detection, and the ability to encode multiple videos to local storage in a single stream.
Stars: ✭ 39 (-87.93%)
finetunerFinetuning any DNN for better embedding on neural search tasks
Stars: ✭ 442 (+36.84%)
WARPCode for ACL'2021 paper WARP 🌀 Word-level Adversarial ReProgramming. Outperforming `GPT-3` on SuperGLUE Few-Shot text classification. https://aclanthology.org/2021.acl-long.381/
Stars: ✭ 66 (-79.57%)
ScanPyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
Stars: ✭ 306 (-5.26%)
safety-gear-detector-pythonObserve workers as they pass in front of a camera to determine if they have adequate safety protection.
Stars: ✭ 54 (-83.28%)
image-captioning-DLCTOfficial pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).
Stars: ✭ 134 (-58.51%)
CogViewText-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
Stars: ✭ 708 (+119.2%)
rl-trained-agentsA collection of pre-trained RL agents using Stable Baselines3
Stars: ✭ 47 (-85.45%)
Show and TellShow and Tell : A Neural Image Caption Generator
Stars: ✭ 74 (-77.09%)
Image CaptioningImage Captioning using InceptionV3 and beam search
Stars: ✭ 290 (-10.22%)
motor-defect-detector-pythonPredict performance issues with manufacturing equipment motors. Perform local or cloud analytics of the issues found, and then display the data on a user interface to determine when failures might arise.
Stars: ✭ 24 (-92.57%)
pptodMulti-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System (ACL 2022)
Stars: ✭ 77 (-76.16%)
PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-93.5%)
im2pTensorflow implement of paper: A Hierarchical Approach for Generating Descriptive Image Paragraphs
Stars: ✭ 43 (-86.69%)
RSTNetRSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words (CVPR 2021)
Stars: ✭ 71 (-78.02%)
Cs231Complete Assignments for CS231n: Convolutional Neural Networks for Visual Recognition
Stars: ✭ 317 (-1.86%)
AdaptiveattentionImplementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"
Stars: ✭ 303 (-6.19%)
SpleeterSpleeter is Deezer source separation library with pretrained models
written in Python and uses Tensorflow. It makes it easy
to train source separation model (assuming you have a dataset of isolated sources), and provides
already trained state of the art model for performing various flavour of separation :
Stars: ✭ 18,128 (+5512.38%)
captioning chainerA fast implementation of Neural Image Caption by Chainer
Stars: ✭ 17 (-94.74%)
CPPE-DatasetCode for our paper CPPE - 5 (Medical Personal Protective Equipment), a new challenging object detection dataset
Stars: ✭ 42 (-87%)