Bottom Up Attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
Stars: ✭ 989 (+1495.16%)
AoA-pytorch
A PyTorch implementation of the Attention on Attention module (both self and guided variants), for Visual Question Answering
Stars: ✭ 33 (-46.77%)
just-ask
[TPAMI Special Issue on ICCV 2021 Best Papers, Oral] Just Ask: Learning to Answer Questions from Millions of Narrated Videos
Stars: ✭ 57 (-8.06%)
FigureQA-baseline
TensorFlow implementation of the CNN-LSTM, Relation Network and text-only baselines for the paper "FigureQA: An Annotated Figure Dataset for Visual Reasoning"
Stars: ✭ 28 (-54.84%)
self critical vqa
Code for the NeurIPS 2019 paper "Self-Critical Reasoning for Robust Visual Question Answering"
Stars: ✭ 39 (-37.1%)
opensmile
The Munich Open-Source Large-Scale Multimedia Feature Extractor
Stars: ✭ 280 (+351.61%)
Feature-based-opinion-mining
Extracts all the features of a product from its reviews, scores each feature based on the user reviews, and ranks the reviews by their usefulness
Stars: ✭ 40 (-35.48%)
Speech Feature Extraction
Feature extraction from the speech signal is the initial stage of any speech recognition system.
Stars: ✭ 78 (+25.81%)
CFUN
Combining Faster R-CNN and U-net for efficient medical image segmentation
Stars: ✭ 109 (+75.81%)
object-tracking
Multiple Object Tracking System in Keras + (Detection Network - YOLO)
Stars: ✭ 89 (+43.55%)
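keras-faster-rcnn
Keras implementation of Faster R-CNN with end-to-end training and prediction; under active development, see the todo list. Feel free to try it out, follow the project, and report issues.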
Stars: ✭ 85 (+37.1%)
imsearch
Framework to build your own reverse image search engine
Stars: ✭ 64 (+3.23%)
gan tensorflow
Automatic feature engineering with Generative Adversarial Networks in TensorFlow.
Stars: ✭ 48 (-22.58%)
autoencoders tensorflow
Automatic feature engineering with deep learning and Bayesian inference in TensorFlow.
Stars: ✭ 66 (+6.45%)
antropy
AntroPy: entropy and complexity of (EEG) time-series in Python
Stars: ✭ 111 (+79.03%)
Faster-RCNN-LocNet
A simplified implementation of the paper "Improved Localization Accuracy by LocNet for Faster R-CNN Based Text Detection"
Stars: ✭ 25 (-59.68%)
NTFk.jl
Unsupervised Machine Learning: Nonnegative Tensor Factorization + k-means clustering
Stars: ✭ 36 (-41.94%)
lung-image-analysis
A basic framework for pulmonary nodule detection and characterization in CT
Stars: ✭ 26 (-58.06%)
rosita
ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration
Stars: ✭ 36 (-41.94%)
DSeg
Invariant Superpixel Features for Object Detection
Stars: ✭ 18 (-70.97%)
GeobitNonrigidDescriptor ICCV 2019
C++ implementation of the nonrigid descriptor Geobit presented at ICCV 2019: "GEOBIT: A Geodesic-Based Binary Descriptor Invariant to Non-Rigid Deformations for RGB-D Images"
Stars: ✭ 11 (-82.26%)
pyHSICLasso
Versatile Nonlinear Feature Selection Algorithm for High-dimensional Data
Stars: ✭ 125 (+101.61%)
Depth-VRD
Improving Visual Relation Detection using Depth Maps (ICPR 2020)
Stars: ✭ 33 (-46.77%)
50-days-of-Statistics-for-Data-Science
This repository consists of a 50-day program. All the statistics required for a complete understanding of data science will be uploaded to this repository.
Stars: ✭ 19 (-69.35%)
Deep-Learning
This repo provides projects on deep learning, mainly using TensorFlow 2.0
Stars: ✭ 22 (-64.52%)
lbjava
Learning Based Java (LBJava)
Stars: ✭ 12 (-80.65%)
tf-faster-rcnn
TensorFlow 2 Faster R-CNN implementation from scratch, supporting batch processing with MobileNetV2 and VGG16 backbones
Stars: ✭ 88 (+41.94%)
featurewiz
Use advanced feature engineering strategies and select the best features from your data set with a single line of code.
Stars: ✭ 229 (+269.35%)
VisDrone2018
ECCV 2018 (Challenge: Object Detection in Images)
Stars: ✭ 86 (+38.71%)
Bag-of-Visual-Words
🎒 Bag of Visual Words (BoW) approach for object classification and detection in images, together with a SIFT feature extractor and an SVM classifier.
Stars: ✭ 39 (-37.1%)
sourceafis-net
Fingerprint recognition engine for .NET that takes a pair of human fingerprint images and returns their similarity score. Supports efficient 1:N search.
Stars: ✭ 43 (-30.65%)
probnmn-clevr
Code for the ICML 2019 paper "Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering" [long-oral]
Stars: ✭ 63 (+1.61%)
towhee
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Stars: ✭ 821 (+1224.19%)
ezSIFT
ezSIFT: An easy-to-use standalone SIFT library written in C/C++
Stars: ✭ 80 (+29.03%)
DVQA dataset
DVQA Dataset: A bar chart question answering dataset presented at CVPR 2018
Stars: ✭ 20 (-67.74%)
detect-shortcuts
Repo for the ICCV 2021 paper "Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering"
Stars: ✭ 17 (-72.58%)
MixingBear
Package for automatic beat-mixing of music files in Python 🐻🎚
Stars: ✭ 73 (+17.74%)
mildnet
Visual Similarity research at Fynd. Contains code to reproduce 2 of our research papers.
Stars: ✭ 76 (+22.58%)
faster rcnn
Another PyTorch implementation of Faster R-CNN.
Stars: ✭ 24 (-61.29%)
iMIX
A framework for Multimodal Intelligence research from Inspur HSSLAB.
Stars: ✭ 21 (-66.13%)
VCML
PyTorch implementation of the paper "Visual Concept-Metaconcept Learner", NeurIPS 2019
Stars: ✭ 45 (-27.42%)
Real-Time-Object-Detection-API-using-TensorFlow
A transfer-learning-based object detection API that detects all objects in an image, video or live webcam feed. An SSD model and a Faster R-CNN model were pretrained on the MobileNet COCO dataset along with a label map in TensorFlow. These models were used to detect objects captured in an image, video or real-time webcam feed. OpenCV was used for streaming obj…
Stars: ✭ 50 (-19.35%)
fastknn
Fast k-Nearest Neighbors Classifier for Large Datasets
Stars: ✭ 64 (+3.23%)
KVQA
Korean Visual Question Answering
Stars: ✭ 44 (-29.03%)
vqa-soft
Accompanying code for the CVPR 2017 VQA workshop paper "A Simple Loss Function for Improving the Convergence and Accuracy of Visual Question Answering Models".
Stars: ✭ 14 (-77.42%)