[Paper 2020] Towards Better Accuracy-efficiency Trade-offs: Divide and Co-training. Also an image classification toolbox that includes ResNet, Wide-ResNet, ResNeXt, ResNeSt, ResNeXSt, SENet, Shake-Shake, DenseNet, PyramidNet, and EfficientNet.

Stars: ✭ 54 (-5.26%)

Mutual labels: imagenet

py-faster-rcnn-imagenet

Train Faster R-CNN on the ImageNet dataset. Related blog post: https://andrewliao11.github.io/object/detection/2016/07/23/detection/

Stars: ✭ 133 (+133.33%)

Mutual labels: imagenet

Reproducibilty-Challenge-ECANET

Unofficial Implementation of ECANets (CVPR 2020) for the Reproducibility Challenge 2020.

Stars: ✭ 27 (-52.63%)

Mutual labels: image-recognition

BAKE

Self-distillation with Batch Knowledge Ensembling Improves ImageNet Classification

Stars: ✭ 79 (+38.6%)

Mutual labels: imagenet

Ssd.pytorch

A PyTorch Implementation of Single Shot MultiBox Detector

Stars: ✭ 4,499 (+7792.98%)

Mutual labels: image-recognition

torchsubband

PyTorch implementation of subband decomposition

Stars: ✭ 63 (+10.53%)

Mutual labels: speech-recognition

kosr

Transformer-based Korean speech recognition

Stars: ✭ 25 (-56.14%)

Mutual labels: speech-recognition

ASR-Audio-Data-Links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (+214.04%)

Mutual labels: speech-recognition

Espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Stars: ✭ 808 (+1317.54%)

Mutual labels: speech-recognition

cozmo-tensorflow

🤖 Cozmo the Robot recognizes objects with TensorFlow

Stars: ✭ 61 (+7.02%)

Mutual labels: imagenet

CvT

This is an official implementation of CvT: Introducing Convolutions to Vision Transformers.

Stars: ✭ 262 (+359.65%)

Mutual labels: imagenet

visualsearch

Visual Search is a little app to find and cluster similar images using Tagbox

Stars: ✭ 31 (-45.61%)

Mutual labels: image-recognition

Msdnet

Multi-Scale Dense Networks for Resource Efficient Image Classification (ICLR 2018 Oral)

Stars: ✭ 443 (+677.19%)

Mutual labels: imagenet

wav2vec2-live

Live speech recognition using Facebook's wav2vec 2.0 model.

Stars: ✭ 205 (+259.65%)

Mutual labels: speech-recognition

TriangleGAN

TriangleGAN, ACM MM 2019.

Stars: ✭ 28 (-50.88%)

Mutual labels: image-recognition

sharpmask

TensorFlow implementation of DeepMask and SharpMask

Stars: ✭ 31 (-45.61%)

Mutual labels: image-recognition

Constrained attention filter

(ECCV 2020) TensorFlow implementation of A Generic Visualization Approach for Convolutional Neural Networks

Stars: ✭ 36 (-36.84%)

Mutual labels: imagenet

good-speech-web-client

Practice your speaking skills in any language using speech recognition

Stars: ✭ 26 (-54.39%)

Mutual labels: speech-recognition

Contactless-Attendance-System

✨ A Contactless Attendance System where your face is identified for Attendance.

Stars: ✭ 20 (-64.91%)

Mutual labels: image-recognition

obvi

A Polymer 3+ webcomponent / button for doing speech recognition

Stars: ✭ 54 (-5.26%)

Mutual labels: speech-recognition

Class Balanced Loss

Class-Balanced Loss Based on Effective Number of Samples. CVPR 2019

Stars: ✭ 433 (+659.65%)

Mutual labels: imagenet

TF-Speech-Recognition-Challenge-Solution

Source code of the model used in the TensorFlow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in the top 5% on the private leaderboard.

Stars: ✭ 58 (+1.75%)

Mutual labels: speech-recognition

aws-rekognition

A Laravel Package/Facade for the AWS Rekognition API

Stars: ✭ 20 (-64.91%)

Mutual labels: image-recognition

TextNormalizationCoveringGrammars

Covering grammars for English and Russian text normalization

Stars: ✭ 60 (+5.26%)

Mutual labels: speech-recognition

Pytorch image classification

PyTorch implementation of image classification models for CIFAR-10/CIFAR-100/MNIST/FashionMNIST/Kuzushiji-MNIST/ImageNet

Stars: ✭ 795 (+1294.74%)

Mutual labels: imagenet

object-flaw-detector-cpp

Detect various irregularities of a product as it moves along a conveyor belt.

Stars: ✭ 19 (-66.67%)

Mutual labels: image-recognition

GFNet

[NeurIPS 2021] Global Filter Networks for Image Classification

Stars: ✭ 199 (+249.12%)

Mutual labels: image-recognition

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (-63.16%)

Mutual labels: speech-recognition

Asrt speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System

Stars: ✭ 4,943 (+8571.93%)

Mutual labels: speech-recognition

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (+47.37%)

Mutual labels: speech-recognition

Image-Detection-Samples

This sample app supports the presentations "Building a MVP with Face recognition and AR" and "Quest of a Hero part 2". It offers two ways to build a face detection mechanism: the first is based on OpenCV and the second uses the Camera 2 API.

Stars: ✭ 36 (-36.84%)

Mutual labels: image-recognition

nested-transformer

Nested Hierarchical Transformer https://arxiv.org/pdf/2105.12723.pdf

Stars: ✭ 174 (+205.26%)

Mutual labels: imagenet

Keras Sincnet

Keras (TensorFlow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Stars: ✭ 47 (-17.54%)

Mutual labels: speech-recognition

EffcientNetV2

EfficientNetV2 implementation using PyTorch

Stars: ✭ 94 (+64.91%)

Mutual labels: imagenet

CCAligner

🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.

Stars: ✭ 131 (+129.82%)

Mutual labels: speech-recognition

Automatic speech recognition

End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow

Stars: ✭ 2,751 (+4726.32%)

Mutual labels: speech-recognition

Rhino

On-device speech-to-intent engine powered by deep learning

Stars: ✭ 406 (+612.28%)

Mutual labels: speech-recognition

ocr recognition

Uses Java, OpenCV, and Tesseract OCR to detect and recognize words in images; uses Python to generate jTessBoxEditor training box files.

Stars: ✭ 52 (-8.77%)

Mutual labels: image-recognition

Benchmarking Keras Pytorch

🔥 Reproducibly benchmarking Keras and PyTorch models

Stars: ✭ 346 (+507.02%)

Mutual labels: imagenet

concurrent-video-analytic-pipeline-optimization-sample-l

Create a concurrent video analysis pipeline featuring multistream face and human pose detection, vehicle attribute detection, and the ability to encode multiple videos to local storage in a single stream.

Stars: ✭ 39 (-31.58%)

Mutual labels: image-recognition

Tensorflow-Keyword-Spotting

Keyword spotting using various architectures, such as a convolutional VGGNet, a 1D convolutional network, and CTC.

Stars: ✭ 27 (-52.63%)

Mutual labels: speech-recognition

rosecho

Tianbot Rosecho (Tianecho), a Chinese voice human-machine interaction module with plug-and-play ROS support

Stars: ✭ 28 (-50.88%)

Mutual labels: speech-recognition

Imagenet

A trial of Kaggle ImageNet object localization using YOLOv3 on Google Cloud