PLSCPaddle Large Scale Classification Tools,supports ArcFace, CosFace, PartialFC, Data Parallel + Model Parallel. Model includes ResNet, ViT, DeiT, FaceViT.
Stars: ✭ 113 (+222.86%)
TensorrtxImplementation of popular deep learning networks with TensorRT network definition API
Stars: ✭ 3,456 (+9774.29%)
KerasdeepspeechA Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+600%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (+2008.57%)
ctc-asrEnd-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (+220%)
CRNN.tf2Convolutional Recurrent Neural Network(CRNN) for End-to-End Text Recognition - TensorFlow 2
Stars: ✭ 131 (+274.29%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+1448.57%)
torch-asgAuto Segmentation Criterion (ASG) implemented in pytorch
Stars: ✭ 42 (+20%)
SightseqComputer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection
Stars: ✭ 116 (+231.43%)
CrnnA TensorFlow implementation of https://github.com/bgshih/crnn
Stars: ✭ 287 (+720%)
MiniVoxCode for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
Stars: ✭ 15 (-57.14%)
Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+1065.71%)
Deepstream ProjectThis is a highly separated deployment project based on Deepstream , including the full range of Yolo and continuously expanding deployment projects such as Ocr.
Stars: ✭ 120 (+242.86%)
SAN[ECCV 2020] Scale Adaptive Network: Learning to Learn Parameterized Classification Networks for Scalable Input Images
Stars: ✭ 41 (+17.14%)
InsightFace-RESTInsightFace REST API for easy deployment of face recognition services with TensorRT in Docker.
Stars: ✭ 308 (+780%)
spokestack-tray-androidA UI component that makes it easy to add voice interaction to your app.
Stars: ✭ 13 (-62.86%)
FaceRecognitionCppLarge input size REAL-TIME Face Detector on Cpp. It can also support face verification using MobileFaceNet+Arcface with real-time inference. 480P Over 30FPS on CPU
Stars: ✭ 40 (+14.29%)
SE-Net-CIFARSE-Net Incorporates with ResNet and WideResnet on CIFAR-10/100 Dataset.
Stars: ✭ 48 (+37.14%)
LibtorchTutorialsThis is a code repository for pytorch c++ (or libtorch) tutorial.
Stars: ✭ 463 (+1222.86%)
kospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (+1202.86%)
PCPMPresenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-40%)
awesome-computer-vision-modelsA list of popular deep learning models related to classification, segmentation and detection problems
Stars: ✭ 419 (+1097.14%)
video featuresExtract video features from raw videos using multiple GPUs. We support RAFT and PWC flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, ResNet features.
Stars: ✭ 225 (+542.86%)
AESRC2020Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC).
Stars: ✭ 40 (+14.29%)
wavenet-classifierKeras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
Stars: ✭ 54 (+54.29%)
sparsezooNeural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
Stars: ✭ 264 (+654.29%)
DMPfoldDe novo protein structure prediction using iteratively predicted structural constraints
Stars: ✭ 52 (+48.57%)
avsr-tf1Audio-Visual Speech Recognition using Sequence to Sequence Models
Stars: ✭ 76 (+117.14%)
myG2PMyanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS).
Stars: ✭ 43 (+22.86%)
wenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 2,384 (+6711.43%)
klaamArabic speech recognition, classification and text-to-speech.
Stars: ✭ 151 (+331.43%)
medium blogsmedium blog supplementaries | Backprop | Resnet & ResNext | RNN |
Stars: ✭ 69 (+97.14%)
KoLMKorean text normalization and language preparation package for LM in Kaldi-based ASR system
Stars: ✭ 46 (+31.43%)
BankCard-RecognizerIdentifying numbers from bankcard, based on Deep Learning with Keras [China Software Cup 2019]
Stars: ✭ 74 (+111.43%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-40%)
crnn.mxnetcrnn in mxnet.can train with chinese characters
Stars: ✭ 47 (+34.29%)
PiwhoSpeaker recognition library based on MARF for raspberry pi and other SBCs.
Stars: ✭ 50 (+42.86%)
rnn benchmarksRNN benchmarks of pytorch, tensorflow and theano
Stars: ✭ 85 (+142.86%)
gluon2pytorchGluon to PyTorch deep neural network model converter
Stars: ✭ 72 (+105.71%)