KerasdeepspeechA Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Rnn ctcRecurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Icdar 2019 SroieICDAR 2019 Robust Reading Challenge on Scanned Receipts OCR and Information Extraction
Ctc pytorchCTC end -to-end ASR for timit and 863 corpus.
Tensorflowasr集成了Tensorflow 2版本的端到端语音识别模型,并且RTF(实时率)在0.1左右/Mandarin State-of-the-art Automatic Speech Recognition in Tensorflow 2
SightseqComputer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection
Text recognition toolboxtext_recognition_toolbox: The reimplementation of a series of classical scene text recognition papers with Pytorch in a uniform way.
Casr Demo基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。
Lstm Ctc Ocrusing rnn (lstm or gru) and ctc to convert line image into text, based on torch7 and warp-ctc
Caffe ocr主流ocr算法研究实验性的项目,目前实现了CNN+BLSTM+CTC架构
EesenThe official repository of the Eesen project
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
CtcdecoderConnectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Neural spEnd-to-end ASR/LM implementation with PyTorch
Tensorflowasr⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
CtcwordbeamsearchConnectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
MegreaderA research project for text detection and recognition using PyTorch 1.2.
CrnnA TensorFlow implementation of https://github.com/bgshih/crnn
torch-asgAuto Segmentation Criterion (ASG) implemented in pytorch
CRNN.tf2Convolutional Recurrent Neural Network(CRNN) for End-to-End Text Recognition - TensorFlow 2
ctc-asrEnd-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Multimodal-Gesture-Recognition-with-LSTMs-and-CTCAn end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.