Source code of the model used in the TensorFlow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in the top 5% on the private leaderboard.

Stars: ✭ 58 (-63.75%)

Mutual labels: speech-recognition

TextNormalizationCoveringGrammars

Covering grammars for English and Russian text normalization

Stars: ✭ 60 (-62.5%)

Mutual labels: speech-recognition

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (-86.87%)

Mutual labels: speech-recognition

UHV-OTS-Speech

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Stars: ✭ 94 (-41.25%)

Mutual labels: speech-recognition

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (-47.5%)

Mutual labels: speech-recognition

Automatic speech recognition

End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow

Stars: ✭ 2,751 (+1619.38%)

Mutual labels: speech-recognition

Speech recognition with tensorflow

Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.

Stars: ✭ 253 (+58.13%)

Mutual labels: speech-recognition

Cn2an

📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)

Stars: ✭ 249 (+55.63%)

Mutual labels: speech-recognition

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain, users can easily create speech processing systems for speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many other tasks.

Stars: ✭ 242 (+51.25%)

Mutual labels: speech-recognition

Chinese text normalization

Chinese text normalization for speech processing

Stars: ✭ 242 (+51.25%)

Mutual labels: speech-recognition

Nemo

NeMo: a toolkit for conversational AI

Stars: ✭ 3,685 (+2203.13%)

Mutual labels: speech-recognition

Rnn ctc

Recurrent Neural Network and Long Short-Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a toy training example.

Stars: ✭ 220 (+37.5%)

Mutual labels: speech-recognition

Dragonfly

Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx

Stars: ✭ 209 (+30.63%)

Mutual labels: speech-recognition

Edgedict

Working online speech recognition based on RNN Transducer. (Trained model available in the releases.)

Stars: ✭ 205 (+28.13%)

Mutual labels: speech-recognition

K6nele

An Android app that offers speech-to-text services and user interfaces to other apps

Stars: ✭ 196 (+22.5%)

Mutual labels: speech-recognition

Dictate.js

A small JavaScript library for browser-based real-time speech recognition. It uses Recorderjs for audio capture and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

Stars: ✭ 195 (+21.88%)

Mutual labels: speech-recognition

Lingvo

Stars: ✭ 2,361 (+1375.63%)

Mutual labels: speech-recognition

Automatic Speech Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

Stars: ✭ 192 (+20%)

Mutual labels: speech-recognition

Speechtotext Websockets Javascript

SDK and sample for speech recognition using WebSockets in JavaScript

Stars: ✭ 191 (+19.38%)