The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+1052.38%)

Mutual labels: speech-recognition, speech-to-text

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (+0%)

Mutual labels: speech-recognition, speech-to-text

Awesome Ai Services

An overview of the AI-as-a-service landscape

Stars: ✭ 133 (+533.33%)

Mutual labels: speech-recognition, speech-to-text

Go Astideepspeech

Golang bindings for Mozilla's DeepSpeech speech-to-text library

Stars: ✭ 137 (+552.38%)

Mutual labels: speech-recognition, speech-to-text

Unity live caption

Use Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!

Stars: ✭ 26 (+23.81%)

Mutual labels: speech-recognition, speech-to-text

Speech To Text Russian

Проект для распознавания речи на русском языке на основе pykaldi.

Stars: ✭ 151 (+619.05%)

Mutual labels: speech-recognition, speech-to-text

scripty

Speech to text bot for Discord using Mozilla's DeepSpeech

Stars: ✭ 14 (-33.33%)

Mutual labels: speech-recognition, speech-to-text

Lingvo

Stars: ✭ 2,361 (+11142.86%)

Mutual labels: speech-recognition, speech-to-text

Nemo

NeMo: a toolkit for conversational AI

Stars: ✭ 3,685 (+17447.62%)

Mutual labels: speech-recognition, speech-to-text

Vosk

VOSK Speech Recognition Toolkit

Stars: ✭ 182 (+766.67%)

Mutual labels: speech-recognition, speech-to-text

leopard

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (+1585.71%)

Mutual labels: speech-recognition, speech-to-text

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+3904.76%)

Mutual labels: speech-recognition, speech-to-text

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (+152.38%)

Mutual labels: speech-recognition, speech-to-text

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (+0%)

Mutual labels: speech-recognition, speech-to-text

octopus

On-device speech-to-index engine powered by deep learning.

Stars: ✭ 30 (+42.86%)

Mutual labels: speech-recognition, speech-to-text

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

Stars: ✭ 25 (+19.05%)

Mutual labels: speech-recognition, speech-to-text

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (+28.57%)

Mutual labels: speech-recognition, speech-to-text

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (+509.52%)

Mutual labels: speech-recognition, speech-to-text

Tensorflow Ctc Speech Recognition

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).

Stars: ✭ 127 (+504.76%)

Mutual labels: speech-recognition, speech-to-text

Speechrecognizerbutton

UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.

Stars: ✭ 144 (+585.71%)

Mutual labels: speech-recognition, speech-to-text

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+53000%)

Mutual labels: speech-recognition, speech-to-text

Tacotron asr

Speech Recognition Using Tacotron

Stars: ✭ 165 (+685.71%)

Mutual labels: speech-recognition, speech-to-text

Hey Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Stars: ✭ 161 (+666.67%)

Mutual labels: speech-recognition, speech-to-text

Chinese-automatic-speech-recognition

Chinese speech recognition

Stars: ✭ 147 (+600%)

Mutual labels: speech-recognition, speech-to-text

Kalliope

Kalliope is a framework that will help you to create your own personal assistant.

Stars: ✭ 1,509 (+7085.71%)

Mutual labels: speech-recognition, speech-to-text

Automatic Speech Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

Stars: ✭ 192 (+814.29%)

Mutual labels: speech-recognition, speech-to-text

Voice Overlay Android

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 189 (+800%)

Mutual labels: speech-recognition, speech-to-text

Dictate.js

A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

Stars: ✭ 195 (+828.57%)

Mutual labels: speech-recognition, speech-to-text

Tensorflow Speech Recognition

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

Stars: ✭ 2,118 (+9985.71%)

Mutual labels: speech-recognition, speech-to-text

Rnn ctc

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Stars: ✭ 220 (+947.62%)

Mutual labels: speech-recognition, speech-to-text

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (+876.19%)

Mutual labels: speech-recognition, speech-to-text

Speech recognition with tensorflow

Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.

Stars: ✭ 253 (+1104.76%)

Mutual labels: speech-recognition, speech-to-text

Self Supervised Speech Recognition

speech to text with self-supervised learning based on wav2vec 2.0 framework

Stars: ✭ 106 (+404.76%)

Mutual labels: speech-recognition, speech-to-text

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (+752.38%)

Mutual labels: speech-recognition, speech-to-text

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (+876.19%)

Mutual labels: speech-recognition, speech-to-text

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (+138.1%)

Mutual labels: speech-recognition, speech-to-text

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Stars: ✭ 21 (+0%)

Mutual labels: speech-recognition, speech-to-text

Inimesed

An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.

Stars: ✭ 65 (+209.52%)

Mutual labels: speech-recognition, speech-to-text

React.ai

It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux

Stars: ✭ 38 (+80.95%)

Mutual labels: speech-recognition, speech-to-text

speechmatics-python

Python library and CLI for Speechmatics

Stars: ✭ 24 (+14.29%)

Mutual labels: speech-recognition, speech-to-text

deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (+290.48%)

Mutual labels: speech-recognition, speech-to-text

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.