DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Stars: ✭ 18,680 (+103677.78%)

Mutual labels: speech-recognition

Flowavenet

A Pytorch implementation of "FloWaveNet: A Generative Flow for Raw Audio"

Stars: ✭ 471 (+2516.67%)

Mutual labels: wavenet

Specaugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Stars: ✭ 408 (+2166.67%)

Mutual labels: speech-recognition

Alan Sdk Android

Alan AI Android SDK adds a voice assistant or chatbot to your app. Supports Java, Kotlin.

Stars: ✭ 278 (+1444.44%)

Mutual labels: speech-recognition

Wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 617 (+3327.78%)

Mutual labels: speech-recognition

Ctcwordbeamsearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.

Stars: ✭ 398 (+2111.11%)

Mutual labels: speech-recognition

Adapt

Adapt Intent Parser

Stars: ✭ 690 (+3733.33%)

Mutual labels: speech-recognition

Cheetah

On-device streaming speech-to-text engine powered by deep learning

Stars: ✭ 383 (+2027.78%)

Mutual labels: speech-recognition

Ctcdecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.

Stars: ✭ 529 (+2838.89%)

Mutual labels: speech-recognition

Espnet

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (+25083.33%)

Mutual labels: speech-recognition

Sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

Stars: ✭ 764 (+4144.44%)

Mutual labels: speech-recognition

Time Series Prediction

A collection of time series prediction methods: rnn, seq2seq, cnn, wavenet, transformer, unet, n-beats, gan, kalman-filter

Stars: ✭ 351 (+1850%)

Mutual labels: wavenet

Mycroft Precise

A lightweight, simple-to-use, RNN wake word listener

Stars: ✭ 481 (+2572.22%)

Mutual labels: speech-recognition

Alan Sdk Ios

Alan AI iOS SDK adds a voice assistant or chatbot to your app. Supports Swift, Objective-C.

Stars: ✭ 318 (+1666.67%)

Mutual labels: speech-recognition

Speech recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Stars: ✭ 5,999 (+33227.78%)

Mutual labels: speech-recognition

Pocketsphinx Python

Python interface to CMU Sphinxbase and Pocketsphinx libraries

Stars: ✭ 298 (+1555.56%)

Mutual labels: speech-recognition

Speech Demo

语音api示例

Stars: ✭ 454 (+2422.22%)

Mutual labels: speech-recognition

Asrt speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Stars: ✭ 4,943 (+27361.11%)

Mutual labels: speech-recognition

Vosk Server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

Stars: ✭ 277 (+1438.89%)

Mutual labels: speech-recognition

Vad

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stars: ✭ 622 (+3355.56%)

Mutual labels: speech-recognition

Rhino

On-device speech-to-intent engine powered by deep learning

Stars: ✭ 406 (+2155.56%)

Mutual labels: speech-recognition

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+34433.33%)

Mutual labels: speech-recognition

Tensorflowasr

⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

Stars: ✭ 400 (+2122.22%)

Mutual labels: speech-recognition

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+2911.11%)

Mutual labels: speech-recognition

Free Spoken Digit Dataset

A free audio dataset of spoken digits. Think MNIST for audio.

Stars: ✭ 396 (+2100%)

Mutual labels: speech-recognition

Stephanie Va

Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.

Stars: ✭ 772 (+4188.89%)

Mutual labels: speech-recognition

Nmtpytorch

Sequence-to-Sequence Framework in PyTorch

Stars: ✭ 392 (+2077.78%)

Mutual labels: speech-recognition

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (+2855.56%)

Mutual labels: speech-recognition

Zamia Speech

Open tools and data for cloudless automatic speech recognition

Stars: ✭ 374 (+1977.78%)

Mutual labels: speech-recognition

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (+3688.89%)

Mutual labels: wavenet

Alan Sdk Web

Alan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.

Stars: ✭ 368 (+1944.44%)

Mutual labels: speech-recognition

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (+2800%)

Mutual labels: speech-recognition

Deepspeech Examples

Examples of how to use or integrate DeepSpeech

Stars: ✭ 356 (+1877.78%)

Mutual labels: speech-recognition

Kur

Descriptive Deep Learning

Stars: ✭ 811 (+4405.56%)

Mutual labels: speech-recognition

Libfaceid

libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.

Stars: ✭ 354 (+1866.67%)

Mutual labels: speech-recognition

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+2622.22%)

Mutual labels: speech-recognition

Brevitas

Brevitas: quantization-aware training in PyTorch

Stars: ✭ 343 (+1805.56%)

Mutual labels: speech-recognition

Awesome Diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Stars: ✭ 673 (+3638.89%)

Mutual labels: speech-recognition

J.a.r.v.i.s

python powered Intelligent System