DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Stars: ✭ 18,680 (+1255.59%)

Mutual labels: speech-recognition, speech-to-text

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Stars: ✭ 52 (-96.23%)

Mutual labels: speech-synthesis, text-to-speech

kim-voice-assistant

Kim，你的私人语音助理。

Stars: ✭ 70 (-94.92%)

Mutual labels: speech-recognition, speech-to-text

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (-91.07%)

Mutual labels: speech-recognition, speech-to-text

Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Stars: ✭ 22 (-98.4%)

Mutual labels: text-to-speech, speech-synthesis

voicekit-examples

Examples on how to use Tinkoff Voicekit

Stars: ✭ 35 (-97.46%)

Mutual labels: speech-synthesis, speech-recognition

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (-79.75%)

Mutual labels: speech-synthesis, text-to-speech

Phonetisaurus

Phonetisaurus G2P

Stars: ✭ 277 (-79.9%)

Mutual labels: speech-recognition, speech-to-text

Alan Sdk Ionic

Alan AI Ionic SDK adds a voice assistant or chatbot to your app. Supports React, Angular.

Stars: ✭ 287 (-79.17%)

Mutual labels: speech-recognition, text-to-speech

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (-94.63%)

Mutual labels: text-to-speech, speech-synthesis

Alan Sdk Flutter

Alan AI Flutter SDK adds a voice assistant or chatbot to your app.

Stars: ✭ 309 (-77.58%)

Mutual labels: speech-recognition, text-to-speech

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (-77.36%)

Mutual labels: speech-synthesis, text-to-speech

Wav2letter

Speech Recognition model based off of FAIR research paper built using Pytorch.

Stars: ✭ 78 (-94.34%)

Mutual labels: speech-recognition, speech-to-text

Nnmnkwii

Library to build speech synthesis systems designed for easy and fast prototyping.

Stars: ✭ 308 (-77.65%)

Mutual labels: speech-synthesis, text-to-speech

Pytorch Chatbot

Pytorch seq2seq chatbot

Stars: ✭ 336 (-75.62%)

Mutual labels: seq2seq, sequence-to-sequence

Multilingual text to speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Stars: ✭ 324 (-76.49%)

Mutual labels: speech-synthesis, text-to-speech

Machine Translation

Stars: ✭ 51 (-96.3%)

Mutual labels: seq2seq, sequence-to-sequence

Neural-HMM

Neural HMMs are all you need (for high-quality attention-free TTS)

Stars: ✭ 69 (-94.99%)

Mutual labels: text-to-speech, speech-synthesis

Brevitas

Brevitas: quantization-aware training in PyTorch

Stars: ✭ 343 (-75.11%)

Mutual labels: speech-recognition, text-to-speech

Tensorflow end2end speech recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

Stars: ✭ 305 (-77.87%)

Mutual labels: speech-recognition, speech-to-text

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (-76.42%)

Mutual labels: speech-synthesis, text-to-speech

Espeak

eSpeak NG is an open source speech synthesizer that supports 101 languages and accents.

Stars: ✭ 339 (-75.4%)

Mutual labels: speech-synthesis, text-to-speech

Vosk Api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Stars: ✭ 1,357 (-1.52%)

Mutual labels: speech-recognition, speech-to-text

Zamia Speech

Open tools and data for cloudless automatic speech recognition

Stars: ✭ 374 (-72.86%)

Mutual labels: speech-recognition, language-model

Alan Sdk Web

Alan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.

Stars: ✭ 368 (-73.29%)

Mutual labels: speech-recognition, text-to-speech

Textnormalizationcoveringgrammars

Covering grammars for English and Russian text normalization

Stars: ✭ 46 (-96.66%)

Mutual labels: speech-recognition, text-to-speech

Espnet

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (+228.96%)

Mutual labels: speech-recognition, speech-synthesis

Cheetah

On-device streaming speech-to-text engine powered by deep learning

Stars: ✭ 383 (-72.21%)

Mutual labels: speech-recognition, speech-to-text

Neuralmonkey

An open-source tool for sequence learning in NLP built on TensorFlow.

Stars: ✭ 400 (-70.97%)

Mutual labels: neural-machine-translation, sequence-to-sequence

Ctcwordbeamsearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.

Stars: ✭ 398 (-71.12%)

Mutual labels: speech-recognition, language-model

Tensorflowasr

⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

Stars: ✭ 400 (-70.97%)

Mutual labels: speech-recognition, speech-to-text

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (-73.73%)

Mutual labels: speech-synthesis, text-to-speech

Awesome Kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Stars: ✭ 393 (-71.48%)

Mutual labels: speech-recognition, speech-to-text

Rhino

On-device speech-to-intent engine powered by deep learning

Stars: ✭ 406 (-70.54%)

Mutual labels: speech-recognition, speech-to-text

Asrt speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Stars: ✭ 4,943 (+258.71%)

Mutual labels: speech-recognition, speech-to-text

Speech To Text Benchmark

speech to text benchmark framework

Stars: ✭ 481 (-65.09%)

Mutual labels: speech-recognition, speech-to-text

Speech Demo

语音api示例

Stars: ✭ 454 (-67.05%)

Mutual labels: speech-recognition, speech-to-text

Nlp Library

curated collection of papers for the nlp practitioner 📖👩‍🔬

Stars: ✭ 1,025 (-25.62%)

Mutual labels: language-model, neural-machine-translation

Voice Overlay Ios

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 440 (-68.07%)

Mutual labels: speech-recognition, speech-to-text

Nmt Keras

Neural Machine Translation with Keras

Stars: ✭ 501 (-63.64%)

Mutual labels: neural-machine-translation, sequence-to-sequence

Nativescript Speech Recognition

💬 Speech to text, using the awesome engines readily available on the device.

Stars: ✭ 72 (-94.78%)

Mutual labels: speech-recognition, speech-to-text

Ctcdecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.

Stars: ✭ 529 (-61.61%)

Mutual labels: speech-recognition, language-model

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (-62.12%)

Mutual labels: speech-recognition, speech-to-text

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (-61.39%)

Mutual labels: speech-recognition, speech-to-text

Nmt List

A list of Neural MT implementations

Stars: ✭ 359 (-73.95%)

Mutual labels: neural-machine-translation, sequence-to-sequence

Joeynmt

Minimalist NMT for educational purposes

Stars: ✭ 420 (-69.52%)

Mutual labels: seq2seq, neural-machine-translation

Seq2seq.pytorch

Sequence-to-Sequence learning using PyTorch

Stars: ✭ 514 (-62.7%)

Mutual labels: seq2seq, neural-machine-translation

Tacotron2

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".

Stars: ✭ 43 (-96.88%)

Mutual labels: speech-synthesis, text-to-speech

61-120 of 950 similar projects

‹

›

next*5