End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-98.64%)

Mutual labels: text-to-speech, speech-synthesis, speech-recognition, speech-to-text

Naomi

The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!

Stars: ✭ 171 (-95.36%)

Mutual labels: speech-recognition, text-to-speech, speech-synthesis, speech-to-text

Speech And Text

Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字（PocketSphinx、百度 API、科大讯飞 API）和文字转语音（pyttsx3）

Stars: ✭ 102 (-97.23%)

Mutual labels: speech-recognition, text-to-speech, speech-to-text

Kalliope

Kalliope is a framework that will help you to create your own personal assistant.

Stars: ✭ 1,509 (-59.05%)

Mutual labels: speech-recognition, speech-synthesis, speech-to-text

Wavegrad

Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.

Stars: ✭ 245 (-93.35%)

Mutual labels: jupyter-notebook, text-to-speech, speech-synthesis

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (-86.7%)

Mutual labels: speech-recognition, speech-synthesis, speech-to-text

Pytorch Dc Tts

Text to Speech with PyTorch (English and Mongolian)

Stars: ✭ 122 (-96.69%)

Mutual labels: jupyter-notebook, text-to-speech, speech-synthesis

speechrec

a simple speech recognition app using the Web Speech API Interfaces

Stars: ✭ 18 (-99.51%)

Mutual labels: speech-synthesis, speech-recognition, speech-to-text

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (-98.59%)

Mutual labels: text-to-speech, speech-synthesis, speech-recognition

Hey Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Stars: ✭ 161 (-95.63%)

Mutual labels: jupyter-notebook, speech-recognition, speech-to-text

Speech recognition with tensorflow

Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.

Stars: ✭ 253 (-93.13%)

Mutual labels: jupyter-notebook, speech-recognition, speech-to-text

Interspeech2019 Tutorial

INTERSPEECH 2019 Tutorial Materials

Stars: ✭ 160 (-95.66%)

Mutual labels: jupyter-notebook, speech-recognition, text-to-speech

Artyom.js

A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.

Stars: ✭ 1,011 (-72.56%)

Mutual labels: speech-recognition, speech-synthesis, speech-to-text

Espnet

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (+23.01%)

Mutual labels: speech-recognition, machine-translation, speech-synthesis

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (-85.83%)

Mutual labels: jupyter-notebook, speech-recognition, speech-to-text

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (-81.49%)

Mutual labels: jupyter-notebook, text-to-speech, speech-synthesis

Nmtpytorch

Sequence-to-Sequence Framework in PyTorch

Stars: ✭ 392 (-89.36%)

Mutual labels: jupyter-notebook, speech-recognition, nmt

Dragonfire

the open-source virtual assistant for Ubuntu based Linux distributions

Stars: ✭ 1,120 (-69.61%)

Mutual labels: speech-recognition, text-to-speech, speech-to-text

musicologist

Music advice from a conversational interface powered by Algolia

Stars: ✭ 19 (-99.48%)

Mutual labels: text-to-speech, speech-recognition, speech-to-text

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Stars: ✭ 52 (-98.59%)

Mutual labels: jupyter-notebook, text-to-speech, speech-synthesis

Zzz Retired openstt

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

Stars: ✭ 146 (-96.04%)

Mutual labels: speech-recognition, nlp-machine-learning, speech-to-text

Tacotron Pytorch

A Pytorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model

Stars: ✭ 104 (-97.18%)

Mutual labels: text-to-speech, speech-synthesis

Wav2letter.pytorch

A fully convolution-network for speech-to-text, built on pytorch.

Stars: ✭ 104 (-97.18%)

Mutual labels: speech-recognition, speech-to-text

Wavernn

WaveRNN Vocoder + TTS

Stars: ✭ 1,636 (-55.6%)

Mutual labels: text-to-speech, speech-synthesis

Self Supervised Speech Recognition

speech to text with self-supervised learning based on wav2vec 2.0 framework

Stars: ✭ 106 (-97.12%)

Mutual labels: speech-recognition, speech-to-text

Cross Lingual Voice Cloning

Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.

Stars: ✭ 106 (-97.12%)

Mutual labels: jupyter-notebook, text-to-speech

Crystal

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

Stars: ✭ 108 (-97.07%)

Mutual labels: text-to-speech, speech-synthesis

Bertqa Attention On Steroids

BertQA - Attention on Steroids

Stars: ✭ 112 (-96.96%)

Mutual labels: jupyter-notebook, nlp-machine-learning

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (-96.99%)

Mutual labels: text-to-speech, speech-synthesis

Go Astibob

Golang framework to build an AI that can understand and speak back to you, and everything else you want

Stars: ✭ 222 (-93.98%)

Mutual labels: text-to-speech, speech-to-text

Nonautoreggenprogress

Tracking the progress in non-autoregressive generation (translation, transcription, etc.)

Stars: ✭ 118 (-96.8%)

Mutual labels: speech-recognition, machine-translation

Nlp Models Tensorflow

Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0

Stars: ✭ 1,603 (-56.5%)

Mutual labels: jupyter-notebook, speech-to-text

Nlp Pretrained Model

A collection of Natural language processing pre-trained models.

Stars: ✭ 122 (-96.69%)

Mutual labels: nlp-machine-learning, text-to-speech

Tensorflow Ctc Speech Recognition

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).

Stars: ✭ 127 (-96.55%)

Mutual labels: speech-recognition, speech-to-text

Marytts

MARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java

Stars: ✭ 1,699 (-53.89%)

Mutual labels: text-to-speech, speech-synthesis

Alan Sdk Pcf

Alan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.

Stars: ✭ 128 (-96.53%)

Mutual labels: speech-recognition, text-to-speech

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+202.61%)

Mutual labels: speech-recognition, speech-to-text

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-96.53%)

Mutual labels: speech-recognition, speech-to-text

Seq2seq tutorial

Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"

Stars: ✭ 132 (-96.42%)

Mutual labels: jupyter-notebook, nlp-machine-learning

Diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Stars: ✭ 139 (-96.23%)

Mutual labels: text-to-speech, speech-synthesis

Go Astideepspeech

Golang bindings for Mozilla's DeepSpeech speech-to-text library

Stars: ✭ 137 (-96.28%)

Mutual labels: speech-recognition, speech-to-text

Wavegrad

A fast, high-quality neural vocoder.

Stars: ✭ 138 (-96.26%)

Mutual labels: text-to-speech, speech-synthesis

Tensorflowtts

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

Stars: ✭ 2,382 (-35.36%)

Mutual labels: text-to-speech, speech-synthesis

Subword Nmt

Unsupervised Word Segmentation for Neural Machine Translation and Text Generation

Stars: ✭ 1,819 (-50.64%)

Mutual labels: machine-translation, nmt

Dla

Deep learning for audio processing

Stars: ✭ 142 (-96.15%)

Mutual labels: jupyter-notebook, speech-recognition

Speechrecognizerbutton

UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.