The J.A.R.V.I.S. Speech API is a simple, efficient speech API written in Java. It includes a recognizer, a synthesizer, and a microphone capture utility. The recognizer and synthesizer are backed by Google's speech services, so an Internet connection is required; in exchange, the project provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+155.21%)

Mutual labels: speech-to-text, speech-synthesis

Tacotron 2

DeepMind's Tacotron-2 TensorFlow implementation

Stars: ✭ 1,968 (+925%)

Mutual labels: speech-synthesis, tacotron

Openseq2seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Stars: ✭ 1,378 (+617.71%)

Mutual labels: speech-to-text, speech-synthesis

Tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)

Stars: ✭ 2,581 (+1244.27%)

Mutual labels: speech-synthesis, tacotron

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (-46.35%)

Mutual labels: speech-to-text, speech-synthesis

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-85.94%)

Mutual labels: speech-synthesis, speech-to-text

Wavernn

WaveRNN Vocoder + TTS

Stars: ✭ 1,636 (+752.08%)

Mutual labels: speech-synthesis, tacotron

Tacotron2

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".

Stars: ✭ 43 (-77.6%)

Mutual labels: speech-synthesis, tacotron

Comprehensive-Tacotron2

PyTorch implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS, along with several techniques to improve the robustness and efficiency of the model.

Stars: ✭ 22 (-88.54%)

Mutual labels: speech-synthesis, tacotron

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+338.02%)

Mutual labels: speech-synthesis, speech-to-text

tacotron2

PyTorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.

Stars: ✭ 17 (-91.15%)

Mutual labels: speech-synthesis, tacotron

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (-81.77%)

Mutual labels: speech-synthesis, speech-to-text

leon

🧠 Leon is your open-source personal assistant.

Stars: ✭ 8,560 (+4358.33%)

Mutual labels: speech-synthesis, speech-to-text

Tacotron pytorch

A PyTorch implementation of Tacotron

Stars: ✭ 12 (-93.75%)

Mutual labels: speech-synthesis, tacotron

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-73.96%)

Mutual labels: speech-synthesis, speech-to-text

Artyom.js

A JavaScript library for voice control, voice commands, speech recognition, and speech synthesis. Create your own Siri, Google Now, or Cortana with Google Chrome within your website.

Stars: ✭ 1,011 (+426.56%)

Mutual labels: speech-to-text, speech-synthesis

Xva Synth

Machine learning based speech synthesis Electron app, with voices from specific characters from video games

Stars: ✭ 136 (-29.17%)

Mutual labels: speech-synthesis, tacotron

Proctoring Ai

Software for automatic monitoring in online proctoring

Stars: ✭ 155 (-19.27%)

Mutual labels: speech-to-text

Legacy straight

A vocoder framework that has been widely used in the research community since 1999.

Stars: ✭ 130 (-32.29%)

Mutual labels: speech-synthesis

Deepspeech Server

A testing server for a speech-to-text service based on Mozilla DeepSpeech

Stars: ✭ 176 (-8.33%)

Mutual labels: speech-to-text

Asr audio data links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-33.33%)

Mutual labels: speech-to-text

Tensorflow Ctc Speech Recognition

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (TensorFlow 1.0 but compatible with 2.0).

Stars: ✭ 127 (-33.85%)

Mutual labels: speech-to-text

Marytts

MARY TTS: an open-source, multilingual text-to-speech synthesis system written in pure Java

Stars: ✭ 1,699 (+784.9%)

Mutual labels: speech-synthesis

Speecht

Open-source speech-to-text software written in TensorFlow

Stars: ✭ 152 (-20.83%)

Mutual labels: speech-to-text

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+5707.81%)

Mutual labels: speech-to-text

Pytorch Dc Tts

Text to Speech with PyTorch (English and Mongolian)

Stars: ✭ 122 (-36.46%)

Mutual labels: speech-synthesis

Voice Overlay Android

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 189 (-1.56%)

Mutual labels: speech-to-text

Gst Tacotron

A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

Stars: ✭ 175 (-8.85%)

Mutual labels: tacotron

Speech To Text Russian

A project for Russian speech recognition based on pykaldi.

Stars: ✭ 151 (-21.35%)

Mutual labels: speech-to-text

Deepvoice3 pytorch

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Stars: ✭ 1,654 (+761.46%)

Mutual labels: speech-synthesis

Nlp Models Tensorflow

Gathers machine learning and TensorFlow deep learning models for NLP problems, 1.13 < TensorFlow < 2.0

Stars: ✭ 1,603 (+734.9%)

Mutual labels: speech-to-text

Awesome Speech Recognition Speech Synthesis Papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+985.94%)

Mutual labels: speech-synthesis

Durian

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Stars: ✭ 111 (-42.19%)

Mutual labels: speech-synthesis

Zzz Retired openstt

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

Stars: ✭ 146 (-23.96%)

Mutual labels: speech-to-text

Crystal

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

Stars: ✭ 108 (-43.75%)

Mutual labels: speech-synthesis

Speechrecognizerbutton

UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.

Stars: ✭ 144 (-25%)

Mutual labels: speech-to-text

Self Supervised Speech Recognition

Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework

Stars: ✭ 106 (-44.79%)

Mutual labels: speech-to-text

Automatic Speech Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

Stars: ✭ 192 (+0%)

Mutual labels: speech-to-text

Tensorflow Speech Recognition

🎙 Speech recognition using the TensorFlow deep learning framework and sequence-to-sequence neural networks

Stars: ✭ 2,118 (+1003.13%)

Mutual labels: speech-to-text

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+903.13%)

Mutual labels: speech-synthesis

Wav2letter.pytorch

A fully convolutional network for speech-to-text, built on PyTorch.

Stars: ✭ 104 (-45.83%)

Mutual labels: speech-to-text

Tensorflowtts

😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)

Stars: ✭ 2,382 (+1140.63%)

Mutual labels: speech-synthesis

Speech And Text

Speech to text (PocketSphinx, iFlytek API, Baidu API) and text to speech (pyttsx3)

Stars: ✭ 102 (-46.87%)

Mutual labels: speech-to-text

Hey Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Stars: ✭ 161 (-16.15%)

Mutual labels: speech-to-text

Prosody

Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text

Stars: ✭ 139 (-27.6%)

Mutual labels: speech-synthesis

Vosk Api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Stars: ✭ 1,357 (+606.77%)

Mutual labels: speech-to-text

Mongolian Speech Recognition

Mongolian speech recognition with PyTorch

Stars: ✭ 97 (-49.48%)

Mutual labels: speech-to-text

Wavegrad

A fast, high-quality neural vocoder.

Stars: ✭ 138 (-28.12%)

Mutual labels: speech-synthesis

Waveflow

A PyTorch implementation of "WaveFlow: A Compact Flow-based Model for Raw Audio"

Stars: ✭ 95 (-50.52%)

Mutual labels: speech-synthesis

