End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-94.49%)

Mutual labels: text-to-speech, speech-to-text

Athena

A free and open source replacement for Google Assistant on Android devices, meant to integrate with the Sapphire Framework. It contains both speech-to-text and text-to-speech services. It does not require Google services or network connectivity.

Stars: ✭ 73 (-91.96%)

Mutual labels: text-to-speech, speech-to-text

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (-88.66%)

Mutual labels: speech-to-text, text-to-speech

Openseq2seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Stars: ✭ 1,378 (+51.76%)

Mutual labels: speech-to-text, text-to-speech

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (-7.38%)

Mutual labels: text-to-speech, speech-to-text

Naomi

The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!

Stars: ✭ 171 (-81.17%)

Mutual labels: speech-to-text, text-to-speech

bingspeech-api-client

Microsoft Bing Speech API client in Node.js

Stars: ✭ 32 (-96.48%)

Mutual labels: text-to-speech, speech-to-text

musicologist

Music advice from a conversational interface powered by Algolia

Stars: ✭ 19 (-97.91%)

Mutual labels: text-to-speech, speech-to-text

Awesome Kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Stars: ✭ 393 (-56.72%)

Mutual labels: speech-to-text

Open stt

Open STT

Stars: ✭ 584 (-35.68%)

Mutual labels: speech-to-text

Alan Sdk Web

Alan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.

Stars: ✭ 368 (-59.47%)

Mutual labels: text-to-speech

Tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Stars: ✭ 305 (-66.41%)

Mutual labels: text-to-speech

Eesen

The official repository of the Eesen project

Stars: ✭ 738 (-18.72%)

Mutual labels: speech-to-text

Eddiscovery

Captain's log and 3D star map for Elite Dangerous

Stars: ✭ 541 (-40.42%)

Mutual labels: text-to-speech

Brevitas

Brevitas: quantization-aware training in PyTorch

Stars: ✭ 343 (-62.22%)

Mutual labels: text-to-speech

Espeak

eSpeak NG is an open source speech synthesizer that supports 101 languages and accents.

Stars: ✭ 339 (-62.67%)

Mutual labels: text-to-speech

Tensorflowasr

⚡️ TensorFlowASR: Almost state-of-the-art automatic speech recognition in TensorFlow 2. Supports languages that can be written with characters or subwords.

Stars: ✭ 400 (-55.95%)

Mutual labels: speech-to-text

Transformertts

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

Stars: ✭ 617 (-32.05%)

Mutual labels: text-to-speech

Cheetah

On-device streaming speech-to-text engine powered by deep learning

Stars: ✭ 383 (-57.82%)

Mutual labels: speech-to-text

Nonocaptcha

An asynchronous Python library to automate solving ReCAPTCHA v2 using audio

Stars: ✭ 744 (-18.06%)

Mutual labels: speech-to-text

Xzvoice

Free and open source text-to-speech software

Stars: ✭ 355 (-60.9%)

Mutual labels: text-to-speech

Nodejs Speech

Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.

Stars: ✭ 545 (-39.98%)

Mutual labels: speech-to-text

Voice Builder

An open-source text-to-speech (TTS) voice building tool

Stars: ✭ 362 (-60.13%)

Mutual labels: text-to-speech

Espeak Ng

eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents.

Stars: ✭ 799 (-12%)

Mutual labels: text-to-speech

Autoedit 2

Fast text-based video editing: a Node/Electron OS X desktop app with a Backbone front end.

Stars: ✭ 343 (-62.22%)

Mutual labels: speech-to-text

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (-41.41%)

Mutual labels: speech-to-text

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+584.58%)

Mutual labels: speech-to-text

Tts

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

Stars: ✭ 5,427 (+497.69%)

Mutual labels: text-to-speech

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (-42.51%)

Mutual labels: speech-to-text

Multilingual text to speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Stars: ✭ 324 (-64.32%)

Mutual labels: text-to-speech

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (-64.21%)

Mutual labels: text-to-speech

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (-46.04%)

Mutual labels: speech-to-text

Deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Stars: ✭ 18,680 (+1957.27%)

Mutual labels: speech-to-text

React Mic

Record audio from a user's microphone and display a cool visualization.

Stars: ✭ 323 (-64.43%)

Mutual labels: speech-to-text

Vonage Php Sdk Core

Vonage REST API client for PHP. API support for SMS, Voice, Text-to-Speech, Numbers, Verify (2FA) and more.

Stars: ✭ 849 (-6.5%)

Mutual labels: text-to-speech

Zhrtvc

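Chinese real-time voice cloning (VC) and Chinese text-to-speech (TTS). An easy-to-use Chinese voice cloning and speech synthesis system that includes a speech encoder, a speech synthesizer, a vocoder, and a visualization module.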