End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-51.46%)

Mutual labels: text-to-speech, speech-synthesis, voice-recognition, speech-recognition, speech-to-text

Naomi

The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!

Stars: ✭ 171 (+66.02%)

Mutual labels: raspberry-pi, speech-recognition, speech-to-text, speech-synthesis, text-to-speech

Awesome Ai Services

An overview of the AI-as-a-service landscape

Stars: ✭ 133 (+29.13%)

Mutual labels: natural-language-processing, speech-recognition, speech-to-text, speech-synthesis, text-to-speech

Lingvo

Stars: ✭ 2,361 (+2192.23%)

Mutual labels: speech-recognition, speech-to-text, speech-synthesis, tts

Nemo

NeMo: a toolkit for conversational AI

Stars: ✭ 3,685 (+3477.67%)

Mutual labels: speech-recognition, text-to-speech, speech-synthesis, speech-to-text

Vosk Api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Stars: ✭ 1,357 (+1217.48%)

Mutual labels: raspberry-pi, speech-recognition, speech-to-text, voice-recognition

leon

🧠 Leon is your open-source personal assistant.

Stars: ✭ 8,560 (+8210.68%)

Mutual labels: text-to-speech, speech-synthesis, speech-recognition, speech-to-text

Clause

🏇 聊天机器人，自然语言理解，语义理解

Stars: ✭ 323 (+213.59%)

Mutual labels: bot, natural-language-processing, natural-language-understanding, nlu

Cheetah

On-device streaming speech-to-text engine powered by deep learning

Stars: ✭ 383 (+271.84%)

Mutual labels: raspberry-pi, speech-recognition, speech-to-text, voice-recognition

Botlibre

An open platform for artificial intelligence, chat bots, virtual agents, social media automation, and live chat automation.

Stars: ✭ 412 (+300%)

Mutual labels: bot, natural-language-processing, natural-language-understanding, nlu

Kalliope

Kalliope is a framework that will help you to create your own personal assistant.

Stars: ✭ 1,509 (+1365.05%)

Mutual labels: bot, speech-recognition, speech-to-text, speech-synthesis

Deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Stars: ✭ 18,680 (+18035.92%)

Mutual labels: neural-networks, speech-recognition, speech-to-text, embedded

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (-66.02%)

Mutual labels: text-to-speech, speech-synthesis, speech-recognition, speech-to-text

Openseq2seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Stars: ✭ 1,378 (+1237.86%)

Mutual labels: speech-recognition, speech-to-text, speech-synthesis, text-to-speech

Speech And Text

Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字（PocketSphinx、百度 API、科大讯飞 API）和文字转语音（pyttsx3）

Stars: ✭ 102 (-0.97%)

Mutual labels: speech-recognition, speech-to-text, text-to-speech

bingspeech-api-client

Microsoft Bing Speech API client in node.js

Stars: ✭ 32 (-68.93%)

Mutual labels: text-to-speech, tts, speech-to-text

Py Nltools

A collection of basic python modules for spoken natural language processing

Stars: ✭ 46 (-55.34%)

Mutual labels: natural-language-processing, speech-recognition, tts

Cs224n Gpu That Talks

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Stars: ✭ 52 (-49.51%)

Mutual labels: speech-synthesis, text-to-speech, tts

talkie

Text-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.

Stars: ✭ 43 (-58.25%)

Mutual labels: text-to-speech, tts, speech-synthesis

musicologist

Music advice from a conversational interface powered by Algolia

Stars: ✭ 19 (-81.55%)

Mutual labels: text-to-speech, speech-recognition, speech-to-text

Artyom.js

A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.

Stars: ✭ 1,011 (+881.55%)

Mutual labels: speech-recognition, speech-to-text, speech-synthesis

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (+53.4%)

Mutual labels: text-to-speech, tts, speech-synthesis

LVCNet

LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation

Stars: ✭ 67 (-34.95%)

Mutual labels: text-to-speech, tts, speech-synthesis

voce-browser

Voice Controlled Chromium Web Browser

Stars: ✭ 34 (-66.99%)

Mutual labels: voice-recognition, speech-recognition, speech-to-text

Articutapi

API of Articut 中文斷詞 (兼具語意詞性標記)：「斷詞」又稱「分詞」，是中文資訊處理的基礎。Articut 不用機器學習，不需資料模型，只用現代白話中文語法規則，即能達到 SIGHAN 2005 F1-measure 94% 以上，Recall 96% 以上的成績。

Stars: ✭ 252 (+144.66%)

Mutual labels: natural-language-processing, natural-language-understanding, nlu

Fre-GAN-pytorch

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

Stars: ✭ 73 (-29.13%)

Mutual labels: text-to-speech, tts, speech-synthesis

Parakeet

PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)

Stars: ✭ 279 (+170.87%)

Mutual labels: speech-synthesis, text-to-speech, tts

Oie Resources

A curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.

Stars: ✭ 283 (+174.76%)

Mutual labels: natural-language-processing, natural-language-understanding, nlu

Parallel-Tacotron2

PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Stars: ✭ 149 (+44.66%)

Mutual labels: text-to-speech, tts, speech-synthesis

Dragonfire

the open-source virtual assistant for Ubuntu based Linux distributions

Stars: ✭ 1,120 (+987.38%)

Mutual labels: speech-recognition, speech-to-text, text-to-speech

Comprehensive-Tacotron2

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Stars: ✭ 22 (-78.64%)

Mutual labels: text-to-speech, tts, speech-synthesis

Intent classifier

Stars: ✭ 67 (-34.95%)

Mutual labels: natural-language-processing, neural-networks, natural-language-understanding

esp32-flite

Speech synthesis running on ESP32 based on Flite engine.

Stars: ✭ 28 (-72.82%)

Mutual labels: text-to-speech, tts, speech-synthesis

Cognitive Speech Tts

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

Stars: ✭ 312 (+202.91%)

Mutual labels: speech-synthesis, text-to-speech, tts

editts

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech

Stars: ✭ 74 (-28.16%)

Mutual labels: text-to-speech, tts, speech-synthesis

Glow Tts

A Generative Flow for Text-to-Speech via Monotonic Alignment Search

Stars: ✭ 284 (+175.73%)

Mutual labels: speech-synthesis, text-to-speech, tts

Hifi Gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Stars: ✭ 325 (+215.53%)

Mutual labels: speech-synthesis, text-to-speech, tts

Nativescript Speech Recognition

💬 Speech to text, using the awesome engines readily available on the device.

Stars: ✭ 72 (-30.1%)

Mutual labels: speech-recognition, speech-to-text, voice-recognition

Bidaf Keras

Bidirectional Attention Flow for Machine Comprehension implemented in Keras 2

Stars: ✭ 60 (-41.75%)

Mutual labels: natural-language-processing, neural-networks, natural-language-understanding

Voice Builder

An opensource text-to-speech (TTS) voice building tool

Stars: ✭ 362 (+251.46%)

Mutual labels: speech-synthesis, text-to-speech, tts

Wav2letter

Speech Recognition model based off of FAIR research paper built using Pytorch.

Stars: ✭ 78 (-24.27%)

Mutual labels: neural-networks, speech-recognition, speech-to-text

Libfaceid

libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.

Stars: ✭ 354 (+243.69%)

Mutual labels: raspberry-pi, speech-recognition, speech-synthesis

Voice Overlay Ios

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 440 (+327.18%)

Mutual labels: speech-recognition, speech-to-text, voice-recognition

Mycroft Precise

A lightweight, simple-to-use, RNN wake word listener

Stars: ✭ 481 (+366.99%)

Mutual labels: raspberry-pi, speech-recognition, voice-recognition

Speech To Text Benchmark

speech to text benchmark framework

Stars: ✭ 481 (+366.99%)

Mutual labels: speech-recognition, speech-to-text, voice-recognition

Spark Nlp Models

Models and Pipelines for the Spark NLP library

Stars: ✭ 88 (-14.56%)

Mutual labels: natural-language-processing, natural-language-understanding, nlu

Chat

基于自然语言理解与机器学习的聊天机器人，支持多用户并发及自定义多轮对话

Stars: ✭ 516 (+400.97%)

Mutual labels: natural-language-processing, natural-language-understanding, nlu

Brevitas

Brevitas: quantization-aware training in PyTorch

Stars: ✭ 343 (+233.01%)

Mutual labels: neural-networks, speech-recognition, text-to-speech

Nlp.js

An NLP library for building bots, with entity extraction, sentiment analysis, automatic language identify, and so more

Stars: ✭ 4,670 (+4433.98%)

Mutual labels: bot, natural-language-processing, nlu

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+375.73%)

Mutual labels: speech-recognition, speech-to-text, speech-synthesis

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (+416.5%)

Mutual labels: speech-recognition, speech-to-text, voice-recognition

Parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with Pytorch

Stars: ✭ 682 (+562.14%)

Mutual labels: speech-synthesis, text-to-speech, tts

Nlp Recipes

Natural Language Processing Best Practices & Examples