spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-48.08%)
Mutual labels: text-to-speech, speech-synthesis, voice-recognition, speech-recognition, vad, asr, voice-assistant, natural-language-understanding, speech-api, voice-activity-detection, voice-synthesis, wakeword, wakeword-activation Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (+98.08%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+1517.31%)
anycontrolVoice control for your websites and applications
Stars: ✭ 53 (+1.92%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-59.62%)
brasilttsBrasil TTS é um conjunto de sintetizadores de voz, em português do Brasil, que lê telas para portadores de deficiência visual. Transforma texto em áudio, permitindo que pessoas cegas ou com baixa visão tenham acesso ao conteúdo exibido na tela. Embora o principal público-alvo de sistemas de conversão texto-fala – como o Brasil TTS – seja formado…
Stars: ✭ 34 (-34.62%)
LingvoLingvo
Stars: ✭ 2,361 (+4440.38%)
StyleSpeechOfficial implementation of Meta-StyleSpeech and StyleSpeech
Stars: ✭ 161 (+209.62%)
edittsOfficial implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech
Stars: ✭ 74 (+42.31%)
ttslearnttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (+203.85%)
NaomiThe Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (+228.85%)
IMS-ToucanText-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+467.31%)
cobraOn-device voice activity detection (VAD) powered by deep learning.
Stars: ✭ 76 (+46.15%)
WavegradImplementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
Stars: ✭ 245 (+371.15%)
idear🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (+61.54%)
DeltaDELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+2744.23%)
Zero-Shot-TTSUnofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Stars: ✭ 33 (-36.54%)
DurianImplementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (+113.46%)
leon🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+16361.54%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+942.31%)
talkieText-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.
Stars: ✭ 43 (-17.31%)
TensorVoxDesktop application for neural speech synthesis written in C++
Stars: ✭ 140 (+169.23%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+1117.31%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-59.62%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-3.85%)
LightspeechLightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-40.38%)
JSpeakA Text to Speech Reader Front-end that Reads from the Clipboard and with Exceptionable Features
Stars: ✭ 16 (-69.23%)
AdaSpeechAdaSpeech: Adaptive Text to Speech for Custom Voice
Stars: ✭ 108 (+107.69%)
Voice BuilderAn opensource text-to-speech (TTS) voice building tool
Stars: ✭ 362 (+596.15%)
Fre-GAN-pytorchFre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: ✭ 73 (+40.38%)
WsayWindows "say"
Stars: ✭ 36 (-30.77%)
voice gender detection♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
Stars: ✭ 51 (-1.92%)
Alan Sdk PcfAlan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.
Stars: ✭ 128 (+146.15%)
MaryttsMARY TTS -- an open-source, multilingual text-to-speech synthesis system written in pure java
Stars: ✭ 1,699 (+3167.31%)
vasisualyVasisualy it's a simple Russian voice assistant written on Python for GNU/Linux, Windows and Android.
Stars: ✭ 33 (-36.54%)
Pytorch Dc TtsText to Speech with PyTorch (English and Mongolian)
Stars: ✭ 122 (+134.62%)
TalkifyJavascript Text to speech library
Stars: ✭ 132 (+153.85%)
Daft-ExprtPyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Stars: ✭ 41 (-21.15%)
DiffwaveDiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (+167.31%)
WavegradA fast, high-quality neural vocoder.
Stars: ✭ 138 (+165.38%)
WaveGrad2PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
Stars: ✭ 55 (+5.77%)
TtsText-to-Speech for Arduino
Stars: ✭ 118 (+126.92%)
AndroidmaryttsAndroid MARY TTS - an open-source, offline HMM-Based text-to-speech synthesis system based on MaryTTS
Stars: ✭ 134 (+157.69%)
melganMelGAN implementation with Multi-Band and Full Band supports...
Stars: ✭ 54 (+3.85%)
porfirГолосовой ассистент Порфирьевич
Stars: ✭ 23 (-55.77%)
Aeneasaeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Stars: ✭ 1,942 (+3634.62%)
Tensorflowtts😝 TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
Stars: ✭ 2,382 (+4480.77%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+294.23%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+580.77%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+244.23%)
simple-obs-sttSpeech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+71.15%)
sepia-docsDocumentation and Wiki for SEPIA. Please post your questions and bug-reports here in the issues section! Thank you :-)
Stars: ✭ 160 (+207.69%)
TFGANTFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Stars: ✭ 65 (+25%)
vitsVITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Stars: ✭ 1,604 (+2984.62%)
Expressive-FastSpeech2PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean, and your own languages.
Stars: ✭ 139 (+167.31%)
LVCNetLVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
Stars: ✭ 67 (+28.85%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-32.69%)