Facemoji: 😆 A voice chatbot that can imitate your expression. OpenCV+Dlib+Live2D+Moments Recorder+Turing Robot+Iflytek IAT+Iflytek TTS
Stars: ✭ 320 (-68.53%)
Athena: an open-source implementation of a sequence-to-sequence based speech processing engine
Stars: ✭ 542 (-46.71%)
Cognitive Speech Tts: Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.
Stars: ✭ 312 (-69.32%)
Eesen: The official repository of the Eesen project
Stars: ✭ 738 (-27.43%)
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (-48.67%)
Pocketsphinx Python: Python interface to CMU Sphinxbase and Pocketsphinx libraries
Stars: ✭ 298 (-70.7%)
Jsut Lab: HTS-style full-context labels for JSUT v1.1
Stars: ✭ 28 (-97.25%)
Pysptk: A Python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (-70.8%)
Sednn: Deep learning based speech enhancement using Keras or PyTorch, made easy to use
Stars: ✭ 288 (-71.68%)
Glow Tts: A Generative Flow for Text-to-Speech via Monotonic Alignment Search
Stars: ✭ 284 (-72.07%)
Parakeet: PAddle PARAllel text-to-speech toolKIT (supporting WaveFlow, WaveNet, Transformer TTS and Tacotron2)
Stars: ✭ 279 (-72.57%)
Mtrans: Multi-source Translation
Stars: ✭ 711 (-30.09%)
Flutter tts: Flutter Text to Speech package
Stars: ✭ 263 (-74.14%)
Ekho: Chinese text-to-speech engine
Stars: ✭ 690 (-32.15%)
Xr3player: 🎧 🎼 Advanced JavaFX Media Player
Stars: ✭ 472 (-53.59%)
Speech Aligner: speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription
Stars: ✭ 259 (-74.53%)
Amazing Python Scripts: 🚀 Curated collection of amazing Python scripts, from basic to advanced, with automation task scripts.
Stars: ✭ 229 (-77.48%)
Comprehensive-Tacotron2: PyTorch implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single- and multi-speaker TTS and several techniques to improve the robustness and efficiency of the model.
Stars: ✭ 22 (-97.84%)
Noise2Noise-audio denoising without clean training data: Source code for the paper "Speech Denoising without Clean Training Data: a Noise2Noise Approach", accepted at the INTERSPEECH 2021 conference. The paper tackles the heavy dependence of deep learning based audio denoising methods on clean speech data by showing that it is possible to train deep speech denoisi…
Stars: ✭ 49 (-95.18%)
Adapt: Adapt Intent Parser
Stars: ✭ 690 (-32.15%)
Melgan: MelGAN vocoder (compatible with NVIDIA/tacotron2)
Stars: ✭ 444 (-56.34%)
hifigan-denoiser: HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-91.35%)
esp32-flite: Speech synthesis running on ESP32, based on the Flite engine.
Stars: ✭ 28 (-97.25%)
Voice Overlay Ios: 🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (-56.74%)
minutes: 🔭 Speaker diarization via transfer learning
Stars: ✭ 25 (-97.54%)
Parallelwavegan: Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN) with PyTorch
Stars: ✭ 682 (-32.94%)
demo vietasr: Vietnamese Speech Recognition
Stars: ✭ 22 (-97.84%)
RequestifyTF2: Client-side commands for mic spamming and more!
Stars: ✭ 13 (-98.72%)
flite-go: Go bindings for Flite (festival-lite)
Stars: ✭ 14 (-98.62%)
Transformer Tts: A PyTorch implementation of "Neural Speech Synthesis with Transformer Network"
Stars: ✭ 418 (-58.9%)
Praat: Doing Phonetics By Computer
Stars: ✭ 675 (-33.63%)
Asrt speechrecognition: A Deep-Learning-Based Chinese Speech Recognition System
Stars: ✭ 4,943 (+386.04%)
google-translate-tts: Node library for Google Translate TTS (Text-to-Speech) API
Stars: ✭ 23 (-97.74%)
leopard-chat-ui-teneo: Leopard Chat UI, a Teneo chat client based on Vue and Vuetify
Stars: ✭ 65 (-93.61%)
Specaugment: An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain
Stars: ✭ 408 (-59.88%)
talkbot: Text-to-speech and translation bot for Discord
Stars: ✭ 27 (-97.35%)
Dialectid e2e: End-to-End Dialect Identification using a Convolutional Neural Network
Stars: ✭ 40 (-96.07%)
apple airplayer: Use your AirPlay devices as TTS speakers
Stars: ✭ 84 (-91.74%)
Kur: Descriptive Deep Learning
Stars: ✭ 811 (-20.26%)
Segan: Speech Enhancement Generative Adversarial Network in TensorFlow
Stars: ✭ 661 (-35%)
Rhino: On-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (-60.08%)
tt-vae-gan: Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.
Stars: ✭ 37 (-96.36%)
BangalASR: Transformer-based Bangla Speech Recognition
Stars: ✭ 20 (-98.03%)
Neural sp: End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (-59.88%)
home-assistant-custom-components-linkplay: LinkPlay-based media devices integration for Home Assistant. Fully compatible with Mini Media Player card including speaker group management. Supports snapshot and restore functionality for TTS.
Stars: ✭ 62 (-93.9%)
Speech recognition: Speech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+489.87%)
Tensorflowasr: ⚡️ TensorFlowASR, almost state-of-the-art automatic speech recognition in TensorFlow 2. Supports languages that can use characters or subwords
Stars: ✭ 400 (-60.67%)
Speech256: An FPGA implementation of a classic 1980s speech synthesizer. Done for the Retro Challenge 2017/10.
Stars: ✭ 51 (-94.99%)
speak.awf: An Alfred 3 workflow that uses macOS's TTS (text-to-speech) feature to speak text aloud.
Stars: ✭ 29 (-97.15%)