All Projects → Deepvoice3_pytorch → Similar Projects or Alternatives

369 Open source projects that are alternatives of or similar to Deepvoice3_pytorch

Parrot
RNN-based generative models for speech.
Stars: ✭ 601 (-63.66%)
Mutual labels:  speech-synthesis
porfir
Голосовой ассистент Порфирьевич
Stars: ✭ 23 (-98.61%)
Mutual labels:  speech-synthesis
Tf Wavenet vocoder
Wavenet and its applications with Tensorflow
Stars: ✭ 58 (-96.49%)
Mutual labels:  speech-synthesis
ms-ra-forwarder
A free online TTS API
Stars: ✭ 397 (-76%)
Mutual labels:  tts
End-to-End-Mandarin-ASR
End-to-end speech recognition on AISHELL dataset.
Stars: ✭ 20 (-98.79%)
Mutual labels:  end-to-end
Naomi
The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (-89.66%)
Mutual labels:  speech-synthesis
Melgan Neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Stars: ✭ 592 (-64.21%)
Mutual labels:  speech-synthesis
AmazonSpeechTranslator
End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-96.98%)
Mutual labels:  speech-synthesis
Espeak Ng
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
Stars: ✭ 799 (-51.69%)
Mutual labels:  speech-synthesis
Flutter tts
Flutter Text to Speech package
Stars: ✭ 263 (-84.1%)
Mutual labels:  tts
few-shot-transformer-tts
Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.
Stars: ✭ 60 (-96.37%)
Mutual labels:  speech-synthesis
Tfg Voice Conversion
Deep Learning-based Voice Conversion system
Stars: ✭ 115 (-93.05%)
Mutual labels:  speech-processing
Prosody
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-91.6%)
Mutual labels:  speech-synthesis
py-espeak-ng
Some simple wrappers around eSpeak NG intended to make using this excellent TTS for waveform and IPA generation as convenient as possible.
Stars: ✭ 27 (-98.37%)
Mutual labels:  tts
Diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (-91.6%)
Mutual labels:  speech-synthesis
Real Time Voice Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Stars: ✭ 32,095 (+1840.45%)
Mutual labels:  tts
Xva Synth
Machine learning based speech synthesis Electron app, with voices from specific characters from video games
Stars: ✭ 136 (-91.78%)
Mutual labels:  speech-synthesis
wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 6,026 (+264.33%)
Mutual labels:  end-to-end
Awesome Ai Services
An overview of the AI-as-a-service landscape
Stars: ✭ 133 (-91.96%)
Mutual labels:  speech-synthesis
Pink Trombone
A programmable version of Neil Thapen's Pink Trombone
Stars: ✭ 54 (-96.74%)
Mutual labels:  speech-synthesis
Awesome Speech Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Stars: ✭ 257 (-84.46%)
Mutual labels:  speech-processing
magphase
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
Stars: ✭ 76 (-95.41%)
Mutual labels:  tts
MediumVC
Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features
Stars: ✭ 46 (-97.22%)
Mutual labels:  speech-synthesis
Vq Vae Speech
PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]
Stars: ✭ 187 (-88.69%)
Mutual labels:  speech-processing
Joytan
Creative Audio/Textbook Maker 🎵 📖 See our YouTube channel
Stars: ✭ 91 (-94.5%)
Mutual labels:  tts
Sdk Js
Tanker client-side encryption SDK for JavaScript
Stars: ✭ 786 (-52.48%)
Mutual labels:  end-to-end
SpeechTransProgress
Tracking the progress in end-to-end speech translation
Stars: ✭ 139 (-91.6%)
Mutual labels:  speech-processing
FastSpeech2
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech ✊
Stars: ✭ 64 (-96.13%)
Mutual labels:  tts
Tutorial separation
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
Stars: ✭ 151 (-90.87%)
Mutual labels:  speech-processing
pygtrans
谷歌翻译, 支持 APIKEY 一口气翻译十万条
Stars: ✭ 60 (-96.37%)
Mutual labels:  tts
Zzz Retired openstt
RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (-91.17%)
Mutual labels:  speech-processing
Inspectit
inspectIT is the leading Open Source APM (Application Performance Management) tool for analyzing your Java (EE) applications.
Stars: ✭ 513 (-68.98%)
Mutual labels:  end-to-end
A Convolutional Recurrent Neural Network For Real Time Speech Enhancement
A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorch
Stars: ✭ 123 (-92.56%)
Mutual labels:  speech-processing
DiscordEncryption
🔐 Configurable end to end encryption for Discord
Stars: ✭ 30 (-98.19%)
Mutual labels:  end-to-end
Sstd
Single Shot Text Detector with Regional Attention
Stars: ✭ 221 (-86.64%)
Mutual labels:  end-to-end
Fullsubnet
PyTorch implementation of "A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Stars: ✭ 51 (-96.92%)
Mutual labels:  speech-processing
Kospeech
Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Stars: ✭ 190 (-88.51%)
Mutual labels:  end-to-end
text-to-speech
⚡️ Capacitor plugin for synthesizing speech from text.
Stars: ✭ 50 (-96.98%)
Mutual labels:  tts
End2end Asr Pytorch
End-to-End Automatic Speech Recognition on PyTorch
Stars: ✭ 175 (-89.42%)
Mutual labels:  end-to-end
Lanedetection end2end
End-to-end Lane Detection for Self-Driving Cars (ICCV 2019 Workshop)
Stars: ✭ 500 (-69.77%)
Mutual labels:  end-to-end
Listen Attend Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
Stars: ✭ 147 (-91.11%)
Mutual labels:  end-to-end
Rus-SpeechRecognition-LSTM-CTC-VoxForge
Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge
Stars: ✭ 50 (-96.98%)
Mutual labels:  end-to-end
Mtts
A Demo of Mandarin/Chinese TTS frontend
Stars: ✭ 229 (-86.15%)
Mutual labels:  tts
Wave U Net For Speech Enhancement
Implement Wave-U-Net by PyTorch, and migrate it to the speech enhancement.
Stars: ✭ 106 (-93.59%)
Mutual labels:  speech-processing
Multi Tacotron Voice Cloning
Phoneme multilingual(Russian-English) voice cloning based on
Stars: ✭ 192 (-88.39%)
Mutual labels:  tts
shairport-sync
AirPlay audio player. Shairport Sync adds multi-room capability with Audio Synchronisation
Stars: ✭ 5,532 (+234.46%)
Mutual labels:  multi-speaker
Speaker adapted tts
Making a TTS model with 1 minute of speech samples within 10 minutes
Stars: ✭ 183 (-88.94%)
Mutual labels:  tts
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-70.37%)
Mutual labels:  speech-synthesis
Gst Tacotron
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
Stars: ✭ 175 (-89.42%)
Mutual labels:  tts
cypress-example-docker-circle-workflows
Cypress + Docker + CircleCI Workflows = ❤️
Stars: ✭ 29 (-98.25%)
Mutual labels:  end-to-end
Melnet
Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"
Stars: ✭ 161 (-90.27%)
Mutual labels:  tts
Keras Sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-97.16%)
Mutual labels:  speech-processing
mimic2
Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito.
Stars: ✭ 537 (-67.53%)
Mutual labels:  speech-synthesis
Tts
Text-to-Speech for Arduino
Stars: ✭ 118 (-92.87%)
Mutual labels:  tts
Tf Kaldi Speaker
Neural speaker recognition/verification system based on Kaldi and Tensorflow
Stars: ✭ 117 (-92.93%)
Mutual labels:  speech-processing
Kalliope
Kalliope is a framework that will help you to create your own personal assistant.
Stars: ✭ 1,509 (-8.77%)
Mutual labels:  speech-synthesis
Openseq2seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (-16.69%)
Mutual labels:  speech-synthesis
Drachtio Freeswitch Modules
A collection of open-sourced freeswitch modules that I use in various drachtio applications
Stars: ✭ 73 (-95.59%)
Mutual labels:  tts
Zhrtvc
Chinese real time voice cloning (VC) and Chinese text to speech (TTS). 好用的中文语音克隆兼中文语音合成系统,包含语音编码器、语音合成器、声码器和可视化模块。
Stars: ✭ 771 (-53.39%)
Mutual labels:  tts
hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
Stars: ✭ 88 (-94.68%)
Mutual labels:  speech-processing
301-360 of 369 similar projects