smoke-trees / Voice Synthesis
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generalize to new voices.
Stars: β 51
Programming Languages
python
139335 projects - #7 most used programming language
Projects that are alternatives of or similar to Voice Synthesis
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: β 490 (+860.78%)
Mutual labels: speech-to-text
Annyang
π¬ Speech recognition for your site
Stars: β 6,216 (+12088.24%)
Mutual labels: speech-to-text
Botium Speech Processing
Botium Speech Processing
Stars: β 908 (+1680.39%)
Mutual labels: speech-to-text
Sonus
π¬ /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: β 532 (+943.14%)
Mutual labels: speech-to-text
Speech recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: β 5,999 (+11662.75%)
Mutual labels: speech-to-text
Nonocaptcha
An asynchronized Python library to automate solving ReCAPTCHA v2 using audio
Stars: β 744 (+1358.82%)
Mutual labels: speech-to-text
Dc tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: β 1,017 (+1894.12%)
Mutual labels: speech-to-text
Speechtotext Websockets Java
SDK & Sample to do speech recognition using websockets in Java
Stars: β 11 (-78.43%)
Mutual labels: speech-to-text
Nodejs Speech
Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.
Stars: β 545 (+968.63%)
Mutual labels: speech-to-text
Stephanie Va
Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Stars: β 772 (+1413.73%)
Mutual labels: speech-to-text
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: β 522 (+923.53%)
Mutual labels: speech-to-text
Discordspeechbot
A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: β 35 (-31.37%)
Mutual labels: speech-to-text
Speech To Text Benchmark
speech to text benchmark framework
Stars: β 481 (+843.14%)
Mutual labels: speech-to-text
Eesen
The official repository of the Eesen project
Stars: β 738 (+1347.06%)
Mutual labels: speech-to-text
Soloud
Free, easy, portable audio engine for games
Stars: β 1,048 (+1954.9%)
Mutual labels: speech-to-text
Artyom.js
A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Stars: β 1,011 (+1882.35%)
Mutual labels: speech-to-text
Voice Cloning and Text to Speech Synthesis
A Standalone service for cloning your own voice and synthesize any text in English in your own voice.
Read more about the procedure we followed and the findings here
Functionalities
- [X] Clone voices after feeding samples to it
- [X] Synthesized voice on custom texts
- [X] Speech-to-text facility for input using microphones
- [X] RestAPI with a testing UI for testing the model
Instructions to run the trained models
If you want to try and test out the samples trained and how the model is performing on custom text you can follow these instructions.
-
Pre-requisites:
-
Directions to install:
- For windows
- Clone the repo
- Setup Virtualenv
virtualenv env cd env/scripts activate
- Install all requirements packages
pip install -r requirements.txt
- For Linux
- Run the run.sh file to install the project
./run.sh
- Run the run.sh file to install the project
After installing all the dependencies and environment prerequisites run the below file to check you are ready and good to go!
./test.sh
- For windows
-
Directions to execute
-
Test through interface
- Start the python flask server
python app.py
- Log on to localhost:5000 to test the model
- Start the python flask server
-
Test through the Synthesize function
- Follow the instructions given here
-
Test using Docker
docker build -t smoketrees/voice:latest . docker run smoketrees/voice:latest -p 5000:5000
-
Instructions to train your own models
If you want to work with the source code and want to train your own models on different dataset and different language medium you can check out the instructions mentioned here
For more information about the samples tested and there results you can get all the information from here
Contributors
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].