smoke-trees / Voice Synthesis

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real time. SV2TTS is a three-stage deep learning framework that creates a numerical representation of a voice from a few seconds of audio and uses it to condition a text-to-speech model trained to generalize to new voices.
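
To make the three-stage data flow concrete, here is a minimal sketch in plain Python. It is not this repository's code: the function names (`encode_speaker`, `synthesize_mel`, `vocode`, `clone_and_speak`), the toy embedding size, and the arithmetic inside each stage are all hypothetical placeholders standing in for trained neural networks. Only the order of the stages and what each one consumes and produces follow the description above.

```python
from typing import List

EMBEDDING_SIZE = 4  # toy value, an assumption for illustration only


def encode_speaker(waveform: List[float]) -> List[float]:
    """Stage 1 (speaker encoder): reference audio -> fixed-size voice vector.

    Placeholder logic: average the waveform over EMBEDDING_SIZE chunks.
    A real encoder is a trained speaker-verification network.
    """
    chunk = max(1, len(waveform) // EMBEDDING_SIZE)
    embedding = []
    for i in range(EMBEDDING_SIZE):
        part = waveform[i * chunk:(i + 1) * chunk] or [0.0]
        embedding.append(sum(part) / len(part))
    return embedding


def synthesize_mel(text: str, embedding: List[float]) -> List[List[float]]:
    """Stage 2 (synthesizer): text conditioned on the voice vector -> frames.

    Placeholder logic: one frame per character, shifted by the embedding.
    """
    return [[ord(ch) / 255.0 + e for e in embedding] for ch in text]


def vocode(mel: List[List[float]]) -> List[float]:
    """Stage 3 (vocoder): spectrogram-like frames -> audio samples.

    Placeholder logic: flatten the frames into one list of samples.
    """
    return [value for frame in mel for value in frame]


def clone_and_speak(reference_audio: List[float], text: str) -> List[float]:
    """Chain the three stages: embed the voice, synthesize frames, vocode."""
    embedding = encode_speaker(reference_audio)
    mel = synthesize_mel(text, embedding)
    return vocode(mel)
```

The point of the sketch is the conditioning step: the same `text` passed with a different `reference_audio` yields different output, because the embedding from stage 1 is an input to stage 2.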

Programming Languages

python

Projects that are alternatives of or similar to Voice Synthesis

Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+860.78%)
Mutual labels:  speech-to-text
Annyang
💬 Speech recognition for your site
Stars: ✭ 6,216 (+12088.24%)
Mutual labels:  speech-to-text
Botium Speech Processing
Botium Speech Processing
Stars: ✭ 908 (+1680.39%)
Mutual labels:  speech-to-text
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+943.14%)
Mutual labels:  speech-to-text
Speech recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+11662.75%)
Mutual labels:  speech-to-text
Nonocaptcha
An asynchronized Python library to automate solving ReCAPTCHA v2 using audio
Stars: ✭ 744 (+1358.82%)
Mutual labels:  speech-to-text
Speech Demo
Speech API examples
Stars: ✭ 454 (+790.2%)
Mutual labels:  speech-to-text
Dc tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: ✭ 1,017 (+1894.12%)
Mutual labels:  speech-to-text
Adapt
Adapt Intent Parser
Stars: ✭ 690 (+1252.94%)
Mutual labels:  speech-to-text
Speechtotext Websockets Java
SDK & Sample to do speech recognition using websockets in Java
Stars: ✭ 11 (-78.43%)
Mutual labels:  speech-to-text
Nodejs Speech
Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.
Stars: ✭ 545 (+968.63%)
Mutual labels:  speech-to-text
Voicy
@voicybot Telegram bot main repository
Stars: ✭ 620 (+1115.69%)
Mutual labels:  speech-to-text
Stephanie Va
Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks, imitating much of a virtual assistant's work.
Stars: ✭ 772 (+1413.73%)
Mutual labels:  speech-to-text
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+923.53%)
Mutual labels:  speech-to-text
Discordspeechbot
A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-31.37%)
Mutual labels:  speech-to-text
Speech To Text Benchmark
speech to text benchmark framework
Stars: ✭ 481 (+843.14%)
Mutual labels:  speech-to-text
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (+1347.06%)
Mutual labels:  speech-to-text
Soloud
Free, easy, portable audio engine for games
Stars: ✭ 1,048 (+1954.9%)
Mutual labels:  speech-to-text
Artyom.js
A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Stars: ✭ 1,011 (+1882.35%)
Mutual labels:  speech-to-text
Kur
Descriptive Deep Learning
Stars: ✭ 811 (+1490.2%)
Mutual labels:  speech-to-text

Voice Cloning and Text to Speech Synthesis

A standalone service for cloning your own voice and synthesizing any English text in that voice.


Read more about the procedure we followed and the findings here


Functionalities

  • [X] Clone a voice from audio samples
  • [X] Synthesize speech from custom text in the cloned voice
  • [X] Speech-to-text input from a microphone
  • [X] REST API with a UI for testing the model

Instructions to run the trained models

To try the trained samples and see how the model performs on custom text, follow these instructions.

  • Pre-requisites:

    • For Windows

      • python (3.6 or 3.7 works best)
      • virtualenv

        If you don't have virtualenv, install it by following the instructions here

      • Trained embeddings from here
    • For Linux

      • Bash shell for executing the scripts
  • Directions to install:

    • For Windows
      • Clone the repo
      • Setup Virtualenv
         virtualenv env
         env\Scripts\activate
        
      • Install the required packages
         pip install -r requirements.txt
        
    • For Linux
      • Run the run.sh file to install the project
         ./run.sh
        

    After installing the dependencies and environment prerequisites, run the script below to check that everything is set up correctly.
     ./test.sh
    
  • Directions to execute

    • Test through interface

      • Start the python flask server
         python app.py
        
      • Open localhost:5000 in a browser to test the model
    • Test through the Synthesize function

      • Follow the instructions given here
    • Test using Docker

        docker build -t smoketrees/voice:latest .
        docker run -p 5000:5000 smoketrees/voice:latest
      
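
Whether the server was started with `python app.py` or through Docker, it can be handy to confirm that something is answering on port 5000 before opening the browser. The helper below is a small sketch using only the Python standard library. The function name `server_is_up` is hypothetical and not part of this project, and it assumes the server listens on `http://localhost:5000/` as described above. It checks only that an HTTP server responds, not that the model itself is loaded.

```python
import urllib.error
import urllib.request


def server_is_up(url: str = "http://localhost:5000/", timeout: float = 3.0) -> bool:
    """Return True if an HTTP server answers at `url`, False otherwise."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server replied with an error status, so it is running.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, and similar problems.
        return False
```

Calling `server_is_up()` from a Python shell after starting the server should return `True`; if it returns `False` with Docker, check that the port mapping (`-p 5000:5000`) was passed before the image name.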

Instructions to train your own models

If you want to work with the source code and train your own models on a different dataset or a different language, follow the instructions here

For more information about the samples tested and their results, see here

Contributors
