All Projects → sotelo → Parrot

sotelo / Parrot

RNN-based generative models for speech.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Parrot

STORN-keras
This is a STORN (Stochastical Recurrent Neural Network) implementation for keras!
Stars: ✭ 23 (-96.17%)
Mutual labels:  theano, recurrent-neural-networks
Merlin
This is now the official location of the Merlin project.
Stars: ✭ 1,168 (+94.34%)
Mutual labels:  speech-synthesis, theano
Punctuator2
A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text
Stars: ✭ 483 (-19.63%)
Mutual labels:  recurrent-neural-networks, theano
Theano Kaldi Rnn
THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.
Stars: ✭ 31 (-94.84%)
Mutual labels:  recurrent-neural-networks, theano
rnn benchmarks
RNN benchmarks of pytorch, tensorflow and theano
Stars: ✭ 85 (-85.86%)
Mutual labels:  theano, recurrent-neural-networks
Rnn ctc
Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (-63.39%)
Mutual labels:  recurrent-neural-networks, theano
Machine Learning Curriculum
💻 Make machines learn so that you don't have to struggle to program them; The ultimate list
Stars: ✭ 761 (+26.62%)
Mutual labels:  recurrent-neural-networks, theano
sequence-rnn-py
Sequence analyzing using Recurrent Neural Networks (RNN) based on Keras
Stars: ✭ 28 (-95.34%)
Mutual labels:  theano, recurrent-neural-networks
Returnn
The RWTH extensible training framework for universal recurrent neural networks
Stars: ✭ 290 (-51.75%)
Mutual labels:  recurrent-neural-networks, theano
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-18.47%)
Mutual labels:  speech-synthesis
Athena
an open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (-9.82%)
Mutual labels:  speech-synthesis
Autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Stars: ✭ 485 (-19.3%)
Mutual labels:  speech-synthesis
Nmt Keras
Neural Machine Translation with Keras
Stars: ✭ 501 (-16.64%)
Mutual labels:  theano
Trending Deep Learning
Top 100 trending deep learning repositories sorted by the number of stars gained on a specific day.
Stars: ✭ 543 (-9.65%)
Mutual labels:  recurrent-neural-networks
Tensorflow Vae Gan Draw
A collection of generative methods implemented with TensorFlow (Deep Convolutional Generative Adversarial Networks (DCGAN), Variational Autoencoder (VAE) and DRAW: A Recurrent Neural Network For Image Generation).
Stars: ✭ 577 (-3.99%)
Mutual labels:  recurrent-neural-networks
Deepalignmentnetwork
A deep neural network for face alignment
Stars: ✭ 480 (-20.13%)
Mutual labels:  theano
Practical rl
A course in reinforcement learning in the wild
Stars: ✭ 4,741 (+688.85%)
Mutual labels:  theano
Melgan Neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
Stars: ✭ 592 (-1.5%)
Mutual labels:  speech-synthesis
Deep Trading Agent
Deep Reinforcement Learning based Trading Agent for Bitcoin
Stars: ✭ 573 (-4.66%)
Mutual labels:  recurrent-neural-networks
Ner Lstm
Named Entity Recognition using multilayered bidirectional LSTM
Stars: ✭ 532 (-11.48%)
Mutual labels:  recurrent-neural-networks

Speech synthesis using recurrent neural networks.

This repo has the code for our ICLR submission:

Jose Sotelo, Soroush Mehri, Kundan Kumar, João Felipe Santos, Kyle Kastner, Aaron Courville, Yoshua Bengio. Char2Wav: End-to-End Speech Synthesis.

The website is here.

NOTE: The code is currently being refactored/cleaned/documented. We wanted to make it available as soon as possible but we know well that the current version is not ready for replication. Therefore, if you're interested in this, please come back later. Alternatively, you can send me an email and I will let you know when it's ready.

NOTE(2): The code for the neural vocoder is based on sampleRNN.

Updates:

  • SampleRNN modules added
  • End-to-end model can generate more than 4s of audio in 1s on p6000 gpu. (This is achieved by generating 200 samples of length 10s in a single batch. Total time taken for this is 448 seconds.)
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].