Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.

Stars: ✭ 31 (-94.84%)

Mutual labels: recurrent-neural-networks, theano

rnn benchmarks

RNN benchmarks of pytorch, tensorflow and theano

Stars: ✭ 85 (-85.86%)

Mutual labels: theano, recurrent-neural-networks

Rnn ctc

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Stars: ✭ 220 (-63.39%)

Mutual labels: recurrent-neural-networks, theano

Machine Learning Curriculum

💻 Make machines learn so that you don't have to struggle to program them; The ultimate list

Stars: ✭ 761 (+26.62%)

Mutual labels: recurrent-neural-networks, theano

sequence-rnn-py

Sequence analyzing using Recurrent Neural Networks (RNN) based on Keras

Stars: ✭ 28 (-95.34%)

Mutual labels: theano, recurrent-neural-networks

Returnn

The RWTH extensible training framework for universal recurrent neural networks

Stars: ✭ 290 (-51.75%)

Mutual labels: recurrent-neural-networks, theano

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (-18.47%)

Mutual labels: speech-synthesis

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

Stars: ✭ 542 (-9.82%)

Mutual labels: speech-synthesis

Autovc

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Stars: ✭ 485 (-19.3%)

Mutual labels: speech-synthesis

Nmt Keras

Neural Machine Translation with Keras

Stars: ✭ 501 (-16.64%)

Mutual labels: theano

Trending Deep Learning

Top 100 trending deep learning repositories sorted by the number of stars gained on a specific day.

Stars: ✭ 543 (-9.65%)

Mutual labels: recurrent-neural-networks

Tensorflow Vae Gan Draw

A collection of generative methods implemented with TensorFlow (Deep Convolutional Generative Adversarial Networks (DCGAN), Variational Autoencoder (VAE) and DRAW: A Recurrent Neural Network For Image Generation).

Stars: ✭ 577 (-3.99%)

Mutual labels: recurrent-neural-networks

Deepalignmentnetwork

A deep neural network for face alignment

Stars: ✭ 480 (-20.13%)

Mutual labels: theano

Practical rl

A course in reinforcement learning in the wild

Stars: ✭ 4,741 (+688.85%)

Mutual labels: theano

Melgan Neurips

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

Stars: ✭ 592 (-1.5%)

Mutual labels: speech-synthesis

Deep Trading Agent

Deep Reinforcement Learning based Trading Agent for Bitcoin

Stars: ✭ 573 (-4.66%)

Mutual labels: recurrent-neural-networks

Ner Lstm

Named Entity Recognition using multilayered bidirectional LSTM

Stars: ✭ 532 (-11.48%)

Mutual labels: recurrent-neural-networks

View All Similar Projects ➔

Speech synthesis using recurrent neural networks.

This repo has the code for our ICLR submission:

Jose Sotelo, Soroush Mehri, Kundan Kumar, João Felipe Santos, Kyle Kastner, Aaron Courville, Yoshua Bengio. Char2Wav: End-to-End Speech Synthesis.

The website is here.

NOTE: The code is currently being refactored/cleaned/documented. We wanted to make it available as soon as possible but we know well that the current version is not ready for replication. Therefore, if you're interested in this, please come back later. Alternatively, you can send me an email and I will let you know when it's ready.

NOTE(2): The code for the neural vocoder is based on sampleRNN.

Updates:

SampleRNN modules added
End-to-end model can generate more than 4s of audio in 1s on p6000 gpu. (This is achieved by generating 200 samples of length 10s in a single batch. Total time taken for this is 448 seconds.)

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 601

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (7) 🔗