All Projects → richardassar → Samplernn_torch

richardassar / Samplernn_torch

Torch implementation of SampleRNN: An Unconditional End-to-End Neural Audio Generation Model

Programming Languages

lua
6591 projects

Projects that are alternatives of or similar to Samplernn torch

Neural Style Audio Torch
Torch implementation for audio neural style.
Stars: ✭ 130 (-10.96%)
Mutual labels:  torch, audio
React Native Sound Player
Play sound file in ReactNative
Stars: ✭ 144 (-1.37%)
Mutual labels:  audio
Mediafile
A unified reader of metadata from audio & video files.
Stars: ✭ 138 (-5.48%)
Mutual labels:  audio
Synthesize3dviadepthorsil
[CVPR 2017] Generation and reconstruction of 3D shapes via modeling multi-view depth maps or silhouettes
Stars: ✭ 141 (-3.42%)
Mutual labels:  torch
Audio Plugin Development Resources
Various resources related to developing plugins for audio production.
Stars: ✭ 136 (-6.85%)
Mutual labels:  audio
Esp32 Projects
ESP32 好玩、有趣、实用的项目
Stars: ✭ 142 (-2.74%)
Mutual labels:  audio
Prediction Flow
Deep-Learning based CTR models implemented by PyTorch
Stars: ✭ 138 (-5.48%)
Mutual labels:  torch
Digital video introduction
A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding).
Stars: ✭ 12,184 (+8245.21%)
Mutual labels:  audio
Musicplayer
A minimal music player built on electron.
Stars: ✭ 145 (-0.68%)
Mutual labels:  audio
Openob
The Open Outside Broadcast project for radio contribution links and studio-transmitter links.
Stars: ✭ 140 (-4.11%)
Mutual labels:  audio
Mkchromecast
Cast macOS and Linux Audio/Video to your Google Cast and Sonos Devices
Stars: ✭ 1,837 (+1158.22%)
Mutual labels:  audio
Defer.js
🥇 A super small, super efficient library that helps you lazy load everything like images, video, audio, iframe as well as stylesheets, and JavaScript.
Stars: ✭ 138 (-5.48%)
Mutual labels:  audio
Youcast
Turn YouTube Channels into Subscribable Podcasts.
Stars: ✭ 142 (-2.74%)
Mutual labels:  audio
Prism Media
Easily transcode media using Node.js 🎶
Stars: ✭ 136 (-6.85%)
Mutual labels:  audio
Learningcoreaudiowithswift2.0
All the examples of the Learning Core Audio book rewritten with Swift 2.0
Stars: ✭ 145 (-0.68%)
Mutual labels:  audio
Malgo
Mini audio library
Stars: ✭ 138 (-5.48%)
Mutual labels:  audio
Plugin.audiorecorder
Audio Recorder plugin for Xamarin and Windows
Stars: ✭ 140 (-4.11%)
Mutual labels:  audio
Airsonic
📡 ☁️ 🎶Airsonic, a Free and Open Source community driven media server (fork of Subsonic and Libresonic)
Stars: ✭ 1,876 (+1184.93%)
Mutual labels:  audio
Audio Steganography Algorithms
A Library of Audio Steganography & Watermarking Algorithms
Stars: ✭ 146 (+0%)
Mutual labels:  audio
Essentia
C++ library for audio and music analysis, description and synthesis, including Python bindings
Stars: ✭ 1,985 (+1259.59%)
Mutual labels:  audio

SampleRNN_torch

A Torch implementation of SampleRNN: An Unconditional End-to-End Neural Audio Generation Model.

A visual representation of the SampleRNN architecture

Samples

Listen to a selection of generated output at the following links:

Feel free to submit links to any interesting output you generate or dataset creation scripts as a pull request.

Dependencies

The following packages are required to run SampleRNN_torch:

  • nn
  • cunn
  • cudnn
  • rnn
  • optim
  • audio
  • xlua
  • gnuplot

NOTE: Update nn and cudnn even if they were already installed as fixes have been submitted which affect this project.

Datasets

To retrieve and prepare the piano dataset, as used in the reference implementation, run:

cd datasets/piano/
./create_dataset.sh

Other dataset preparation scripts may be found under datasets/.

Custom datasets may be created by using scripts/generate_dataset.lua to slice multiple audio files into segments for training, audio must be placed in datasets/[dataset]/data/.

Training

To start a training session run th train.lua -dataset piano. To view a description of all accepted arguments run th train.lua -help.

To view the progress of training run th generate_plots, the loss and gradient norm curve will be saved in sessions/[session]/plots/.

Sampling

By default samples are generated at the end of every training epoch but they can also be generated separately using th train.lua -generate_samples with the session parameter to specify the model.

Multiple samples are generated in batch mode for efficiency, however generating a single audio sample is faster with th fast_sample.lua. See -help for a description of the arguments.

Models

A pretrained model of the piano dataset is available here. Download and copy it into your sessions/ directory and then extract it in place.

More models will be uploaded soon.

Theano version

This code is based on the reference implementation in Theano.

https://github.com/soroushmehr/sampleRNN_ICLR2017

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].