All Projects → dansuh17 → Segan Pytorch

dansuh17 / Segan Pytorch

Licence: gpl-3.0
SEGAN pytorch implementation https://arxiv.org/abs/1703.09452

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Segan Pytorch

Looking To Listen At The Cocktail Party
Executable code based on Google articles
Stars: ✭ 83 (-8.79%)
Mutual labels:  audio
Ml Audio Start
Suggestions for those interested in developing audio applications of machine learning
Stars: ✭ 87 (-4.4%)
Mutual labels:  audio
Openaudiomc
A real time web client for minecraft to play and manage sounds, game notifications and much more. This project includes the plugin (bungee + spigot + velocity), common java library, web client and build scripts.
Stars: ✭ 89 (-2.2%)
Mutual labels:  audio
Audiokitsynthone
AudioKit Synth One: Open-Source iOS Synthesizer App
Stars: ✭ 1,258 (+1282.42%)
Mutual labels:  audio
Soma Fm Player
This is a Vue.js web application for streaming radio stations from Somafm.com. This app uses the public SomaFM JSON channels API endpoint to pull in a list of stations and makes it easy to switch between stations. This app also uses Three.js and the HTML5 Web Audio Context API to sample audio data and create a visualizer effect for the selected station.
Stars: ✭ 86 (-5.49%)
Mutual labels:  audio
Beet.js
Polyrhythmic Sequencer library for Web Audio API.
Stars: ✭ 87 (-4.4%)
Mutual labels:  audio
Minibae
The platform-neutral Beatnik Audio Engine, Mini Edition (miniBAE) is an exceptionally mature, well-rounded, and reliable computer music and sound system specially customized for small-footprint and embedded applications.
Stars: ✭ 82 (-9.89%)
Mutual labels:  audio
Balena Sound
Build a single or multi-room streamer for an existing audio device using a Raspberry Pi! Supports Bluetooth, Airplay and Spotify Connect
Stars: ✭ 1,306 (+1335.16%)
Mutual labels:  audio
Opusfile
Stand-alone decoder library for .opus streams
Stars: ✭ 86 (-5.49%)
Mutual labels:  audio
Libcrtc
WebRTC C++ library built on top of chromium webrtc.
Stars: ✭ 89 (-2.2%)
Mutual labels:  audio
Fastai
R interface to fast.ai
Stars: ✭ 85 (-6.59%)
Mutual labels:  audio
Arduino Music Player
MOD/S3M/XM/IT Music Player for Arduino
Stars: ✭ 85 (-6.59%)
Mutual labels:  audio
Mpd
Music Player Daemon
Stars: ✭ 1,289 (+1316.48%)
Mutual labels:  audio
Aurio
Audio Fingerprinting & Retrieval for .NET
Stars: ✭ 84 (-7.69%)
Mutual labels:  audio
Beep.js
Beep is a JavaScript toolkit for building browser-based synthesizers.
Stars: ✭ 1,294 (+1321.98%)
Mutual labels:  audio
Rust Game Development Frameworks
List of curated frameworks by the **Game Development in Rust** community.
Stars: ✭ 81 (-10.99%)
Mutual labels:  audio
Pipe
DSP pipeline
Stars: ✭ 87 (-4.4%)
Mutual labels:  audio
Backgroundmusic
Background Music, a macOS audio utility: automatically pause your music, set individual apps' volumes and record system audio.
Stars: ✭ 10,652 (+11605.49%)
Mutual labels:  audio
Pizzicato
Library to simplify the way you create and manipulate sounds with the Web Audio API.
Stars: ✭ 1,296 (+1324.18%)
Mutual labels:  audio
Sjmediacacheserver
A HTTP Media Caching Framework. It can cache FILE or HLS media. 音视频边播边缓存框架, 支持 HLS(m3u8) 和 FILE(mp4, mp3等).
Stars: ✭ 87 (-4.4%)
Mutual labels:  audio

Pytorch Implementation of SEGAN (Speech Enhancement GAN)

Implementation of SEGAN by Pascual et al. in 2017, using pytorch. Original Tensorflow version can be found here.

Prerequisites

  • python v3.5.2 or higher
  • pytorch v0.4.0
  • CUDA preferred
  • noisy speech dataset downloaded from here
  • libraries specified in requirements.txt

Installing Required Libraries

pip install -r requirements.txt

Data Preprocessing

Use data_preprocess.py file to preprocess downloaded data. Adjust the file paths at the beginning of the file to properly locate the data files, output folder, etc. Uncomment functions in __main__ to perform desired preprocessing stage.

Data preprocessing consists of three main stages:

  1. Downsampling - downsample original audio files (48k) to sampling rate of 16000.
  2. Serialization - Splitting the audio files into 2^14-sample (about 1 second) snippets.
  3. Verification - whether it contains proper number of samples.

Note that the second stage takes a fairly long time - more than an hour.

Training

python model.py

Again, fix and adjust datapaths in model.py according to your needs. Especially, provide accurate path to where serialized data are stored.

Using Tensorboard

In order to use tensorboard, you need to first install tensorboard:

pip install tensorboard

Then run tensorboard by specifing the log directory.

tensorboard --logdir=segan_data_out/tblogs
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].