andimarafioti / audioContextEncoder

Licence: other
A context encoder for audio inpainting

Audio inpainting with a context encoder

This project accompanies research on audio inpainting of small gaps, carried out at the Acoustics Research Institute in Vienna in collaboration with the Swiss Data Science Center. The paper was published in IEEE TASLP and is available at https://ieeexplore.ieee.org/document/8867915.

Installation

Install the requirements with pip install -r requirements.txt. On Windows, the NumPy version should be 1.14.0+mkl (find it here). For the FMA dataset, librosa requires ffmpeg as an mp3 backend.
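Since the FMA dataset depends on ffmpeg, it can help to confirm that ffmpeg is reachable before building that dataset. The following is a minimal sketch using only the Python standard library. The helper name ffmpeg_available is hypothetical and not part of this project, and the sketch assumes the executable is called ffmpeg and is expected on the PATH.

```python
import shutil


def ffmpeg_available(executable="ffmpeg"):
    """Return True if an executable with this name is found on the PATH.

    Hypothetical helper, not part of this project.
    """
    return shutil.which(executable) is not None


if __name__ == "__main__":
    if ffmpeg_available():
        print("ffmpeg found on PATH")
    else:
        print("ffmpeg not found; install it before building the FMA dataset")
```

This only checks that the executable can be located. It does not verify that librosa will succeed in decoding a particular mp3 file.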

Instructions

The paper uses both Google's NSynth dataset and the FMA dataset. To recreate the datasets used, run either python make_nsynthdataset.py or python make_fmadataset.py from the parent folder. Each script outputs three tfrecord files: one each for training, validating, and testing the model.
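To illustrate the idea of three disjoint subsets for training, validation, and testing, here is a small sketch that partitions a list of file names. It is not the logic of make_nsynthdataset.py or make_fmadataset.py, which may split the data differently or rely on splits shipped with the datasets. The function name split_files, the 80/10/10 proportions, and the seed are all assumptions made for illustration.

```python
import random


def split_files(paths, valid_fraction=0.1, test_fraction=0.1, seed=0):
    """Shuffle paths reproducibly and cut them into three disjoint lists.

    Hypothetical helper; the fractions are illustrative, not the project's.
    """
    shuffled = sorted(paths)               # fixed starting order
    random.Random(seed).shuffle(shuffled)  # reproducible shuffle
    n_valid = int(len(shuffled) * valid_fraction)
    n_test = int(len(shuffled) * test_fraction)
    valid = shuffled[:n_valid]
    test = shuffled[n_valid:n_valid + n_test]
    train = shuffled[n_valid + n_test:]
    return train, valid, test


if __name__ == "__main__":
    files = ["clip_{:03d}.wav".format(i) for i in range(100)]
    train, valid, test = split_files(files)
    print(len(train), len(valid), len(test))
```

Keeping the three subsets disjoint matters because any overlap between training and test data would make the reported test results overly optimistic.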

The default parameters for the network come pickled in the files magnitude_network_parameters.pkl and complex_network_parameters.pkl. To define other architectures, use saveParameters.py.
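For readers unfamiliar with pickled parameter files, the sketch below shows a generic round trip with Python's pickle module. The dictionary contents and the helper names are hypothetical; the actual structure stored in the .pkl files is defined by saveParameters.py and is not reproduced here.

```python
import os
import pickle
import tempfile


def save_parameters(parameters, path):
    """Write a Python object to path with pickle."""
    with open(path, "wb") as handle:
        pickle.dump(parameters, handle)


def load_parameters(path):
    """Read back whatever object was pickled at path."""
    with open(path, "rb") as handle:
        return pickle.load(handle)


if __name__ == "__main__":
    # Hypothetical contents, for illustration only.
    example = {"example_layer_count": 3, "example_filter_widths": [7, 5, 3]}
    with tempfile.TemporaryDirectory() as folder:
        path = os.path.join(folder, "example_parameters.pkl")
        save_parameters(example, path)
        print(load_parameters(path) == example)
```

If a pickle stores instances of classes defined in the project, it has to be loaded from an environment where those classes can be imported, which is presumably why the scripts are run from the parent folder. Only unpickle files from a source you trust, since loading a pickle can execute arbitrary code.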

To train the network, run python trainMagnitudeNetwork.py or python trainComplexNetwork.py from the parent folder. This trains the network for 600k steps with a learning rate of 1e-3. You can select which tfrecords to train on; by default, the script assumes you have created the NSynth dataset.

Sound examples
