fastaudio / Fastai_audio

Licence: mit
[DEPRECATED] 🔊️ Audio with fastaiv1

IMPORTANT

This package is now deprecated. Please use the new library instead: https://github.com/fastaudio/fastaudio

Fast AI Audio

This is an audio module built on top of FastAI to allow you to quickly and easily build machine learning models for a wide variety of audio applications. We are an unofficial library and have no official connection to fastai except that we are huge fans and want to help make their tools more widely available and applicable to audio.

Quick Start

Google Colab Notebook

Installation

You can install the library with pip, but if you are new to audio we recommend cloning the repo so that you can follow one of the notebooks.

  1. Git Clone
git clone https://github.com/fastaudio/fastai_audio
cd fastai_audio
pip install -e .
jupyter notebook
  2. Pip
pip install git+https://github.com/fastaudio/[email protected]

Tests

Dependencies for testing are listed in the dev section of the Pipfile. Automated CI tests run on Travis CI, configured by the .travis.yml file.

Setup for testing:

$ pip install pipenv
$ pipenv --three
$ pipenv shell
$ pipenv install --dev --skip-lock

Run tests:

$ pytest

Features

Audio Transform Pre-Processors

We provide a way to resample, remove silence and segment your items before generating spectrograms.

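These operations are cached and depend on each other in the order below, so changing a later step does not invalidate the earlier ones. For example, if you change your segment settings, the cached resampling and silence-removal results are reused.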

  • Resample: e.g. 44kHz to 16kHz
  • Remove Silence: Options to trim silence, split by silence, and remove all silence.
  • Segment: Chop up a long clip into segments, e.g. a 12s sample into 5s segments = [5s, 5s, 2s]
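
The segmenting step above can be illustrated with a minimal NumPy sketch. This is not the library's own implementation: the function name segment_signal and its parameters are hypothetical, chosen only to reproduce the 12s into [5s, 5s, 2s] example, and the final short segment is kept rather than padded or dropped.

```python
import numpy as np


def segment_signal(signal: np.ndarray, sample_rate: int, segment_seconds: float) -> list:
    """Split a 1-D signal into consecutive chunks of at most `segment_seconds`.

    Hypothetical helper for illustration; the last chunk may be shorter.
    """
    seg_len = int(round(segment_seconds * sample_rate))
    if seg_len < 1:
        raise ValueError("segment_seconds is too small for this sample rate")
    return [signal[start:start + seg_len] for start in range(0, len(signal), seg_len)]


sample_rate = 16_000                                      # samples per second
clip = np.zeros(12 * sample_rate, dtype=np.float32)       # a silent 12s clip
segments = segment_signal(clip, sample_rate, 5.0)
durations = [len(s) / sample_rate for s in segments]
print(durations)  # [5.0, 5.0, 2.0]
```

Keeping the short trailing segment matches the example in the list; whether to pad or discard it instead is a design choice that depends on the model's input requirements.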

Traintime Features

  • Realtime Spectrogram Generation
  • Spectrogram Caching
  • Data Augmentation for Spectrograms (SpecAugment, rolling, size changes)
  • MFCC (Mel-Frequency Cepstral Coefficient) Generation
  • Option to append delta/accelerate
  • and much more...
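
To give a feel for what SpecAugment-style augmentation does to a spectrogram, here is a small NumPy-only sketch. It is an assumption-laden illustration, not this library's API: mask_spectrogram, its parameters, and the zero fill value are all hypothetical, and it applies just one frequency mask and one time mask.

```python
import numpy as np


def mask_spectrogram(spec, max_freq_width, max_time_width, rng, fill_value=0.0):
    """Return a copy of `spec` (freq_bins x time_frames) with one random
    frequency band and one random time band set to `fill_value`.

    Hypothetical helper; widths are drawn uniformly from 0..max width.
    """
    masked = spec.copy()
    n_freq, n_time = masked.shape

    # Frequency mask: blank out a horizontal band of mel bins.
    f_width = int(rng.integers(0, max_freq_width + 1))
    f_start = int(rng.integers(0, n_freq - f_width + 1))
    masked[f_start:f_start + f_width, :] = fill_value

    # Time mask: blank out a vertical band of frames.
    t_width = int(rng.integers(0, max_time_width + 1))
    t_start = int(rng.integers(0, n_time - t_width + 1))
    masked[:, t_start:t_start + t_width] = fill_value
    return masked


rng = np.random.default_rng(0)
spec = np.ones((64, 128), dtype=np.float32)   # stand-in for a mel spectrogram
augmented = mask_spectrogram(spec, max_freq_width=8, max_time_width=16, rng=rng)
print(augmented.shape, int((augmented == 0).sum()), "cells masked")
```

Because the mask positions are drawn at random, applying this on each training pass shows the model a slightly different view of the same clip.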

Tutorials

  • 00-Getting Started - Shows basic functionality of the library and how to train a simple audio model
  • 01-Intro to Audio - A detailed intro guide to the basics of audio processing, librosa, and spectrograms. Not ML-specific and doesn't use this library
  • 02-Features - A detailed walk through all of the library's features.
  • 03-ESC-50 - Our first real audio model, achieving a new state-of-the-art result on an Environmental Sound Classification problem using melspectrograms, mixup, and a simple setup.
  • 04-Freesound Kaggle - A guide to using the library and its features for the Kaggle Freesound 2018 competition on acoustic scene classification. Also uses melspectrograms and mixup and includes inference on a test set.
  • Coming Soon: 05a-Googlespeech Kaggle MFCC+Delta - Using MFCCs with delta/accelerate stacking to enter the Google TensorFlow speech challenge from 2018. Includes semi-supervised learning (using a model to pseudolabel an unlabeled set).
  • Coming Soon: 05b-Googlespeech Kaggle Melspec Ensemble - An alternate model that uses melspectrograms and SpecAugment (no mixup). We then ensemble this with the model from 05a to do inference on a test set and submit.

Known Issues

  • We don't currently understand normalization for audio and the best way to implement it.
  • Inference and exporting models work in some cases but are broken in others, so check your outputs before using them.
  • Stats method can be extremely slow.
  • Cache folder can get extremely large. Remember to clear it using config.clear_cache() or by manually removing your cache folders.
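
For the manual route mentioned in the last item, a sketch like the following checks how large a cache folder has grown and then deletes it. The folder name and both helper functions are hypothetical (the README does not say where the cache lives), and the demo runs against a throwaway temporary directory rather than a real cache.

```python
import shutil
import tempfile
from pathlib import Path


def folder_size_bytes(folder: Path) -> int:
    """Total size of all regular files under `folder`."""
    return sum(p.stat().st_size for p in folder.rglob("*") if p.is_file())


def remove_cache_folder(folder: Path) -> int:
    """Delete `folder` and return the bytes freed (0 if it does not exist)."""
    if not folder.is_dir():
        return 0
    freed = folder_size_bytes(folder)
    shutil.rmtree(folder)
    return freed


# Demo on a temporary directory; point `demo_cache` at your own cache
# folder only after double-checking the path.
demo_root = Path(tempfile.mkdtemp())
demo_cache = demo_root / "hypothetical_cache"
demo_cache.mkdir()
(demo_cache / "item.bin").write_bytes(b"\0" * 1024)

freed = remove_cache_folder(demo_cache)
print(f"freed {freed} bytes")
shutil.rmtree(demo_root)
```

Where it is available, config.clear_cache() is the simpler option; the manual version is only useful when you want to inspect sizes first or clean up outside a Python session that has the library loaded.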

Contributors

We are looking for contributors of all skill levels. If you don't have time to contribute, please at least reach out and give us some feedback on the library by posting in the fastai audio thread or by sending a PM to @baz or @madeupmasters.

Citation

@misc{coultas_blum_bracco_2019,
  title={fastaudio/fastai_audio},
  url={https://github.com/fastaudio/fastai_audio},
  journal={GitHub},
  author={Coultas Blum, Harry A and Bracco, Robert},
  year={2019},
  month={Jul}
}