SuperKogito / spafe

License: BSD-3-Clause
🔉 spafe: Simplified Python Audio Features Extraction

Spafe

Simplified Python Audio Features Extraction

Structure

spafe aims to simplify feature extraction from mono audio files. It includes various computations related to filter banks, spectrograms, frequencies, and cepstral features. The library has the following structure:

Filter banks

  • Bark filter banks
  • Gammatone filter banks
  • Linear filter banks
  • Mel filter banks
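
As an illustration of what a filter bank on a perceptual scale involves, here is a minimal sketch of the mel scale conversion and of filter center frequencies spaced evenly on that scale. It is not spafe's implementation: the function names are hypothetical, and the O'Shaughnessy formula is only one of several common mel conversions.

```python
import math


def hz_to_mel(f_hz):
    """Convert Hz to mel using the O'Shaughnessy formula (an assumed choice)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_center_frequencies(nfilts, low_hz, high_hz):
    """Center frequencies (Hz) of nfilts filters equally spaced on the mel scale.

    The band [low_hz, high_hz] is split into nfilts + 1 equal mel steps;
    the interior points are the filter centers.
    """
    low_mel, high_mel = hz_to_mel(low_hz), hz_to_mel(high_hz)
    step = (high_mel - low_mel) / (nfilts + 1)
    return [mel_to_hz(low_mel + i * step) for i in range(1, nfilts + 1)]


centers = mel_center_frequencies(nfilts=10, low_hz=0.0, high_hz=8000.0)
print([round(c, 1) for c in centers])
```

The centers are dense at low frequencies and sparse at high frequencies, which is the property that distinguishes mel (and, with different formulas, Bark or ERB) filter banks from linear ones.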

Spectrograms

  • Bark spectrogram
  • CQT spectrogram
  • Erb spectrogram
  • Mel spectrogram

Features

  • Bark Frequency Cepstral Coefficients (BFCCs)
  • Constant Q-transform Cepstral Coefficients (CQCCs)
  • Gammatone Frequency Cepstral Coefficients (GFCCs)
  • Linear Frequency Cepstral Coefficients (LFCCs)
  • Linear Prediction Components (LPCs)
  • Mel Frequency Cepstral Coefficients (MFCCs)
  • Inverse Mel Frequency Cepstral Coefficients (IMFCCs)
  • Magnitude based Spectral Root Cepstral Coefficients (MSRCCs)
  • Normalized Gammachirp Cepstral Coefficients (NGCCs)
  • Power-Normalized Cepstral Coefficients (PNCCs)
  • Phase based Spectral Root Cepstral Coefficients (PSRCCs)
  • Perceptual Linear Prediction Coefficients (PLPs)
  • Rasta Perceptual Linear Prediction Coefficients (RPLPs)
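
Many of the cepstral features listed above share a final step: take the logarithm of the filter-bank energies and decorrelate them with a discrete cosine transform. The sketch below shows only that generic step in pure Python. It is an assumption-laden illustration, not spafe's code: the function names are hypothetical, the DCT-II is unnormalized, and real implementations add framing, windowing, liftering, and normalization.

```python
import math


def dct_ii(values):
    """Unnormalized DCT-II of a list of floats."""
    n = len(values)
    return [
        sum(v * math.cos(math.pi * k * (2 * i + 1) / (2 * n))
            for i, v in enumerate(values))
        for k in range(n)
    ]


def cepstral_coefficients(filterbank_energies, num_ceps):
    """Log-compress filter-bank energies, apply the DCT, keep the first num_ceps."""
    eps = 1e-12  # avoids log(0) for empty bands
    log_energies = [math.log(e + eps) for e in filterbank_energies]
    return dct_ii(log_energies)[:num_ceps]


# Hypothetical energies for one frame passed through 8 filters.
frame_energies = [4.0, 9.5, 7.2, 3.1, 1.4, 0.8, 0.3, 0.1]
print(cepstral_coefficients(frame_energies, num_ceps=4))
```

Swapping the filter bank (mel, Bark, gammatone, linear) while keeping this step is, roughly, what separates MFCCs, BFCCs, GFCCs, and LFCCs from one another.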

The theory behind the features computed using spafe can be summarized in the following graph:

Frequencies

  • Dominant frequencies
  • Fundamental frequencies
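
To make the idea of a dominant frequency concrete, the following sketch finds the strongest spectral peak of a short synthetic signal with a naive DFT. It is an illustration under stated assumptions rather than spafe's algorithm: the helper names are hypothetical, the whole signal is treated as a single frame, and the O(n²) DFT is only suitable for short inputs.

```python
import cmath
import math


def dft_magnitudes(signal):
    """Magnitudes of the non-negative-frequency DFT bins (naive O(n^2) DFT)."""
    n = len(signal)
    return [
        abs(sum(x * cmath.exp(-2j * cmath.pi * k * t / n)
                for t, x in enumerate(signal)))
        for k in range(n // 2 + 1)
    ]


def dominant_frequency(signal, fs):
    """Frequency (Hz) of the largest-magnitude bin, ignoring the DC bin."""
    mags = dft_magnitudes(signal)
    peak_bin = max(range(1, len(mags)), key=mags.__getitem__)
    return peak_bin * fs / len(signal)


# Synthetic test tone: a strong 50 Hz component plus a weaker 120 Hz one.
fs = 800  # sampling rate in Hz
n = 400   # number of samples, giving a 2 Hz bin resolution
tone = [
    math.sin(2 * math.pi * 50 * t / fs) + 0.3 * math.sin(2 * math.pi * 120 * t / fs)
    for t in range(n)
]
print(dominant_frequency(tone, fs))  # the 50 Hz component dominates
```

The dominant frequency is simply where most energy sits; the fundamental frequency (pitch) can differ from it, for example when a harmonic is louder than the fundamental.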

Installation

Dependencies

spafe requires:

If you want to use the visualization module/functions of spafe, you will need to install:

Installation guide

Once you have the dependencies installed, use one of the following install options.

Install from PyPI

  • To freshly install spafe:
pip install spafe
  • To update an existing installation:
pip install -U spafe

Install from Anaconda

  • Spafe is also available on anaconda:
conda install spafe

Install from source

  • You can build spafe from source as follows:
git clone git@github.com:SuperKogito/spafe.git
cd spafe
python setup.py install
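
Whichever install option you choose, a quick way to confirm the result is to query the installed distribution's metadata. The snippet below is a small sketch using only the Python standard library (Python 3.8+); the helper name `installed_version` is hypothetical and not part of spafe.

```python
from importlib import metadata


def installed_version(package="spafe"):
    """Return the installed version string of a package, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


version = installed_version()
print(f"spafe {version}" if version else "spafe is not installed")
```

Knowing the exact version also helps when picking the matching documentation version.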

Why use Spafe?

Unlike most existing audio feature extraction libraries (python_speech_features, SpeechPy, surfboard and Bob), Spafe provides more options for spectral feature extraction algorithms, notably:

  • Bark Frequency Cepstral Coefficients (BFCCs)
  • Constant Q-transform Cepstral Coefficients (CQCCs)
  • Gammatone Frequency Cepstral Coefficients (GFCCs)
  • Power-Normalized Cepstral Coefficients (PNCCs)
  • Phase based Spectral Root Cepstral Coefficients (PSRCCs)

Most existing libraries, to their credit, provide great implementations of feature extraction, but they are unfortunately limited to Mel Frequency Cepstral Coefficients (MFCCs) and at best additionally offer Bark frequency and linear predictive coefficients. Librosa, for example, includes great implementations of various algorithms (of the features above, only MFCC and LPC are included) based on the Short-Time Fourier Transform (STFT), which is theoretically more accurate but slower than the Discrete Fourier Transform used in Spafe's implementation.

How to use

Various examples of how to use spafe are available in the documentation: https://superkogito.github.io/spafe.

Note: Please make sure you are referring to the correct documentation version.

Contributing

Contributions are welcome and encouraged. To learn more about how to contribute to spafe, please refer to the Contributing guidelines.

Citing

  • If you want to cite spafe as software used in your work, please cite the version you used, as indexed on Zenodo.