SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+140.25%)
EdspA cross-platform DSP library written in C++ 11/14. This library harnesses the power of C++ templates to implement a complete set of DSP algorithms.
Stars: ✭ 116 (-63.52%)
DtlnTensorflow 2.x implementation of the DTLN real time speech denoising model. With TF-lite, ONNX and real-time audio processing support.
Stars: ✭ 147 (-53.77%)
Keras SincnetKeras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-85.22%)
Nara wpeDifferent implementations of "Weighted Prediction Error" for speech dereverberation
Stars: ✭ 265 (-16.67%)
bobBob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob
Stars: ✭ 38 (-88.05%)
AurioAudio Fingerprinting & Retrieval for .NET
Stars: ✭ 84 (-73.58%)
DawdreamerDigital Audio Workstation with Python; VST instruments/effects, parameter automation, and native processors
Stars: ✭ 119 (-62.58%)
Noise reductionSpeech noise reduction which was generated using existing post-production techniques implemented in Python
Stars: ✭ 130 (-59.12%)
AvdemoDemo projects for iOS Audio & Video development.
Stars: ✭ 136 (-57.23%)
OttoSampler, Sequencer, Multi-engine synth and effects - in a box! [WIP]
Stars: ✭ 2,390 (+651.57%)
FullsubnetPyTorch implementation of "A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Stars: ✭ 51 (-83.96%)
BeepA little package that brings sound to any Go application. Suitable for playback and audio-processing.
Stars: ✭ 1,168 (+267.3%)
Aukitaudio toolkit. 好用的语音处理工具箱,包含语音降噪、音频格式转换、特征频谱生成等模块。
Stars: ✭ 105 (-66.98%)
Mad TwinnetThe code for the MaD TwinNet. Demo page:
Stars: ✭ 99 (-68.87%)
ScaperA library for soundscape synthesis and augmentation
Stars: ✭ 186 (-41.51%)
AudinoOpen source audio annotation tool for humans™
Stars: ✭ 740 (+132.7%)
GiadaYour Hardcore Loop Machine.
Stars: ✭ 903 (+183.96%)
PliersAutomated feature extraction in Python
Stars: ✭ 243 (-23.58%)
Mne CppMNE-CPP: A Framework for Electrophysiology
Stars: ✭ 104 (-67.3%)
Awesome Speech EnhancementA tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Stars: ✭ 257 (-19.18%)
SpleeterRTReal time monaural source separation base on fully convolutional neural network operates on Time-frequency domain.
Stars: ✭ 111 (-65.09%)
torchsubbandPytorch implementation of subband decomposition
Stars: ✭ 63 (-80.19%)
spafe🔉 spafe: Simplified Python Audio Features Extraction
Stars: ✭ 310 (-2.52%)
ShifterPitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (-93.08%)
KfrFast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
Stars: ✭ 985 (+209.75%)
GuitardNode based multi effects audio processor
Stars: ✭ 31 (-90.25%)
Awesome Web AudioA list of resources and projects to help learn about audio
Stars: ✭ 73 (-77.04%)
PyoPython DSP module
Stars: ✭ 904 (+184.28%)
Audio SnrMixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
Stars: ✭ 100 (-68.55%)
Libopenshot AudioOpenShot Audio Library (libopenshot-audio) is a free, open-source project that enables high-quality editing and playback of audio, and is based on the amazing JUCE library.
Stars: ✭ 120 (-62.26%)
EuterpeReal-time Audio-to-audio Karaoke Generation System for Monaural Music
Stars: ✭ 19 (-94.03%)
Img EncodeEncode an image to sound and view it as a spectrogram - turn your images into music
Stars: ✭ 157 (-50.63%)
MwengineAudio engine and DSP for Android, written in C++ providing low latency performance in a musical context, supporting both OpenSL and AAudio.
Stars: ✭ 190 (-40.25%)
Prism MediaEasily transcode media using Node.js 🎶
Stars: ✭ 136 (-57.23%)
MixingBearPackage for automatic beat-mixing of music files in Python 🐻🎚
Stars: ✭ 73 (-77.04%)
RS-METCodebase for RS-MET products (Robin Schmidt's Music Engineering Tools)
Stars: ✭ 32 (-89.94%)
awesome-multimodal-mlReading list for research topics in multimodal machine learning
Stars: ✭ 3,125 (+882.7%)
Speech signal processing and classificationFront-end speech processing aims at extracting proper features from short- term segments of a speech utterance, known as frames. It is a pre-requisite step toward any pattern recognition problem employing speech or audio (e.g., music). Here, we are interesting in voice disorder classification. That is, to develop two-class classifiers, which can discriminate between utterances of a subject suffering from say vocal fold paralysis and utterances of a healthy subject.The mathematical modeling of the speech production system in humans suggests that an all-pole system function is justified [1-3]. As a consequence, linear prediction coefficients (LPCs) constitute a first choice for modeling the magnitute of the short-term spectrum of speech. LPC-derived cepstral coefficients are guaranteed to discriminate between the system (e.g., vocal tract) contribution and that of the excitation. Taking into account the characteristics of the human ear, the mel-frequency cepstral coefficients (MFCCs) emerged as descriptive features of the speech spectral envelope. Similarly to MFCCs, the perceptual linear prediction coefficients (PLPs) could also be derived. The aforementioned sort of speaking tradi- tional features will be tested against agnostic-features extracted by convolu- tive neural networks (CNNs) (e.g., auto-encoders) [4]. The pattern recognition step will be based on Gaussian Mixture Model based classifiers,K-nearest neighbor classifiers, Bayes classifiers, as well as Deep Neural Networks. The Massachussets Eye and Ear Infirmary Dataset (MEEI-Dataset) [5] will be exploited. At the application level, a library for feature extraction and classification in Python will be developed. Credible publicly available resources will be 1used toward achieving our goal, such as KALDI. Comparisons will be made against [6-8].
Stars: ✭ 155 (-51.26%)
Nwaves.NET library for 1D signal processing focused specifically on audio processing
Stars: ✭ 151 (-52.52%)
AudioowlFast and simple music and audio analysis using RNN in Python 🕵️♀️ 🥁
Stars: ✭ 151 (-52.52%)
DDCToolboxCreate and edit DDC headset correction files
Stars: ✭ 70 (-77.99%)
FluXA convenient way of processing digital signals in F#
Stars: ✭ 17 (-94.65%)
DiViMeACLEW Diarization Virtual Machine
Stars: ✭ 28 (-91.19%)
StrugatzkiAlgorithms for matching audio file similarities. Mirror of https://git.iem.at/sciss/Strugatzki
Stars: ✭ 38 (-88.05%)
antropyAntroPy: entropy and complexity of (EEG) time-series in Python
Stars: ✭ 111 (-65.09%)
Speech Feature ExtractionFeature extraction of speech signal is the initial stage of any speech recognition system.
Stars: ✭ 78 (-75.47%)
pyssppython speech signal processing library
Stars: ✭ 18 (-94.34%)
FfmediaelementFFME: The Advanced WPF MediaElement (based on FFmpeg)
Stars: ✭ 733 (+130.5%)
MltMLT Multimedia Framework
Stars: ✭ 836 (+162.89%)
Tutorial separationThis repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
Stars: ✭ 151 (-52.52%)
python-soxrFast and high quality sample-rate conversion library for Python
Stars: ✭ 25 (-92.14%)
RTspiceA real-time netlist based audio circuit plugin
Stars: ✭ 51 (-83.96%)
DTMF-DecoderA Java program to implement a DMTF Decoder.
Stars: ✭ 28 (-91.19%)