SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+1112.7%)
ConvolutionaNeuralNetworksToEnhanceCodedSpeechIn this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral d…
Stars: ✭ 25 (-60.32%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+1234.92%)
SpleeterRTReal time monaural source separation base on fully convolutional neural network operates on Time-frequency domain.
Stars: ✭ 111 (+76.19%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+255.56%)
bobBob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob
Stars: ✭ 38 (-39.68%)
Speechbrain.github.ioThe SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+284.13%)
PnccA implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-36.51%)
UHV-OTS-SpeechA data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (+49.21%)
Keras SincnetKeras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-25.4%)
QuantumSpeech-QCNNIEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (+12.7%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+7095.24%)
awesome-speech-enhancementA curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
Stars: ✭ 48 (-23.81%)
Formant AnalyzeriOS application for finding formants in spoken sounds
Stars: ✭ 43 (-31.75%)
speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-71.43%)
Zzz Retired opensttRETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (+131.75%)
UspeechSpeech recognition toolkit for the arduino
Stars: ✭ 448 (+611.11%)
DlaDeep learning for audio processing
Stars: ✭ 142 (+125.4%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-57.14%)
Awesome DiarizationA curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (+968.25%)
scim[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (-73.02%)
ShifterPitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (-65.08%)
spafe🔉 spafe: Simplified Python Audio Features Extraction
Stars: ✭ 310 (+392.06%)
pyssppython speech signal processing library
Stars: ✭ 18 (-71.43%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+225.4%)
Tutorial separationThis repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.
Stars: ✭ 151 (+139.68%)
awesome-keyword-spottingThis repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (+138.1%)
NonautoreggenprogressTracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (+87.3%)
Awesome Speech EnhancementA tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Stars: ✭ 257 (+307.94%)
SurfboardNovoic's audio feature extraction library
Stars: ✭ 318 (+404.76%)
QminerAnalytic platform for real-time large-scale streams containing structured and unstructured data.
Stars: ✭ 206 (+226.98%)
obviA Polymer 3+ webcomponent / button for doing speech recognition
Stars: ✭ 54 (-14.29%)
PytorchwaveletsPyTorch implementation of the wavelet analysis from Torrence & Compo (1998)
Stars: ✭ 197 (+212.7%)
ResdetDetect source resolution of upscaled images
Stars: ✭ 191 (+203.17%)
anycontrolVoice control for your websites and applications
Stars: ✭ 53 (-15.87%)
TftbA Python module for time-frequency analysis
Stars: ✭ 185 (+193.65%)
RustfftA mixed-radix FFT library written in pure Rust
Stars: ✭ 183 (+190.48%)
TF-Speech-Recognition-Challenge-SolutionSource code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.
Stars: ✭ 58 (-7.94%)
PycbcCore package to analyze gravitational-wave data, find signals, and study their parameters. This package was used in the first direct detection of gravitational waves (GW150914), and is used in the ongoing analysis of LIGO/Virgo data.
Stars: ✭ 177 (+180.95%)
BrainflowBrainFlow is a library intended to obtain, parse and analyze EEG, EMG, ECG and other kinds of data from biosensors
Stars: ✭ 170 (+169.84%)
pyRiemannPython machine learning package based on sklearn API for multivariate data processing and statistical analysis of symmetric positive definite matrices via Riemannian geometry
Stars: ✭ 470 (+646.03%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+225.4%)
Audio Reactive Led Strip🎵 🌈 Real-time LED strip music visualization using Python and the ESP8266 or Raspberry Pi
Stars: ✭ 2,217 (+3419.05%)
Signals And Systems LectureContinuous- and Discrete-Time Signals and Systems - Theory and Computational Examples
Stars: ✭ 166 (+163.49%)
ApolloApollo is a Open-Source music player for playback and organization of audio files on Microsoft Windows, built using Python.
Stars: ✭ 13 (-79.37%)
StocksPrograms for stock prediction and evaluation
Stars: ✭ 155 (+146.03%)
oouraJavascript port of Ooura FFT implementation
Stars: ✭ 23 (-63.49%)
Computer Vision Video LecturesA curated list of free, high-quality, university-level courses with video lectures related to the field of Computer Vision.
Stars: ✭ 154 (+144.44%)
IMS-ToucanText-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+368.25%)
PycwtA Python module for continuous wavelet spectral analysis. It includes a collection of routines for wavelet transform and statistical analysis via FFT algorithm. In addition, the module also includes cross-wavelet transforms, wavelet coherence tests and sample scripts.
Stars: ✭ 146 (+131.75%)
PycroscopyScientific analysis of nanoscale materials imaging data
Stars: ✭ 144 (+128.57%)
multilingual kwsFew-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
Stars: ✭ 122 (+93.65%)
wv⏰ This R package provides the tools to perform standard and robust wavelet variance analysis for time series (signal processing). Among others, aside from computing the wavelet variance and cross-covariance (classic and robust), the package provides inference tools (e.g. confidence intervals) and plotting tools allowing to perform some visual an…
Stars: ✭ 14 (-77.78%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+461.9%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-66.67%)