PygspGraph Signal Processing in Python
Stars: ✭ 270 (+83.67%)
Transformer-TransducerPyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
Stars: ✭ 61 (-58.5%)
ConvDecoderAn un-trained neural network with a potential application in accelerated MRI
Stars: ✭ 21 (-85.71%)
antropyAntroPy: entropy and complexity of (EEG) time-series in Python
Stars: ✭ 111 (-24.49%)
adaptive-filtersMy collection of implementations of adaptive filters.
Stars: ✭ 32 (-78.23%)
Fengshenbang-LMFengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
Stars: ✭ 1,813 (+1133.33%)
Speech Feature ExtractionFeature extraction of speech signal is the initial stage of any speech recognition system.
Stars: ✭ 78 (-46.94%)
Wav2letterFacebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 5,907 (+3918.37%)
Nara wpeDifferent implementations of "Weighted Prediction Error" for speech dereverberation
Stars: ✭ 265 (+80.27%)
kospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
Stars: ✭ 456 (+210.2%)
Libreasr💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (+330.61%)
Awesome Speech EnhancementA tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Stars: ✭ 257 (+74.83%)
WenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (+319.73%)
wv⏰ This R package provides the tools to perform standard and robust wavelet variance analysis for time series (signal processing). Among others, aside from computing the wavelet variance and cross-covariance (classic and robust), the package provides inference tools (e.g. confidence intervals) and plotting tools allowing to perform some visual an…
Stars: ✭ 14 (-90.48%)
VoiceDictation迅飞 语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息,让机器能够“听懂”人类语言,相当于给机器安装上“耳朵”,使其具备“能听”的功能。
Stars: ✭ 36 (-75.51%)
ssqueezepySynchrosqueezing, wavelet transforms, and time-frequency analysis in Python
Stars: ✭ 315 (+114.29%)
pyssppython speech signal processing library
Stars: ✭ 18 (-87.76%)
filter-cElegant Butterworth and Chebyshev filter implemented in C, with float/double precision support. Works well on many platforms. You can also use this package in C++ and bridge to many other languages for good performance.
Stars: ✭ 56 (-61.9%)
dspfunSet of *nix utilities for experimentation and learning about spectral analysis of images
Stars: ✭ 21 (-85.71%)
qEEG feature setNEURAL: a neonatal EEG feature set in Matlab
Stars: ✭ 29 (-80.27%)
KodiSharpUse Kodi python APIs in C#, and write rich addons using the .NET framework/Mono
Stars: ✭ 22 (-85.03%)
fpbinaryFixed point package for Python.
Stars: ✭ 30 (-79.59%)
FftSharpA .NET Standard library for computing the Fast Fourier Transform (FFT) of real or complex data
Stars: ✭ 132 (-10.2%)
filtering-stft-and-laplace-transformSimple demo of filtering signal with an LPF and plotting its Short-Time Fourier Transform (STFT) and Laplace transform, in Python.
Stars: ✭ 50 (-65.99%)
mdctA fast MDCT implementation using SciPy and FFTs
Stars: ✭ 42 (-71.43%)
bobBob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob
Stars: ✭ 38 (-74.15%)
UHV-OTS-SpeechA data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (-36.05%)
2D 3D PolarFourierTransformC++, CUDA, and MATLAB codes for the paper "An Exact and Fast Computation of Discrete Fourier Transform for Polar and Spherical Grid"
Stars: ✭ 31 (-78.91%)
CtcwordbeamsearchConnectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Stars: ✭ 398 (+170.75%)
ksvd regRegularized K-SVD Algorithm
Stars: ✭ 29 (-80.27%)
dictlearnDictionary Learning for image processing
Stars: ✭ 23 (-84.35%)
bsuir-csn-cmsn-helperRepository containing ready-made laboratory works in the specialty of computing machines, systems and networks
Stars: ✭ 43 (-70.75%)
awesome-keyword-spottingThis repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (+2.04%)
idear🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (-42.86%)
CCWTComplex Continuous Wavelet Transform
Stars: ✭ 136 (-7.48%)
pySmoothA unique time series library in Python that consists of Kalman filters (discrete, extended, and unscented), online ARIMA, and time difference model.
Stars: ✭ 29 (-80.27%)
SubsyncSubtitle Speech Synchronizer
Stars: ✭ 379 (+157.82%)
FScape-nextAudio rendering software, based on UGen graphs. Issue tracker: https://codeberg.org/sciss/FScape-next/issues
Stars: ✭ 13 (-91.16%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+2983.67%)
RTspiceA real-time netlist based audio circuit plugin
Stars: ✭ 51 (-65.31%)
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (+140.82%)
QuakeMigrateA Python package for automatic earthquake detection and location using waveform migration and stacking.
Stars: ✭ 101 (-31.29%)
vor-python-decoderDecodes VOR signal from WAV file to get the bearing to your position
Stars: ✭ 33 (-77.55%)
Alan Sdk IosAlan AI iOS SDK adds a voice assistant or chatbot to your app. Supports Swift, Objective-C.
Stars: ✭ 318 (+116.33%)
kafboxA Matlab benchmarking toolbox for kernel adaptive filtering
Stars: ✭ 70 (-52.38%)
Pocketsphinx PythonPython interface to CMU Sphinxbase and Pocketsphinx libraries
Stars: ✭ 298 (+102.72%)
Metu-CENGAll the homeworks, studies and projects I've done at Metu-CENG
Stars: ✭ 32 (-78.23%)
Alan Sdk AndroidAlan AI Android SDK adds a voice assistant or chatbot to your app. Supports Java, Kotlin.
Stars: ✭ 278 (+89.12%)
iirjAn efficient IIR filter library written in JAVA
Stars: ✭ 95 (-35.37%)
Cn2an📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
Stars: ✭ 249 (+69.39%)
FScapeA standalone audio rendering software for time domain and spectral signal processing.
Stars: ✭ 61 (-58.5%)
ZerothKaldi-based Korean ASR (한국어 음성인식) open-source project
Stars: ✭ 248 (+68.71%)
esappAn unsupervised Chinese word segmentation tool.
Stars: ✭ 13 (-91.16%)
QuantumSpeech-QCNNIEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (-51.7%)
praiseDo stuff with your voice in the browser.
Stars: ✭ 13 (-91.16%)