Speech Feature ExtractionFeature extraction of speech signal is the initial stage of any speech recognition system.
Stars: ✭ 78 (+212%)
ShifterPitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (-12%)
PeekingDuckA modular framework built to simplify Computer Vision inference workloads.
Stars: ✭ 143 (+472%)
transtatsTrack translations and automate workflow.
Stars: ✭ 31 (+24%)
pipeFunctional Pipeline in Go
Stars: ✭ 30 (+20%)
XProc-ZA platform for running XProc pipelines as web applications in a Java servlet container
Stars: ✭ 20 (-20%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+720%)
pd3f🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based
Stars: ✭ 132 (+428%)
PyllusionA Parametric Framework to Generate Visual Illusions using Python
Stars: ✭ 35 (+40%)
anycontrolVoice control for your websites and applications
Stars: ✭ 53 (+112%)
hicAnalysis of Chromosome Conformation Capture data (Hi-C)
Stars: ✭ 45 (+80%)
ApolloApollo is a Open-Source music player for playback and organization of audio files on Microsoft Windows, built using Python.
Stars: ✭ 13 (-48%)
dspfunSet of *nix utilities for experimentation and learning about spectral analysis of images
Stars: ✭ 21 (-16%)
Multimodal-Gesture-Recognition-with-LSTMs-and-CTCAn end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.
Stars: ✭ 25 (+0%)
IMS-ToucanText-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+1080%)
txt2speechConvert text to speech using Google Translate API
Stars: ✭ 38 (+52%)
nextNEOpinextNEOpi: a comprehensive pipeline for computational neoantigen prediction
Stars: ✭ 42 (+68%)
VQMIVCOfficial implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
Stars: ✭ 278 (+1012%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+616%)
bobBob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob
Stars: ✭ 38 (+52%)
OpenMaterial3D model exchange format with physical material properties for virtual development, test and validation of automated driving.
Stars: ✭ 23 (-8%)
AnimationDNAMaya > Arnold > Nuke pipeline
Stars: ✭ 101 (+304%)
pyRiemannPython machine learning package based on sklearn API for multivariate data processing and statistical analysis of symmetric positive definite matrices via Riemannian geometry
Stars: ✭ 470 (+1780%)
Robotics-ResourcesList of commonly used robotics libraries and packages
Stars: ✭ 71 (+184%)
ewtpyEmpirical wavelet transform (EWT) in Python
Stars: ✭ 52 (+108%)
lncpipeUNDER DEVELOPMENT--- Analysis of long non-coding RNAs from RNA-seq datasets
Stars: ✭ 24 (-4%)
wv⏰ This R package provides the tools to perform standard and robust wavelet variance analysis for time series (signal processing). Among others, aside from computing the wavelet variance and cross-covariance (classic and robust), the package provides inference tools (e.g. confidence intervals) and plotting tools allowing to perform some visual an…
Stars: ✭ 14 (-44%)
TF-Speech-Recognition-Challenge-SolutionSource code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.
Stars: ✭ 58 (+132%)
capeContinuous Augmented Positional Embeddings (CAPE) implementation for PyTorch
Stars: ✭ 29 (+16%)
makepipeTools for constructing simple make-like pipelines in R.
Stars: ✭ 23 (-8%)
ventib📈 Ventib records your voice, transcribes it in realtime, and performs speech pattern analysis to give you objective statistics about how you speak.
Stars: ✭ 43 (+72%)
scATAC-proA comprehensive tool for processing, analyzing and visulizing single cell chromatin accessibility sequencing data
Stars: ✭ 63 (+152%)
torchsubbandPytorch implementation of subband decomposition
Stars: ✭ 63 (+152%)
oouraJavascript port of Ooura FFT implementation
Stars: ✭ 23 (-8%)
pytorch-pcenPyTorch reimplementation of per-channel energy normalization for audio.
Stars: ✭ 80 (+220%)
gr-eventstreamgr-eventstream is a set of GNU Radio blocks for creating precisely timed events and either inserting them into, or extracting them from normal data-streams precisely. It allows for the definition of high speed time-synchronous c++ burst event handlers, as well as bridging to standard GNU Radio Async PDU messages with precise timing easily.
Stars: ✭ 38 (+52%)
gulp-sortSort files in stream by path or any custom sort comparator
Stars: ✭ 22 (-12%)
fpbinaryFixed point package for Python.
Stars: ✭ 30 (+20%)
pipelinePipelines using goroutines
Stars: ✭ 46 (+84%)
idear🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (+236%)
datajobBuild and deploy a serverless data pipeline on AWS with no effort.
Stars: ✭ 101 (+304%)
form2fit[ICRA 2020] Train generalizable policies for kit assembly with self-supervised dense correspondence learning.
Stars: ✭ 78 (+212%)
ember-pipelineRailway oriented programming in Ember
Stars: ✭ 17 (-32%)
FScape-nextAudio rendering software, based on UGen graphs. Issue tracker: https://codeberg.org/sciss/FScape-next/issues
Stars: ✭ 13 (-48%)
functionsAn Open Source Serverless Platform
Stars: ✭ 44 (+76%)
browser-apis🦄 Cool & Fun Browser Web APIs 🥳
Stars: ✭ 21 (-16%)
nemesystGeneralised and highly customisable, hybrid-parallelism, database based, deep learning framework.
Stars: ✭ 17 (-32%)
lectures-allCentral repository for all lectures on deep learning at UPC ETSETB TelecomBCN.
Stars: ✭ 46 (+84%)
assume-role-arn🤖🎩assume-role-arn allows you to easily assume an AWS IAM role in your CI/CD pipelines, without worrying about external dependencies.
Stars: ✭ 54 (+116%)
CNCC-2019Computational Neuroscience Crash Course (CNCC 2019)
Stars: ✭ 26 (+4%)
NBSSThe official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".
Stars: ✭ 77 (+208%)