kaldi-timit-sre-ivector: Develop a speaker recognition model based on i-vectors using the TIMIT database
Stars: ✭ 17 (-79.76%)
meta-SR: PyTorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)
Stars: ✭ 58 (-30.95%)
D-TDNN: PyTorch implementation of Densely Connected Time Delay Neural Network
Stars: ✭ 60 (-28.57%)
wavenet-classifier: Keras implementation of DeepMind's WaveNet for supervised learning tasks
Stars: ✭ 54 (-35.71%)
GE2E-Loss: PyTorch implementation of Generalized End-to-End Loss for speaker verification
Stars: ✭ 72 (-14.29%)
Speaker-Recognition: This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1
Stars: ✭ 94 (+11.9%)
bob: Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. Mirrored from https://gitlab.idiap.ch/bob/bob
Stars: ✭ 38 (-54.76%)
dropclass speaker: DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Stars: ✭ 20 (-76.19%)
FreeSR: A free library for speaker recognition (verification), implemented with ncnn.
Stars: ✭ 21 (-75%)
MiniVox: Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
Stars: ✭ 15 (-82.14%)
UniSpeech: UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+166.67%)
speaker extraction: Target speaker extraction and verification for multi-talker speech
Stars: ✭ 85 (+1.19%)
AESRC2020: A deep accent recognition network
Stars: ✭ 35 (-58.33%)
pytorch-mfcc: A PyTorch implementation of MFCC.
Stars: ✭ 30 (-64.29%)
meta-embeddings: Meta-embeddings are a probabilistic generalization of embeddings in machine learning.
Stars: ✭ 22 (-73.81%)
timit-preprocessor: Extract MFCC vectors and phones from the TIMIT dataset
Stars: ✭ 14 (-83.33%)
spafe: 🔉 Simplified Python Audio Features Extraction
Stars: ✭ 310 (+269.05%)
Piwho: Speaker recognition library based on MARF for Raspberry Pi and other SBCs.
Stars: ✭ 50 (-40.48%)
ConvolutionaNeuralNetworksToEnhanceCodedSpeech: In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral d…
Stars: ✭ 25 (-70.24%)
Voice-ML: MobileNet trained with the VoxCeleb dataset and used for voice verification
Stars: ✭ 15 (-82.14%)
sonopy: A simple audio feature extraction library
Stars: ✭ 72 (-14.29%)
AutoSpeech: [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei Zha, Zhangyang Wang
Stars: ✭ 195 (+132.14%)
Aubio: A library for audio and music analysis
Stars: ✭ 2,601 (+2996.43%)
Numpy Ml: Machine learning, in NumPy
Stars: ✭ 11,100 (+13114.29%)
scim: [WIP] Speech recognition toolbox written in Nim. Based on Arraymancer.
Stars: ✭ 17 (-79.76%)
speakerIdentificationNeuralNetworks: ⇨ The Speaker Recognition System consists of two phases, Feature Extraction and Recognition. ⇨ In the Extraction phase, the speaker's voice is recorded and a typical number of features is extracted to form a model. ⇨ During the Recognition phase, a speech sample is compared against a previously created voice print stored in the database. ⇨ The hi…
Stars: ✭ 26 (-69.05%)
Kaldi: kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+13175%)
Delta: DELTA is a deep-learning-based natural language and speech processing platform.
Stars: ✭ 1,479 (+1660.71%)
ASVspoof2019 system: D3M - Dynamic Data Discrepancy Mitigation for Anti-spoofing - Implementation of the work "Dynamically Mitigating Data Discrepancy with Balanced Focal Loss for Replay Attack Detection"
Stars: ✭ 22 (-73.81%)