All Projects → cvqluu → GE2E-Loss

cvqluu / GE2E-Loss

Licence: other
Pytorch implementation of Generalized End-to-End Loss for speaker verification

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to GE2E-Loss

KaldiBasedSpeakerVerification
Kaldi based speaker verification
Stars: ✭ 43 (-40.28%)
Mutual labels:  speaker-recognition, speaker-verification, speaker-identification
Huawei-Challenge-Speaker-Identification
Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.
Stars: ✭ 34 (-52.78%)
Mutual labels:  speaker-recognition, speaker-verification, speaker-identification
wavenet-classifier
Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
Stars: ✭ 54 (-25%)
Mutual labels:  speaker-recognition, speaker-verification, speaker-identification
deepaudio-speaker
neural network based speaker embedder
Stars: ✭ 19 (-73.61%)
Mutual labels:  speaker-recognition, speaker-verification, speaker-diarization
dropclass speaker
DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
Stars: ✭ 20 (-72.22%)
Mutual labels:  speaker-recognition, speaker-verification, speaker-identification
D-TDNN
PyTorch implementation of Densely Connected Time Delay Neural Network
Stars: ✭ 60 (-16.67%)
Mutual labels:  speaker-recognition, speaker-verification, speaker-diarization
Speaker-Recognition
This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1
Stars: ✭ 94 (+30.56%)
Mutual labels:  speaker-recognition, speaker-verification, speaker-identification
UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (+30.56%)
Mutual labels:  speaker-diarization, speaker-identification
speaker-recognition-papers
Share some recent speaker recognition papers and their implementations.
Stars: ✭ 92 (+27.78%)
Mutual labels:  speaker-recognition, speaker-verification
Voice-ML
MobileNet trained with VoxCeleb dataset and used for voice verification
Stars: ✭ 15 (-79.17%)
Mutual labels:  speaker-verification, speaker-identification
Speaker-Identification
A program for automatic speaker identification using deep learning techniques.
Stars: ✭ 84 (+16.67%)
Mutual labels:  speaker-recognition, speaker-verification
bob
Bob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob
Stars: ✭ 38 (-47.22%)
Mutual labels:  speaker-recognition, speaker-verification
kaldi-timit-sre-ivector
Develop speaker recognition model based on i-vector using TIMIT database
Stars: ✭ 17 (-76.39%)
Mutual labels:  speaker-recognition, speaker-verification
MiniVox
Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
Stars: ✭ 15 (-79.17%)
Mutual labels:  speaker-recognition, speaker-diarization
speakerIdentificationNeuralNetworks
⇨ The Speaker Recognition System consists of two phases, Feature Extraction and Recognition. ⇨ In the Extraction phase, the Speaker's voice is recorded and typical number of features are extracted to form a model. ⇨ During the Recognition phase, a speech sample is compared against a previously created voice print stored in the database. ⇨ The hi…
Stars: ✭ 26 (-63.89%)
Mutual labels:  speaker-recognition, speaker-identification
meta-SR
Pytorch implementation of Meta-Learning for Short Utterance Speaker Recognition with Imbalance Length Pairs (Interspeech, 2020)
Stars: ✭ 58 (-19.44%)
Mutual labels:  speaker-recognition, speaker-verification
VoiceprintRecognition-PaddlePaddle
使用PaddlePaddle实现声纹识别
Stars: ✭ 57 (-20.83%)
Mutual labels:  speaker-recognition
VoiceprintRecognition-Pytorch
本项目使用了EcapaTdnn模型实现的声纹识别
Stars: ✭ 140 (+94.44%)
Mutual labels:  speaker-recognition
VoiceprintRecognition-Keras
基于Kersa实现的声纹识别模型
Stars: ✭ 70 (-2.78%)
Mutual labels:  speaker-recognition
speaker-recognition-pytorch
Speaker recognition ,Voiceprint recognition
Stars: ✭ 49 (-31.94%)
Mutual labels:  speaker-recognition

GE2E-Loss

Pytorch implementation of Generalized End-to-End Loss for speaker verification, proposed in https://arxiv.org/pdf/1710.10467.pdf [1].

Includes an argument to define whether to use the 'softmax' or 'contrast' type loss (equations 6 and 7 respectively in [1]). Uses vector operations to speed up calculations of the cosine similarity scores for an utterance embedding against all the other speaker embedding centroids.

Below is some example code for how to use this. The example values for certain parameters are taken from [1]

import torch
from ge2e import GE2ELoss

criterion = GE2ELoss(init_w=10.0, init_b=-5.0, loss_method='softmax') #for softmax loss
criterion = GE2ELoss(init_w=10.0, init_b=-5.0, loss_method='contrast') #for contrast loss

N = 64 #Number of speakers in a batch
M = 10 #Number of utterances for each speaker
D = 256 #Dimensions of the speaker embeddings, such as a d-vector or x-vector

test_input = torch.rand(N, M, D)
loss = criterion(test_input) #output is a scalar
loss.backward()

[1] GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION, https://arxiv.org/pdf/1710.10467.pdf

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].