StevenLOL / LIUM

Licence: MIT License

Scripts for LIUM SpkDiarization tools

Projects that are alternatives of or similar to LIUM

PyTorch implementation of VQ-VAE + WaveNet by [Chorowski et al., 2019] and VQ-VAE on speech signals by [van den Oord et al., 2017]

Stars: ✭ 187 (+567.86%)

Mutual labels: speech, speech-processing

Dnc

Discriminative Neural Clustering for Speaker Diarisation

Stars: ✭ 60 (+114.29%)

Mutual labels: clustering, speech-processing

Neural Voice Cloning With Few Samples

Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu

Stars: ✭ 211 (+653.57%)

Mutual labels: speech, speech-processing

Tfg Voice Conversion

Deep Learning-based Voice Conversion system

Stars: ✭ 115 (+310.71%)

Mutual labels: speech, speech-processing

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (+464.29%)

Mutual labels: speech, speech-processing

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+6778.57%)

Mutual labels: speech, speech-processing

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+764.29%)

Mutual labels: speech, speech-processing

minutes

🔭 Speaker diarization via transfer learning

Stars: ✭ 25 (-10.71%)

Mutual labels: speech, speaker-diarization

Shifter

Pitch shifter using WSOLA and resampling implemented by Python3

Stars: ✭ 22 (-21.43%)

Mutual labels: speech, speech-processing

IMS-Toucan

Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.

Stars: ✭ 295 (+953.57%)

Mutual labels: speech, speech-processing

Speech Denoising Wavenet

A neural network for end-to-end speech denoising

Stars: ✭ 516 (+1742.86%)

Mutual labels: speech, speech-processing

D-TDNN

PyTorch implementation of Densely Connected Time Delay Neural Network

Stars: ✭ 60 (+114.29%)

Mutual labels: speech, speaker-diarization

Pysptk

A python wrapper for Speech Signal Processing Toolkit (SPTK).

Stars: ✭ 297 (+960.71%)

Mutual labels: speech, speech-processing

React Native Dialogflow

A React-Native Bridge for the Google Dialogflow (API.AI) SDK

Stars: ✭ 182 (+550%)

Mutual labels: speech, speech-processing

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Stars: ✭ 88 (+214.29%)

Mutual labels: speech, speech-processing

Gcc Nmf

Real-time GCC-NMF Blind Speech Separation and Enhancement

Stars: ✭ 231 (+725%)

Mutual labels: speech, speech-processing

UHV-OTS-Speech

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Stars: ✭ 94 (+235.71%)

Mutual labels: speech-processing, speaker-diarization

audio noise clustering

https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.

Stars: ✭ 24 (-14.29%)

Mutual labels: clustering, speech

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (+700%)

Mutual labels: speech, speech-processing

genie

Genie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)

Stars: ✭ 21 (-25%)

Mutual labels: clustering

LIUM

A demo on LIUM ILP clustering

LIUM_SpkDiarization is software dedicated to speaker diarization (i.e., speaker segmentation and clustering), developed by the Laboratoire d'Informatique de l'Université du Maine (LIUM - EA 4023).

Homepage: http://www-lium.univ-lemans.fr/diarization/doku.php/welcome

Download: http://www-lium.univ-lemans.fr/diarization/doku.php/download

Related project: https://github.com/bbcrd/diarize-jruby

Prerequisites:

1) glpk for ILP clustering:

sudo apt-get install glpk

or

sudo apt-get install glpk-utils

http://www.gnu.org/software/glpk/

2) UBM and models: http://www-lium.univ-lemans.fr/diarization/lib/exe/fetch.php/data_ilp.tgz

After downloading LIUM, the UBM, and the other models, put them in this folder so that it has the file structure below. Then run ./go.sh; the output segments will be written to the test_out folder.

./
|-- LIUM_SpkDiarization-8.4.1.jar
|-- go.sh
|-- ilp_diarization2.sh
|-- LICENSE
|-- README.md
|-- mat
|   |-- wld.efn.xml
|   |-- wld.mahanalobis.mat
|   `-- wld.tv.mat
|-- models
|   |-- gender.gmms
|   |-- s.gmms
|   |-- sms.gmms
|   `-- ubm.gmm
|-- test_out
|-- test_wav
|   `-- t001.wav
`-- ubm
    `-- wld.gmm
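
Before running go.sh, it can help to confirm that the downloaded files ended up in the right places. The following is a minimal sketch, not part of this repository: the helper name missing_files is hypothetical, and the path list is simply copied from the tree above (including the file name wld.mahanalobis.mat exactly as it is spelled there).

```python
from pathlib import Path

# Paths copied from the directory tree above; adjust if your LIUM
# version or model archive uses different file names.
EXPECTED_FILES = [
    "LIUM_SpkDiarization-8.4.1.jar",
    "go.sh",
    "ilp_diarization2.sh",
    "mat/wld.efn.xml",
    "mat/wld.mahanalobis.mat",
    "mat/wld.tv.mat",
    "models/gender.gmms",
    "models/s.gmms",
    "models/sms.gmms",
    "models/ubm.gmm",
    "ubm/wld.gmm",
]


def missing_files(root="."):
    """Return the expected files that are absent under `root` (hypothetical helper)."""
    root = Path(root)
    return [rel for rel in EXPECTED_FILES if not (root / rel).is_file()]


if __name__ == "__main__":
    missing = missing_files()
    if missing:
        print("Missing files:")
        for rel in missing:
            print("  " + rel)
    else:
        print("All expected files are present.")
```

Run it from the folder that contains go.sh. The test_wav and test_out folders are not checked, since their contents depend on your own audio.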

Details are in ilp_diarization2.sh

Known issues:

#1 LIUM works fine with JDK 1.7 but has some problems with JDK 1.8.

Please check out this project, which works with JDK 1.8.
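
To see which Java version is on your PATH before running the scripts, something like the sketch below may be useful. It is an illustration only: the function names are hypothetical, and it assumes the usual `java -version` output, where a line such as `java version "1.7.0_80"` or `openjdk version "11.0.2"` is written to standard error.

```python
import re
import subprocess


def java_major_version(version_output):
    """Extract the major Java version from `java -version` text.

    Legacy strings such as "1.7.0_80" map to 7; newer ones such as
    "11.0.2" map to 11. Returns None if no version string is found.
    """
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if match is None:
        return None
    first, second = match.group(1), match.group(2)
    if first == "1" and second is not None:
        return int(second)  # old "1.x" numbering scheme
    return int(first)


def installed_java_major():
    """Run `java -version` and parse its output (it is printed to stderr)."""
    result = subprocess.run(["java", "-version"], capture_output=True, text=True)
    return java_major_version(result.stderr)
```

If installed_java_major() returns 8 or higher, the issue described above may apply to your setup.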

Update:

LIUM = Laboratoire d'Informatique de l'Université du Maine

http://www-lium.univ-lemans.fr/

Speaker diarization

Speaker diarization is a research topic in audio/speech processing. It refers to the unsupervised clustering of audio sequences into speaker groups.
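
As a toy illustration of the clustering idea, the sketch below groups per-segment feature vectors into speakers by repeatedly merging the two closest clusters. This is not the method LIUM uses (its ILP clustering is described in ilp_diarization2.sh and the LIUM documentation); the function name, the 2-D features, and the distance threshold are all assumptions made for the example.

```python
from math import dist


def cluster_segments(features, threshold):
    """Toy agglomerative clustering of segment feature vectors.

    Repeatedly merges the two clusters whose centroids are closest,
    stopping when the closest pair is farther apart than `threshold`.
    Returns one speaker label per input segment.
    """
    clusters = [[i] for i in range(len(features))]

    def centroid(members):
        dims = len(features[0])
        return tuple(
            sum(features[i][d] for i in members) / len(members)
            for d in range(dims)
        )

    while len(clusters) > 1:
        best = None  # (distance, index_a, index_b) of the closest pair
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = dist(centroid(clusters[a]), centroid(clusters[b]))
                if best is None or d < best[0]:
                    best = (d, a, b)
        if best[0] > threshold:
            break
        _, a, b = best
        clusters[a].extend(clusters[b])
        del clusters[b]

    labels = [None] * len(features)
    for speaker_id, members in enumerate(clusters):
        for i in members:
            labels[i] = speaker_id
    return labels


# Four made-up segments: the first and third are close together,
# as are the second and fourth.
segments = [(0.0, 0.1), (5.0, 5.2), (0.2, 0.0), (5.1, 4.9)]
print(cluster_segments(segments, threshold=1.0))  # [0, 1, 0, 1]
```

Real systems work on acoustic features extracted from the audio and use statistical criteria rather than a fixed Euclidean threshold, but the output has the same shape: a speaker label for each segment.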
