zhenghuatan / rVAD

Licence: other
Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as described in the paper "rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method".

Programming Languages

matlab
python

Projects that are alternatives of or similar to rVAD

Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
Stars: ✭ 81 (+76.09%)
Mutual labels:  voice-activity-detection, noise-robust
spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (+13.04%)
Mutual labels:  voice-activity-detection
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-41.3%)
Mutual labels:  voice-activity-detection
cobra
On-device voice activity detection (VAD) powered by deep learning.
Stars: ✭ 76 (+65.22%)
Mutual labels:  voice-activity-detection
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+1728.26%)
Mutual labels:  voice-activity-detection
voice-activity-detection
PyTorch implementation of Self-Attentive VAD, ICASSP 2021
Stars: ✭ 82 (+78.26%)
Mutual labels:  voice-activity-detection
rVADfast
Python library for a fast, unsupervised method for robust voice activity detection (rVAD), as described in the paper "rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method".
Stars: ✭ 80 (+73.91%)
Mutual labels:  voice-activity-detection
Huawei-Challenge-Speaker-Identification
Trained speaker-embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.
Stars: ✭ 34 (-26.09%)
Mutual labels:  voice-activity-detection
voice gender detection
♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
Stars: ✭ 51 (+10.87%)
Mutual labels:  voice-activity-detection
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (+15.22%)
Mutual labels:  voice-activity-detection
android-vad
This VAD library processes audio in real time using a Gaussian mixture model (GMM) to identify the presence of human speech in an audio sample that contains a mixture of speech and noise.
Stars: ✭ 64 (+39.13%)
Mutual labels:  voice-activity-detection
Ffsubsync
Automagically synchronize subtitles with video.
Stars: ✭ 5,167 (+11132.61%)
Mutual labels:  voice-activity-detection
Noisetorch
Real-time microphone noise suppression on Linux.
Stars: ✭ 5,199 (+11202.17%)
Mutual labels:  voice-activity-detection

rVAD

Description

Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), also known as speech activity detection (SAD), as presented in "rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method".

The rVAD method consists of two passes of denoising followed by a VAD stage. It has been applied as a preprocessor for speech recognition, speaker identification, language identification, age and gender identification, human-robot interaction, audio archive segmentation, and other tasks. More information is available on the rVAD webpage.
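
When VAD output is used as a preprocessor, a downstream system usually needs speech segments rather than per-frame decisions. The following is a minimal sketch of that conversion, not part of the rVAD code: the function name labels_to_segments, the 0/1 label convention, and the 10 ms frame shift are all assumptions made for illustration.

```python
def labels_to_segments(labels, frame_shift_ms=10):
    """Convert per-frame 0/1 VAD labels into (start_ms, end_ms) speech segments.

    Assumption: one label per frame, 1 = speech, 0 = non-speech, and a fixed
    frame shift (10 ms by default; adjust to match the VAD that made the labels).
    """
    segments = []
    start = None  # index of the first frame of the current speech run, if any
    for i, label in enumerate(labels):
        if label == 1 and start is None:
            start = i  # a speech run begins
        elif label != 1 and start is not None:
            # the speech run ended at the previous frame
            segments.append((start * frame_shift_ms, i * frame_shift_ms))
            start = None
    if start is not None:
        # the labels end while still inside a speech run
        segments.append((start * frame_shift_ms, len(labels) * frame_shift_ms))
    return segments


if __name__ == "__main__":
    print(labels_to_segments([0, 1, 1, 0, 0, 1]))
```

The segment boundaries are kept as integer milliseconds so that they can be compared exactly and converted to sample indices later without rounding surprises.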

Source code for rVAD:

Source code in Matlab for rVAD (including rVAD-fast) is available under the rVAD2.0 folder. It is straightforward to use: simply call the function vad.m. Some Matlab functions from the publicly available VoiceBox, along with modified versions of them, are included with kind permission of Mike Brookes.

Source code in Python for rVAD-fast is available under the rVADfast_py_2.0 folder.

Reference VAD for Aurora 2 database:

The frame-by-frame reference VAD was generated from the clean set of Aurora 2 using forced-alignment speech recognition and has been used as a 'ground truth' for evaluating VAD algorithms. Our study shows that forced-alignment ASR performs as well as a human expert labeler for generating VAD references, as detailed in "Comparison of Forced-Alignment Speech Recognition and Humans for Generating Reference VAD". The generated reference VAD labels are available for both the training set and the test set.
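
To illustrate how a frame-by-frame reference can serve as ground truth, here is a minimal sketch that compares a VAD output against reference labels and reports frame-level error rates. It is not the evaluation code used in the paper: the function name frame_error_rates, the 0/1 label convention, and the choice of metrics (miss rate, false alarm rate, overall frame error rate) are assumptions for illustration, and the file format of the reference labels is not modelled here.

```python
def frame_error_rates(reference, hypothesis):
    """Compare per-frame VAD labels (1 = speech, 0 = non-speech).

    Returns the miss rate (speech frames labelled non-speech), the false
    alarm rate (non-speech frames labelled speech), and the overall
    frame error rate. Both sequences must have the same length.
    """
    if len(reference) != len(hypothesis):
        raise ValueError("reference and hypothesis must have the same length")

    speech = sum(1 for r in reference if r == 1)
    nonspeech = len(reference) - speech
    misses = sum(1 for r, h in zip(reference, hypothesis) if r == 1 and h != 1)
    false_alarms = sum(1 for r, h in zip(reference, hypothesis) if r != 1 and h == 1)

    total = len(reference)
    return {
        # Guard against division by zero when a class is absent.
        "miss_rate": misses / speech if speech else 0.0,
        "false_alarm_rate": false_alarms / nonspeech if nonspeech else 0.0,
        "frame_error_rate": (misses + false_alarms) / total if total else 0.0,
    }


if __name__ == "__main__":
    ref = [1, 1, 0, 0, 1, 0]
    hyp = [1, 0, 0, 1, 1, 0]
    print(frame_error_rates(ref, hyp))
```

Reporting misses and false alarms separately, rather than a single accuracy figure, makes it visible whether a VAD tends to clip speech or to let noise through.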
