SER-datasetsA collection of datasets for the purpose of emotion recognition/detection in speech.
Stars: ✭ 74 (+85%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+412.5%)
melganMelGAN implementation with Multi-Band and Full Band supports...
Stars: ✭ 54 (+35%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+460%)
wavenet-classifierKeras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
Stars: ✭ 54 (+35%)
nlp-classA Natural Language Processing course taught by Professor Ghassemi
Stars: ✭ 95 (+137.5%)
AdaSpeechAdaSpeech: Adaptive Text to Speech for Custom Voice
Stars: ✭ 108 (+170%)
fadeA Simulation Framework for Auditory Discrimination Experiments
Stars: ✭ 12 (-70%)
Zero-Shot-TTSUnofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Stars: ✭ 33 (-17.5%)
gtranscribeSoftware for interview transcription
Stars: ✭ 12 (-70%)
simple-obs-sttSpeech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+122.5%)
datasets🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+34575%)
opensnipsOpen source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (+25%)
D-TDNNPyTorch implementation of Densely Connected Time Delay Neural Network
Stars: ✭ 60 (+50%)
jackpairp2p speech encrypting device with analog audio interface suitable for GSM phones
Stars: ✭ 26 (-35%)
data-at-hand-mobileMobile application for exploring fitness data using both speech and touch interaction.
Stars: ✭ 50 (+25%)
Voice2MeshCVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
Stars: ✭ 67 (+67.5%)
spokestack-androidExtensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (+30%)
kaldi ag trainingDocker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-65%)
soxanWav2Vec for speech recognition, classification, and audio classification
Stars: ✭ 113 (+182.5%)
audio noise clusteringhttps://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.
Stars: ✭ 24 (-40%)
nabaztag-phpa simple php implementation of a Nabaztag server
Stars: ✭ 14 (-65%)
KeenASR-Android-PoCA proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-47.5%)
linear16Converts an audio file to LINEAR16 Google-speech compatible file.
Stars: ✭ 14 (-65%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-47.5%)
VAD-LTSDEfficient voice activity detection algorithm using long-term speech information
Stars: ✭ 37 (-7.5%)
LIGHT-SERNETLight-SERNet: A lightweight fully convolutional neural network for speech emotion recognition
Stars: ✭ 20 (-50%)
JD-NMFJoint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)
Stars: ✭ 20 (-50%)
speech recognition ctcUse ctc to do chinese speech recognition by keras / 通过keras和ctc实现中文语音识别
Stars: ✭ 40 (+0%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+105%)
speech to texthow to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-12.5%)
kaldi helpers🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-67.5%)
Fre-GAN-pytorchFre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: ✭ 73 (+82.5%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+2002.5%)
TASNETTime-domain Audio Separation Network (IN PYTORCH)
Stars: ✭ 18 (-55%)
ttslearnttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (+295%)
CVCCVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)
Stars: ✭ 45 (+12.5%)
torch-asgAuto Segmentation Criterion (ASG) implemented in pytorch
Stars: ✭ 42 (+5%)
PhomemeSimple sentence mixing tool (work in progress)
Stars: ✭ 18 (-55%)
web-speech-demoLearn how to build a simple text-to-speech voice app for the web using the Web Speech API.
Stars: ✭ 19 (-52.5%)
MelNet-SpeechGenerationImplementation of MelNet in PyTorch to generate high-fidelity audio samples
Stars: ✭ 19 (-52.5%)
StyleSpeechOfficial implementation of Meta-StyleSpeech and StyleSpeech
Stars: ✭ 161 (+302.5%)
SpeechEmoRecSpeech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching
Stars: ✭ 44 (+10%)
ShifterPitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (-45%)
TFGANTFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Stars: ✭ 65 (+62.5%)
Speech Feature ExtractionFeature extraction of speech signal is the initial stage of any speech recognition system.
Stars: ✭ 78 (+95%)
room-impulse-responsesA list of publicly available room impulse response datasets and scripts to download them.
Stars: ✭ 143 (+257.5%)
HTKThe Hidden Markov Model Toolkit (HTK) from University of Cambridge, with fixed issues.
Stars: ✭ 23 (-42.5%)
lidboxEnd-to-end spoken language identification out of the box.
Stars: ✭ 39 (-2.5%)
speech-transformerTransformer implementation speciaized in speech recognition tasks using Pytorch.
Stars: ✭ 40 (+0%)
jarvisJarvis Home Automation
Stars: ✭ 81 (+102.5%)
wikipronMassively multilingual pronunciation mining
Stars: ✭ 167 (+317.5%)
LIUMScripts for LIUM SpkDiarization tools
Stars: ✭ 28 (-30%)
KARENKAREN: Unifying Hatespeech Detection and Benchmarking
Stars: ✭ 18 (-55%)