All Projects → ser-with-w2v2 → Similar Projects or Alternatives

192 Open source projects that are alternatives of or similar to ser-with-w2v2

SER-datasets
A collection of datasets for the purpose of emotion recognition/detection in speech.
Stars: ✭ 74 (+85%)
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+412.5%)
Mutual labels:  speech, wav2vec2
melgan
MelGAN implementation with Multi-Band and Full Band supports...
Stars: ✭ 54 (+35%)
Mutual labels:  speech
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+460%)
Mutual labels:  speech
wavenet-classifier
Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks
Stars: ✭ 54 (+35%)
Interaction-Aware-Attention-Network
[ICASSP19] An Interaction-aware Attention Network for Speech Emotion Recognition in Spoken Dialogs
Stars: ✭ 32 (-20%)
nlp-class
A Natural Language Processing course taught by Professor Ghassemi
Stars: ✭ 95 (+137.5%)
Mutual labels:  speech
AdaSpeech
AdaSpeech: Adaptive Text to Speech for Custom Voice
Stars: ✭ 108 (+170%)
Mutual labels:  speech
fade
A Simulation Framework for Auditory Discrimination Experiments
Stars: ✭ 12 (-70%)
Mutual labels:  speech
Zero-Shot-TTS
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Stars: ✭ 33 (-17.5%)
Mutual labels:  speech
gtranscribe
Software for interview transcription
Stars: ✭ 12 (-70%)
Mutual labels:  speech
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+122.5%)
Mutual labels:  speech
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+34575%)
Mutual labels:  speech
opensnips
Open source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (+25%)
Mutual labels:  speech
D-TDNN
PyTorch implementation of Densely Connected Time Delay Neural Network
Stars: ✭ 60 (+50%)
Mutual labels:  speech
jackpair
p2p speech encrypting device with analog audio interface suitable for GSM phones
Stars: ✭ 26 (-35%)
Mutual labels:  speech
data-at-hand-mobile
Mobile application for exploring fitness data using both speech and touch interaction.
Stars: ✭ 50 (+25%)
Mutual labels:  speech
Voice2Mesh
CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
Stars: ✭ 67 (+67.5%)
Mutual labels:  speech
MajorDomo-Scenarios
Сценарии для системы домашней автоматизации Majordomo
Stars: ✭ 12 (-70%)
Mutual labels:  speech
spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (+30%)
Mutual labels:  speech
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-65%)
Mutual labels:  speech
soxan
Wav2Vec for speech recognition, classification, and audio classification
Stars: ✭ 113 (+182.5%)
audio noise clustering
https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.
Stars: ✭ 24 (-40%)
Mutual labels:  speech
nabaztag-php
a simple php implementation of a Nabaztag server
Stars: ✭ 14 (-65%)
Mutual labels:  speech
KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-47.5%)
Mutual labels:  speech
linear16
Converts an audio file to LINEAR16 Google-speech compatible file.
Stars: ✭ 14 (-65%)
Mutual labels:  speech
opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-47.5%)
Mutual labels:  speech
VAD-LTSD
Efficient voice activity detection algorithm using long-term speech information
Stars: ✭ 37 (-7.5%)
Mutual labels:  speech
LIGHT-SERNET
Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition
Stars: ✭ 20 (-50%)
JD-NMF
Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)
Stars: ✭ 20 (-50%)
Mutual labels:  speech
speech recognition ctc
Use ctc to do chinese speech recognition by keras / 通过keras和ctc实现中文语音识别
Stars: ✭ 40 (+0%)
Mutual labels:  speech
deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+105%)
Mutual labels:  speech
speech to text
how to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-12.5%)
Mutual labels:  speech
kaldi helpers
🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-67.5%)
Mutual labels:  speech
Fre-GAN-pytorch
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
Stars: ✭ 73 (+82.5%)
Mutual labels:  speech
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+2002.5%)
TASNET
Time-domain Audio Separation Network (IN PYTORCH)
Stars: ✭ 18 (-55%)
Mutual labels:  speech
voice-based-email-for-blind
Emailing System for visually impaired persons
Stars: ✭ 35 (-12.5%)
Mutual labels:  speech
ttslearn
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (+295%)
Mutual labels:  speech
CVC
CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)
Stars: ✭ 45 (+12.5%)
Mutual labels:  speech
Audio Signal Processing
Audio or speech signal processing guide.
Stars: ✭ 45 (+12.5%)
Mutual labels:  speech
aframe-speech-controls-component
alternative form of inputs for in-VR interaction with the content of a scene
Stars: ✭ 13 (-67.5%)
Mutual labels:  speech
torch-asg
Auto Segmentation Criterion (ASG) implemented in pytorch
Stars: ✭ 42 (+5%)
Mutual labels:  speech
Phomeme
Simple sentence mixing tool (work in progress)
Stars: ✭ 18 (-55%)
Mutual labels:  speech
web-speech-demo
Learn how to build a simple text-to-speech voice app for the web using the Web Speech API.
Stars: ✭ 19 (-52.5%)
Mutual labels:  speech
attentive-modality-hopping-for-SER
TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20
Stars: ✭ 25 (-37.5%)
MelNet-SpeechGeneration
Implementation of MelNet in PyTorch to generate high-fidelity audio samples
Stars: ✭ 19 (-52.5%)
Mutual labels:  speech
StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
Stars: ✭ 161 (+302.5%)
Mutual labels:  speech
SpeechEmoRec
Speech Emotion Recognition Using Deep Convolutional Neural Network and Discriminant Temporal Pyramid Matching
Stars: ✭ 44 (+10%)
Shifter
Pitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (-45%)
Mutual labels:  speech
TFGAN
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Stars: ✭ 65 (+62.5%)
Mutual labels:  speech
Speech Feature Extraction
Feature extraction of speech signal is the initial stage of any speech recognition system.
Stars: ✭ 78 (+95%)
Mutual labels:  speech
room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
Stars: ✭ 143 (+257.5%)
Mutual labels:  speech
HTK
The Hidden Markov Model Toolkit (HTK) from University of Cambridge, with fixed issues.
Stars: ✭ 23 (-42.5%)
Mutual labels:  speech
lidbox
End-to-end spoken language identification out of the box.
Stars: ✭ 39 (-2.5%)
Mutual labels:  speech
speech-transformer
Transformer implementation speciaized in speech recognition tasks using Pytorch.
Stars: ✭ 40 (+0%)
Mutual labels:  speech
jarvis
Jarvis Home Automation
Stars: ✭ 81 (+102.5%)
Mutual labels:  speech
wikipron
Massively multilingual pronunciation mining
Stars: ✭ 167 (+317.5%)
Mutual labels:  speech
LIUM
Scripts for LIUM SpkDiarization tools
Stars: ✭ 28 (-30%)
Mutual labels:  speech
KAREN
KAREN: Unifying Hatespeech Detection and Benchmarking
Stars: ✭ 18 (-55%)
Mutual labels:  speech
1-60 of 192 similar projects