All Projects → wav2vec2-live → Similar Projects or Alternatives

529 Open source projects that are alternatives of or similar to wav2vec2-live

Wsay
Windows "say"
Stars: ✭ 36 (-82.44%)
Mutual labels:  speech
FAST-RIR
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
Stars: ✭ 90 (-56.1%)
Mutual labels:  speech
SignDetect
This application is developed to help speechless people interact with others with ease. It detects voice and converts the input speech into a sign language based video.
Stars: ✭ 21 (-89.76%)
Mutual labels:  speech
Siricontrol System
Control anything with Siri voice commands.
Stars: ✭ 180 (-12.2%)
Mutual labels:  speech
Lightspeech
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-84.88%)
Mutual labels:  speech
eidos-audition
Collection of auditory models.
Stars: ✭ 25 (-87.8%)
Mutual labels:  speech
gtranscribe
Software for interview transcription
Stars: ✭ 12 (-94.15%)
Mutual labels:  speech
Audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
Stars: ✭ 1,262 (+515.61%)
Mutual labels:  speech
pie
百度云流式语音识别客户端 SDK
Stars: ✭ 62 (-69.76%)
Mutual labels:  asr
IMS-Toucan
Text-to-Speech Toolkit of the Speech and Language Technologies Group at the University of Stuttgart. Objectives of the development are simplicity, modularity, controllability and multilinguality.
Stars: ✭ 295 (+43.9%)
Mutual labels:  speech
Voice Gender
Gender recognition by voice and speech analysis
Stars: ✭ 248 (+20.98%)
Mutual labels:  speech
NBSS
The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".
Stars: ✭ 77 (-62.44%)
Mutual labels:  speech
cape
Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch
Stars: ✭ 29 (-85.85%)
Mutual labels:  speech
icassp2019-latex-template
ICASSP 2019 official Latex template
Stars: ✭ 21 (-89.76%)
Mutual labels:  speech
DeepSegmentor
Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)
Stars: ✭ 17 (-91.71%)
Mutual labels:  speech
Tts
Tools to convert text to speech 📚💬
Stars: ✭ 84 (-59.02%)
Mutual labels:  speech
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+6665.85%)
Mutual labels:  speech
ventib
📈 Ventib records your voice, transcribes it in realtime, and performs speech pattern analysis to give you objective statistics about how you speak.
Stars: ✭ 43 (-79.02%)
Mutual labels:  speech
Praat
Praat: Doing Phonetics By Computer
Stars: ✭ 675 (+229.27%)
Mutual labels:  speech
pytorch-pcen
PyTorch reimplementation of per-channel energy normalization for audio.
Stars: ✭ 80 (-60.98%)
Mutual labels:  speech
Deep speaker Speaker recognition system
Keras implementation of ‘’Deep Speaker: an End-to-End Neural Speaker Embedding System‘’ (speaker recognition)
Stars: ✭ 174 (-15.12%)
Mutual labels:  speech
Segan
Speech Enhancement Generative Adversarial Network in TensorFlow
Stars: ✭ 661 (+222.44%)
Mutual labels:  speech
Automatic speech recognition
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 2,751 (+1241.95%)
Mutual labels:  speech-recognition
Wavegrad
Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.
Stars: ✭ 245 (+19.51%)
Mutual labels:  speech
Chatbot Watson Android
An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.
Stars: ✭ 169 (-17.56%)
Mutual labels:  speech
Dragonfly
Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx
Stars: ✭ 209 (+1.95%)
Mutual labels:  speech-recognition
react-native-speech-bubble
💬 A speech bubble dialog component for React Native.
Stars: ✭ 50 (-75.61%)
Mutual labels:  speech
Speech Denoising Wavenet
A neural network for end-to-end speech denoising
Stars: ✭ 516 (+151.71%)
Mutual labels:  speech
Tacotron
Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Stars: ✭ 493 (+140.49%)
Mutual labels:  speech
Subsync
Subtitle Speech Synchronizer
Stars: ✭ 379 (+84.88%)
Mutual labels:  speech-recognition
data-at-hand-mobile
Mobile application for exploring fitness data using both speech and touch interaction.
Stars: ✭ 50 (-75.61%)
Mutual labels:  speech
Espnet
End-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+2111.22%)
Mutual labels:  speech-recognition
Libfaceid
libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (+72.68%)
Mutual labels:  speech-recognition
CVC
CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)
Stars: ✭ 45 (-78.05%)
Mutual labels:  speech
Emotion Classification From Audio Files
Understanding emotions from audio files using neural networks and multiple datasets.
Stars: ✭ 189 (-7.8%)
Mutual labels:  speech
Alan Sdk Ios
Alan AI iOS SDK adds a voice assistant or chatbot to your app. Supports Swift, Objective-C.
Stars: ✭ 318 (+55.12%)
Mutual labels:  speech-recognition
aframe-speech-controls-component
alternative form of inputs for in-VR interaction with the content of a scene
Stars: ✭ 13 (-93.66%)
Mutual labels:  speech
Tts Papers
🐸 collection of TTS papers
Stars: ✭ 160 (-21.95%)
Mutual labels:  speech
Xr3player
🎧 🎼 Advanced JavaFX Media Player
Stars: ✭ 472 (+130.24%)
Mutual labels:  speech
Kaldi Offline Transcriber
Offline transcription system for Estonian using Kaldi
Stars: ✭ 182 (-11.22%)
Mutual labels:  speech-recognition
Multimodal-Gesture-Recognition-with-LSTMs-and-CTC
An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.
Stars: ✭ 25 (-87.8%)
Mutual labels:  speech
Deepspeech German
Automatic Speech Recognition (ASR) - German
Stars: ✭ 179 (-12.68%)
Mutual labels:  speech-recognition
Cboard
AAC communication system with text-to-speech for the browser
Stars: ✭ 437 (+113.17%)
Mutual labels:  speech
Cidlib
The CIDLib general purpose C++ development environment
Stars: ✭ 179 (-12.68%)
Mutual labels:  speech-recognition
Aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Stars: ✭ 1,942 (+847.32%)
Mutual labels:  speech
Kaldi Onnx
Kaldi model converter to ONNX
Stars: ✭ 174 (-15.12%)
Mutual labels:  speech-recognition
StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
Stars: ✭ 161 (-21.46%)
Mutual labels:  speech
Pocketsphinx
PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop
Stars: ✭ 2,934 (+1331.22%)
Mutual labels:  speech-recognition
Stl
The ITU-T Software Tool Library (G.191)
Stars: ✭ 44 (-78.54%)
Mutual labels:  speech
Awesome Speech Recognition Speech Synthesis Papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Stars: ✭ 2,085 (+917.07%)
Mutual labels:  speech-recognition
Shifter
Pitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (-89.27%)
Mutual labels:  speech
Wavenet vocoder
WaveNet vocoder
Stars: ✭ 1,926 (+839.51%)
Mutual labels:  speech
Cordova Plugin Speechrecognition
🎤 Cordova Plugin for Speech Recognition
Stars: ✭ 174 (-15.12%)
Mutual labels:  speech-recognition
Gst Kaldi Nnet2 Online
GStreamer plugin around Kaldi's online neural network decoder
Stars: ✭ 171 (-16.59%)
Mutual labels:  speech-recognition
Kaldiio
A pure python module for reading and writing kaldi ark files
Stars: ✭ 160 (-21.95%)
Mutual labels:  speech-recognition
Voice Converter Cyclegan
Voice Converter Using CycleGAN and Non-Parallel Data
Stars: ✭ 384 (+87.32%)
Mutual labels:  speech
Interspeech2019 Tutorial
INTERSPEECH 2019 Tutorial Materials
Stars: ✭ 160 (-21.95%)
Mutual labels:  speech-recognition
obvi
A Polymer 3+ webcomponent / button for doing speech recognition
Stars: ✭ 54 (-73.66%)
Mutual labels:  speech-recognition
Tacotron pytorch
PyTorch implementation of Tacotron speech synthesis model.
Stars: ✭ 242 (+18.05%)
Mutual labels:  speech
Tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Stars: ✭ 1,756 (+756.59%)
Mutual labels:  speech
301-360 of 529 similar projects