All Projects → speech_recognition_ctc → Similar Projects or Alternatives

212 Open source projects that are alternatives of or similar to speech_recognition_ctc

web-speech-demo
Learn how to build a simple text-to-speech voice app for the web using the Web Speech API.
Stars: ✭ 19 (-52.5%)
Mutual labels:  speech
idear
🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (+110%)
Mutual labels:  speech
CVC
CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)
Stars: ✭ 45 (+12.5%)
Mutual labels:  speech
browser-apis
🦄 Cool & Fun Browser Web APIs 🥳
Stars: ✭ 21 (-47.5%)
Mutual labels:  speech
Rus-SpeechRecognition-LSTM-CTC-VoxForge
Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge
Stars: ✭ 50 (+25%)
Mutual labels:  ctc
Voice Gender
Gender recognition by voice and speech analysis
Stars: ✭ 248 (+520%)
Mutual labels:  speech
aframe-speech-controls-component
alternative form of inputs for in-VR interaction with the content of a scene
Stars: ✭ 13 (-67.5%)
Mutual labels:  speech
Speech Feature Extraction
Feature extraction of speech signal is the initial stage of any speech recognition system.
Stars: ✭ 78 (+95%)
Mutual labels:  speech
Durian
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (+177.5%)
Mutual labels:  speech
Tacotron pytorch
PyTorch implementation of Tacotron speech synthesis model.
Stars: ✭ 242 (+505%)
Mutual labels:  speech
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-65%)
Mutual labels:  speech
Gcc Nmf
Real-time GCC-NMF Blind Speech Separation and Enhancement
Stars: ✭ 231 (+477.5%)
Mutual labels:  speech
MelNet-SpeechGeneration
Implementation of MelNet in PyTorch to generate high-fidelity audio samples
Stars: ✭ 19 (-52.5%)
Mutual labels:  speech
Source separation
Deep learning based speech source separation using Pytorch
Stars: ✭ 226 (+465%)
Mutual labels:  speech
StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
Stars: ✭ 161 (+302.5%)
Mutual labels:  speech
Speech Denoiser
A speech denoise lv2 plugin based on RNNoise library
Stars: ✭ 220 (+450%)
Mutual labels:  speech
linear16
Converts an audio file to LINEAR16 Google-speech compatible file.
Stars: ✭ 14 (-65%)
Mutual labels:  speech
Tts Cube
End-2-end speech synthesis with recurrent neural networks
Stars: ✭ 213 (+432.5%)
Mutual labels:  speech
audio noise clustering
https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.
Stars: ✭ 24 (-40%)
Mutual labels:  speech
Edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+412.5%)
Mutual labels:  speech
speech to text
how to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-12.5%)
Mutual labels:  speech
Esp8266sam
Speech synthesis for ESP8266 using S.A.M. port
Stars: ✭ 199 (+397.5%)
Mutual labels:  speech
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+122.5%)
Mutual labels:  speech
Speechtotext Websockets Javascript
SDK & Sample to do speech recognition using websockets in Javascript
Stars: ✭ 191 (+377.5%)
Mutual labels:  speech
DeepSegmentor
Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)
Stars: ✭ 17 (-57.5%)
Mutual labels:  speech
Depression Detect
Predicting depression from acoustic features of speech using a Convolutional Neural Network.
Stars: ✭ 187 (+367.5%)
Mutual labels:  speech
KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-47.5%)
Mutual labels:  speech
React Native Dialogflow
A React-Native Bridge for the Google Dialogflow (API.AI) SDK
Stars: ✭ 182 (+355%)
Mutual labels:  speech
ttslearn
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (+295%)
Mutual labels:  speech
End2end Asr Pytorch
End-to-End Automatic Speech Recognition on PyTorch
Stars: ✭ 175 (+337.5%)
Mutual labels:  speech
opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-47.5%)
Mutual labels:  speech
Chatbot Watson Android
An Android ChatBot powered by Watson Services - Assistant, Speech-to-Text and Text-to-Speech on IBM Cloud.
Stars: ✭ 169 (+322.5%)
Mutual labels:  speech
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+34575%)
Mutual labels:  speech
Tacotron asr
Speech Recognition Using Tacotron
Stars: ✭ 165 (+312.5%)
Mutual labels:  speech
FAST-RIR
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
Stars: ✭ 90 (+125%)
Mutual labels:  speech
Aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Stars: ✭ 1,942 (+4755%)
Mutual labels:  speech
TASNET
Time-domain Audio Separation Network (IN PYTORCH)
Stars: ✭ 18 (-55%)
Mutual labels:  speech
Tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Stars: ✭ 1,756 (+4290%)
Mutual labels:  speech
rnn benchmarks
RNN benchmarks of pytorch, tensorflow and theano
Stars: ✭ 85 (+112.5%)
Mutual labels:  ctc
Diffwave
DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.
Stars: ✭ 139 (+247.5%)
Mutual labels:  speech
deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+105%)
Mutual labels:  speech
Voice activity detection
Voice Activity Detection based on Deep Learning & TensorFlow
Stars: ✭ 132 (+230%)
Mutual labels:  speech
NBSS
The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".
Stars: ✭ 77 (+92.5%)
Mutual labels:  speech
Voc
A physical model of the human vocal tract using literate programming, based on Pink Trombone.
Stars: ✭ 129 (+222.5%)
Mutual labels:  speech
HTK
The Hidden Markov Model Toolkit (HTK) from University of Cambridge, with fixed issues.
Stars: ✭ 23 (-42.5%)
Mutual labels:  speech
Reconstructing faces from voices
An example of the paper "reconstructing faces from voices"
Stars: ✭ 127 (+217.5%)
Mutual labels:  speech
icassp2019-latex-template
ICASSP 2019 official Latex template
Stars: ✭ 21 (-47.5%)
Mutual labels:  speech
AESRC2020
a deep accent recognition network
Stars: ✭ 35 (-12.5%)
Mutual labels:  ctc
txt2speech
Convert text to speech using Google Translate API
Stars: ✭ 38 (-5%)
Mutual labels:  speech
kaldi helpers
🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-67.5%)
Mutual labels:  speech
Tts
Text-to-Speech for Arduino
Stars: ✭ 118 (+195%)
Mutual labels:  speech
ventib
📈 Ventib records your voice, transcribes it in realtime, and performs speech pattern analysis to give you objective statistics about how you speak.
Stars: ✭ 43 (+7.5%)
Mutual labels:  speech
Tfg Voice Conversion
Deep Learning-based Voice Conversion system
Stars: ✭ 115 (+187.5%)
Mutual labels:  speech
Audio Signal Processing
Audio or speech signal processing guide.
Stars: ✭ 45 (+12.5%)
Mutual labels:  speech
anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (+32.5%)
Mutual labels:  speech
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+412.5%)
Mutual labels:  speech
jackpair
p2p speech encrypting device with analog audio interface suitable for GSM phones
Stars: ✭ 26 (-35%)
Mutual labels:  speech
fade
A Simulation Framework for Auditory Discrimination Experiments
Stars: ✭ 12 (-70%)
Mutual labels:  speech
KAREN
KAREN: Unifying Hatespeech Detection and Benchmarking
Stars: ✭ 18 (-55%)
Mutual labels:  speech
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+460%)
Mutual labels:  speech
61-120 of 212 similar projects