All Projects → speech_recognition_ctc → Similar Projects or Alternatives

212 Open source projects that are alternatives of or similar to speech_recognition_ctc

Kerasdeepspeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+512.5%)
Mutual labels:  speech, ctc
Pytorch Asr
ASR with PyTorch
Stars: ✭ 124 (+210%)
Mutual labels:  speech, ctc
torch-asg
Auto Segmentation Criterion (ASG) implemented in pytorch
Stars: ✭ 42 (+5%)
Mutual labels:  speech, ctc
Neural sp
End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+920%)
Mutual labels:  speech, ctc
Multimodal-Gesture-Recognition-with-LSTMs-and-CTC
An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.
Stars: ✭ 25 (-37.5%)
Mutual labels:  speech, ctc
MajorDomo-Scenarios
Сценарии для системы домашней автоматизации Majordomo
Stars: ✭ 12 (-70%)
Mutual labels:  speech
CRNN.tf2
Convolutional Recurrent Neural Network(CRNN) for End-to-End Text Recognition - TensorFlow 2
Stars: ✭ 131 (+227.5%)
Mutual labels:  ctc
Zero-Shot-TTS
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
Stars: ✭ 33 (-17.5%)
Mutual labels:  speech
Shifter
Pitch shifter using WSOLA and resampling implemented by Python3
Stars: ✭ 22 (-45%)
Mutual labels:  speech
nlp-class
A Natural Language Processing course taught by Professor Ghassemi
Stars: ✭ 95 (+137.5%)
Mutual labels:  speech
VAD-LTSD
Efficient voice activity detection algorithm using long-term speech information
Stars: ✭ 37 (-7.5%)
Mutual labels:  speech
room-impulse-responses
A list of publicly available room impulse response datasets and scripts to download them.
Stars: ✭ 143 (+257.5%)
Mutual labels:  speech
AdaSpeech
AdaSpeech: Adaptive Text to Speech for Custom Voice
Stars: ✭ 108 (+170%)
Mutual labels:  speech
gtranscribe
Software for interview transcription
Stars: ✭ 12 (-70%)
Mutual labels:  speech
Phomeme
Simple sentence mixing tool (work in progress)
Stars: ✭ 18 (-55%)
Mutual labels:  speech
opensnips
Open source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (+25%)
Mutual labels:  speech
ctc-asr
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
Stars: ✭ 112 (+180%)
Mutual labels:  ctc
speech-transformer
Transformer implementation speciaized in speech recognition tasks using Pytorch.
Stars: ✭ 40 (+0%)
Mutual labels:  speech
TFGAN
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Stars: ✭ 65 (+62.5%)
Mutual labels:  speech
nabaztag-php
a simple php implementation of a Nabaztag server
Stars: ✭ 14 (-65%)
Mutual labels:  speech
lidbox
End-to-end spoken language identification out of the box.
Stars: ✭ 39 (-2.5%)
Mutual labels:  speech
JD-NMF
Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)
Stars: ✭ 20 (-50%)
Mutual labels:  speech
SignDetect
This application is developed to help speechless people interact with others with ease. It detects voice and converts the input speech into a sign language based video.
Stars: ✭ 21 (-47.5%)
Mutual labels:  speech
eidos-audition
Collection of auditory models.
Stars: ✭ 25 (-37.5%)
Mutual labels:  speech
Voice2Mesh
CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
Stars: ✭ 67 (+67.5%)
Mutual labels:  speech
D-TDNN
PyTorch implementation of Densely Connected Time Delay Neural Network
Stars: ✭ 60 (+50%)
Mutual labels:  speech
cape
Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch
Stars: ✭ 29 (-27.5%)
Mutual labels:  speech
voice-based-email-for-blind
Emailing System for visually impaired persons
Stars: ✭ 35 (-12.5%)
Mutual labels:  speech
web-speech-demo
Learn how to build a simple text-to-speech voice app for the web using the Web Speech API.
Stars: ✭ 19 (-52.5%)
Mutual labels:  speech
CVC
CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)
Stars: ✭ 45 (+12.5%)
Mutual labels:  speech
Rus-SpeechRecognition-LSTM-CTC-VoxForge
Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge
Stars: ✭ 50 (+25%)
Mutual labels:  ctc
aframe-speech-controls-component
alternative form of inputs for in-VR interaction with the content of a scene
Stars: ✭ 13 (-67.5%)
Mutual labels:  speech
Speech Feature Extraction
Feature extraction of speech signal is the initial stage of any speech recognition system.
Stars: ✭ 78 (+95%)
Mutual labels:  speech
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-65%)
Mutual labels:  speech
MelNet-SpeechGeneration
Implementation of MelNet in PyTorch to generate high-fidelity audio samples
Stars: ✭ 19 (-52.5%)
Mutual labels:  speech
StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
Stars: ✭ 161 (+302.5%)
Mutual labels:  speech
linear16
Converts an audio file to LINEAR16 Google-speech compatible file.
Stars: ✭ 14 (-65%)
Mutual labels:  speech
audio noise clustering
https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.
Stars: ✭ 24 (-40%)
Mutual labels:  speech
speech to text
how to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-12.5%)
Mutual labels:  speech
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+122.5%)
Mutual labels:  speech
DeepSegmentor
Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)
Stars: ✭ 17 (-57.5%)
Mutual labels:  speech
KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-47.5%)
Mutual labels:  speech
ttslearn
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (+295%)
Mutual labels:  speech
opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-47.5%)
Mutual labels:  speech
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+34575%)
Mutual labels:  speech
FAST-RIR
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.
Stars: ✭ 90 (+125%)
Mutual labels:  speech
TASNET
Time-domain Audio Separation Network (IN PYTORCH)
Stars: ✭ 18 (-55%)
Mutual labels:  speech
rnn benchmarks
RNN benchmarks of pytorch, tensorflow and theano
Stars: ✭ 85 (+112.5%)
Mutual labels:  ctc
deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+105%)
Mutual labels:  speech
NBSS
The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".
Stars: ✭ 77 (+92.5%)
Mutual labels:  speech
HTK
The Hidden Markov Model Toolkit (HTK) from University of Cambridge, with fixed issues.
Stars: ✭ 23 (-42.5%)
Mutual labels:  speech
icassp2019-latex-template
ICASSP 2019 official Latex template
Stars: ✭ 21 (-47.5%)
Mutual labels:  speech
kaldi helpers
🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-67.5%)
Mutual labels:  speech
ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+347.5%)
Mutual labels:  speech
ventib
📈 Ventib records your voice, transcribes it in realtime, and performs speech pattern analysis to give you objective statistics about how you speak.
Stars: ✭ 43 (+7.5%)
Mutual labels:  speech
Audio Signal Processing
Audio or speech signal processing guide.
Stars: ✭ 45 (+12.5%)
Mutual labels:  speech
melgan
MelGAN implementation with Multi-Band and Full Band supports...
Stars: ✭ 54 (+35%)
Mutual labels:  speech
pytorch-pcen
PyTorch reimplementation of per-channel energy normalization for audio.
Stars: ✭ 80 (+100%)
Mutual labels:  speech
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+412.5%)
Mutual labels:  speech
AESRC2020
a deep accent recognition network
Stars: ✭ 35 (-12.5%)
Mutual labels:  ctc
1-60 of 212 similar projects