All Projects → speech_recognition_ctc → Similar Projects or Alternatives

212 Open source projects that are alternatives of or similar to speech_recognition_ctc

Kerasdeepspeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Stars: ✭ 245 (+512.5%)

Mutual labels: speech, ctc

Pytorch Asr

ASR with PyTorch

Stars: ✭ 124 (+210%)

Mutual labels: speech, ctc

torch-asg

Auto Segmentation Criterion (ASG) implemented in pytorch

Stars: ✭ 42 (+5%)

Mutual labels: speech, ctc

Neural sp

End-to-end ASR/LM implementation with PyTorch

Stars: ✭ 408 (+920%)

Mutual labels: speech, ctc

Multimodal-Gesture-Recognition-with-LSTMs-and-CTC

An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.

Stars: ✭ 25 (-37.5%)

Mutual labels: speech, ctc

MajorDomo-Scenarios

Сценарии для системы домашней автоматизации Majordomo

Stars: ✭ 12 (-70%)

Mutual labels: speech

CRNN.tf2

Convolutional Recurrent Neural Network(CRNN) for End-to-End Text Recognition - TensorFlow 2

Stars: ✭ 131 (+227.5%)

Mutual labels: ctc

Zero-Shot-TTS

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Stars: ✭ 33 (-17.5%)

Mutual labels: speech

Shifter

Pitch shifter using WSOLA and resampling implemented by Python3

Stars: ✭ 22 (-45%)

Mutual labels: speech

nlp-class

A Natural Language Processing course taught by Professor Ghassemi

Stars: ✭ 95 (+137.5%)

Mutual labels: speech

VAD-LTSD

Efficient voice activity detection algorithm using long-term speech information

Stars: ✭ 37 (-7.5%)

Mutual labels: speech

room-impulse-responses

A list of publicly available room impulse response datasets and scripts to download them.

Stars: ✭ 143 (+257.5%)

Mutual labels: speech

AdaSpeech

AdaSpeech: Adaptive Text to Speech for Custom Voice

Stars: ✭ 108 (+170%)

Mutual labels: speech

gtranscribe

Software for interview transcription

Stars: ✭ 12 (-70%)

Mutual labels: speech

Phomeme

Simple sentence mixing tool (work in progress)

Stars: ✭ 18 (-55%)

Mutual labels: speech

opensnips

Open source projects related to Snips https://snips.ai/.

Stars: ✭ 50 (+25%)

Mutual labels: speech

ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Stars: ✭ 112 (+180%)

Mutual labels: ctc

speech-transformer

Transformer implementation speciaized in speech recognition tasks using Pytorch.

Stars: ✭ 40 (+0%)

Mutual labels: speech

TFGAN

TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

Stars: ✭ 65 (+62.5%)

Mutual labels: speech

nabaztag-php

a simple php implementation of a Nabaztag server

Stars: ✭ 14 (-65%)

Mutual labels: speech

lidbox

End-to-end spoken language identification out of the box.

Stars: ✭ 39 (-2.5%)

Mutual labels: speech

JD-NMF

Joint Dictionary Learning-based Non-Negative Matrix Factorization for Voice Conversion (TBME 2016)

Stars: ✭ 20 (-50%)

Mutual labels: speech

SignDetect

This application is developed to help speechless people interact with others with ease. It detects voice and converts the input speech into a sign language based video.

Stars: ✭ 21 (-47.5%)

Mutual labels: speech

eidos-audition

Collection of auditory models.

Stars: ✭ 25 (-37.5%)

Mutual labels: speech

Voice2Mesh

CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?

Stars: ✭ 67 (+67.5%)

Mutual labels: speech

D-TDNN

PyTorch implementation of Densely Connected Time Delay Neural Network

Stars: ✭ 60 (+50%)

Mutual labels: speech

cape

Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch

Stars: ✭ 29 (-27.5%)

Mutual labels: speech

voice-based-email-for-blind

Emailing System for visually impaired persons

Stars: ✭ 35 (-12.5%)

Mutual labels: speech

web-speech-demo

Learn how to build a simple text-to-speech voice app for the web using the Web Speech API.

Stars: ✭ 19 (-52.5%)

Mutual labels: speech

CVC

CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)

Stars: ✭ 45 (+12.5%)

Mutual labels: speech

Rus-SpeechRecognition-LSTM-CTC-VoxForge

Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge

Stars: ✭ 50 (+25%)

Mutual labels: ctc

aframe-speech-controls-component

alternative form of inputs for in-VR interaction with the content of a scene

Stars: ✭ 13 (-67.5%)

Mutual labels: speech

Speech Feature Extraction

Feature extraction of speech signal is the initial stage of any speech recognition system.

Stars: ✭ 78 (+95%)

Mutual labels: speech

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-65%)

Mutual labels: speech

MelNet-SpeechGeneration

Implementation of MelNet in PyTorch to generate high-fidelity audio samples

Stars: ✭ 19 (-52.5%)

Mutual labels: speech

StyleSpeech

Official implementation of Meta-StyleSpeech and StyleSpeech

Stars: ✭ 161 (+302.5%)

Mutual labels: speech

linear16

Converts an audio file to LINEAR16 Google-speech compatible file.

Stars: ✭ 14 (-65%)

Mutual labels: speech

audio noise clustering

https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.

Stars: ✭ 24 (-40%)

Mutual labels: speech

speech to text

how to use the Google Cloud Speech API to transcribe audio/video files.

Stars: ✭ 35 (-12.5%)

Mutual labels: speech

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (+122.5%)

Mutual labels: speech

DeepSegmentor

Sequence Segmentation using Joint RNN and Structured Prediction Models (ICASSP 2017)

Stars: ✭ 17 (-57.5%)

Mutual labels: speech

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-47.5%)

Mutual labels: speech

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (+295%)

Mutual labels: speech

opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in the voice stack

Stars: ✭ 21 (-47.5%)

Mutual labels: speech

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Stars: ✭ 13,870 (+34575%)

Mutual labels: speech

FAST-RIR

This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating room impulse responses (RIRs) for a given acoustic environment.

Stars: ✭ 90 (+125%)

Mutual labels: speech

TASNET

Time-domain Audio Separation Network (IN PYTORCH)

Stars: ✭ 18 (-55%)

Mutual labels: speech

rnn benchmarks

RNN benchmarks of pytorch, tensorflow and theano

Stars: ✭ 85 (+112.5%)

Mutual labels: ctc

deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (+105%)

Mutual labels: speech

NBSS

The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".

Stars: ✭ 77 (+92.5%)

Mutual labels: speech

HTK

The Hidden Markov Model Toolkit (HTK) from University of Cambridge, with fixed issues.

Stars: ✭ 23 (-42.5%)

Mutual labels: speech

icassp2019-latex-template

ICASSP 2019 official Latex template

Stars: ✭ 21 (-47.5%)

Mutual labels: speech

kaldi helpers

🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.