Pytorch Asr
ASR with PyTorch
Stars: ✭ 124 (+210%)
Mutual labels: speech, ctc
torch-asg
Auto Segmentation Criterion (ASG) implemented in PyTorch
Stars: ✭ 42 (+5%)
Mutual labels: speech, ctc
Kerasdeepspeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+512.5%)
Mutual labels: speech, ctc
Neural sp
End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+920%)
Mutual labels: speech, ctc
Multimodal-Gesture-Recognition-with-LSTMs-and-CTC
An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from a continuous stream.
Stars: ✭ 25 (-37.5%)
Mutual labels: speech, ctc
gtranscribe
Software for interview transcription
Stars: ✭ 12 (-70%)
Mutual labels: speech
opensnips
Open source projects related to Snips (https://snips.ai/).
Stars: ✭ 50 (+25%)
Mutual labels: speech
CRNN.tf2
Convolutional Recurrent Neural Network (CRNN) for End-to-End Text Recognition - TensorFlow 2
Stars: ✭ 131 (+227.5%)
Mutual labels: ctc
speech-transformer
Transformer implementation specialized in speech recognition tasks using PyTorch.
Stars: ✭ 40 (+0%)
Mutual labels: speech
ttslearn
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (+295%)
Mutual labels: speech
nabaztag-php
A simple PHP implementation of a Nabaztag server
Stars: ✭ 14 (-65%)
Mutual labels: speech
nlp-class
A Natural Language Processing course taught by Professor Ghassemi
Stars: ✭ 95 (+137.5%)
Mutual labels: speech
web-speech-demo
Learn how to build a simple text-to-speech voice app for the web using the Web Speech API.
Stars: ✭ 19 (-52.5%)
Mutual labels: speech
Speech Feature Extraction
Feature extraction of speech signal is the initial stage of any speech recognition system.
Stars: ✭ 78 (+95%)
Mutual labels: speech
MelNet-SpeechGeneration
Implementation of MelNet in PyTorch to generate high-fidelity audio samples
Stars: ✭ 19 (-52.5%)
Mutual labels: speech
linear16
Converts an audio file to a LINEAR16 file compatible with Google speech.
Stars: ✭ 14 (-65%)
Mutual labels: speech
TASNET
Time-domain Audio Separation Network (in PyTorch)
Stars: ✭ 18 (-55%)
Mutual labels: speech
HTK
The Hidden Markov Model Toolkit (HTK) from University of Cambridge, with fixed issues.
Stars: ✭ 23 (-42.5%)
Mutual labels: speech
Voice2Mesh
CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
Stars: ✭ 67 (+67.5%)
Mutual labels: speech