All Projects → EliasCai → speech_recognition_ctc

EliasCai / speech_recognition_ctc

Licence: other
Use ctc to do chinese speech recognition by keras / 通过keras和ctc实现中文语音识别

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to speech recognition ctc

Pytorch Asr
ASR with PyTorch
Stars: ✭ 124 (+210%)
Mutual labels:  speech, ctc
torch-asg
Auto Segmentation Criterion (ASG) implemented in pytorch
Stars: ✭ 42 (+5%)
Mutual labels:  speech, ctc
Kerasdeepspeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+512.5%)
Mutual labels:  speech, ctc
Neural sp
End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+920%)
Mutual labels:  speech, ctc
Multimodal-Gesture-Recognition-with-LSTMs-and-CTC
An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.
Stars: ✭ 25 (-37.5%)
Mutual labels:  speech, ctc
gtranscribe
Software for interview transcription
Stars: ✭ 12 (-70%)
Mutual labels:  speech
opensnips
Open source projects related to Snips https://snips.ai/.
Stars: ✭ 50 (+25%)
Mutual labels:  speech
CRNN.tf2
Convolutional Recurrent Neural Network(CRNN) for End-to-End Text Recognition - TensorFlow 2
Stars: ✭ 131 (+227.5%)
Mutual labels:  ctc
speech-transformer
Transformer implementation speciaized in speech recognition tasks using Pytorch.
Stars: ✭ 40 (+0%)
Mutual labels:  speech
ttslearn
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (+295%)
Mutual labels:  speech
nabaztag-php
a simple php implementation of a Nabaztag server
Stars: ✭ 14 (-65%)
Mutual labels:  speech
nlp-class
A Natural Language Processing course taught by Professor Ghassemi
Stars: ✭ 95 (+137.5%)
Mutual labels:  speech
web-speech-demo
Learn how to build a simple text-to-speech voice app for the web using the Web Speech API.
Stars: ✭ 19 (-52.5%)
Mutual labels:  speech
Rus-SpeechRecognition-LSTM-CTC-VoxForge
Распознавание речи русского языка используя Tensorflow, обучаясь на базе Voxforge
Stars: ✭ 50 (+25%)
Mutual labels:  ctc
Speech Feature Extraction
Feature extraction of speech signal is the initial stage of any speech recognition system.
Stars: ✭ 78 (+95%)
Mutual labels:  speech
MelNet-SpeechGeneration
Implementation of MelNet in PyTorch to generate high-fidelity audio samples
Stars: ✭ 19 (-52.5%)
Mutual labels:  speech
linear16
Converts an audio file to LINEAR16 Google-speech compatible file.
Stars: ✭ 14 (-65%)
Mutual labels:  speech
TASNET
Time-domain Audio Separation Network (IN PYTORCH)
Stars: ✭ 18 (-55%)
Mutual labels:  speech
HTK
The Hidden Markov Model Toolkit (HTK) from University of Cambridge, with fixed issues.
Stars: ✭ 23 (-42.5%)
Mutual labels:  speech
Voice2Mesh
CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?
Stars: ✭ 67 (+67.5%)
Mutual labels:  speech

speech_recognition_ctc(中文语音识别)

数据来源

代码说明

  • 1-1-generate.py: 生成MFCC特征矩阵,方便2-2和2-3的代码调用(一次性载入内存)
  • 2-2-ctc_speech_thchs30.py:使用清华语音库进行训练,输入为一次性加载,输出为文字
  • 2-3-ctc_speech_thchs30_pinyin.py:使用清华语音库进行训练,输入为一次性加载,输出为拼音
  • 2-4-ctc_speech_aishell.py:使用希尔贝壳语音库进行训练,输入为生成器,输出为文字
  • 2-5-ctc_speech_thchs.py:使用清华语音库进行训练,输入为生成器,输出为文字
  • 2-6-ctc_speech_both.py:使用清华和希尔贝壳语音库进行训练,输入为生成器,输出为文字
  • 2-7-ctc_speech_pinyin.py:使用清华和希尔贝壳语音库进行训练,输入为生成器,输出为拼音
  • utils.py:包含数据的读取和处理,由2-4调用
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].