An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from a continuous stream.

Stars: ✭ 25 (-37.5%)

Mutual labels: speech, ctc

gtranscribe

Software for interview transcription

Stars: ✭ 12 (-70%)

Mutual labels: speech

opensnips

Open source projects related to Snips https://snips.ai/.

Stars: ✭ 50 (+25%)

Mutual labels: speech

CRNN.tf2

Convolutional Recurrent Neural Network (CRNN) for End-to-End Text Recognition - TensorFlow 2

Stars: ✭ 131 (+227.5%)

Mutual labels: ctc

speech-transformer

Transformer implementation specialized in speech recognition tasks using PyTorch.

Stars: ✭ 40 (+0%)

Mutual labels: speech

ttslearn

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

Stars: ✭ 158 (+295%)

Mutual labels: speech

nabaztag-php

A simple PHP implementation of a Nabaztag server

Stars: ✭ 14 (-65%)

Mutual labels: speech

nlp-class

A Natural Language Processing course taught by Professor Ghassemi

Stars: ✭ 95 (+137.5%)

Mutual labels: speech

web-speech-demo

Learn how to build a simple text-to-speech voice app for the web using the Web Speech API.

Stars: ✭ 19 (-52.5%)

Mutual labels: speech

Rus-SpeechRecognition-LSTM-CTC-VoxForge

Russian speech recognition using TensorFlow, trained on the VoxForge dataset

Stars: ✭ 50 (+25%)

Mutual labels: ctc

Speech Feature Extraction

Feature extraction from the speech signal is the initial stage of any speech recognition system.

Stars: ✭ 78 (+95%)

Mutual labels: speech

MelNet-SpeechGeneration

Implementation of MelNet in PyTorch to generate high-fidelity audio samples

Stars: ✭ 19 (-52.5%)

Mutual labels: speech

linear16

Converts an audio file to LINEAR16 Google-speech compatible file.

Stars: ✭ 14 (-65%)

Mutual labels: speech

TASNET

Time-domain Audio Separation Network (IN PYTORCH)

Stars: ✭ 18 (-55%)

Mutual labels: speech

HTK

The Hidden Markov Model Toolkit (HTK) from University of Cambridge, with fixed issues.

Stars: ✭ 23 (-42.5%)

Mutual labels: speech

Voice2Mesh

CVPR 2022: Cross-Modal Perceptionist: Can Face Geometry be Gleaned from Voices?

Stars: ✭ 67 (+67.5%)

Mutual labels: speech


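speech_recognition_ctc (Chinese speech recognition)

Data sources

Code description

1-1-generate.py: generates the MFCC feature matrices for use by the 2-2 and 2-3 scripts (loaded into memory in one go)
2-2-ctc_speech_thchs30.py: trains on the Tsinghua speech corpus (THCHS-30); input is loaded all at once, output is Chinese characters
2-3-ctc_speech_thchs30_pinyin.py: trains on the Tsinghua speech corpus (THCHS-30); input is loaded all at once, output is pinyin
2-4-ctc_speech_aishell.py: trains on the AISHELL speech corpus; input comes from a generator, output is Chinese characters
2-5-ctc_speech_thchs.py: trains on the Tsinghua speech corpus; input comes from a generator, output is Chinese characters
2-6-ctc_speech_both.py: trains on both the Tsinghua and AISHELL speech corpora; input comes from a generator, output is Chinese characters
2-7-ctc_speech_pinyin.py: trains on both the Tsinghua and AISHELL speech corpora; input comes from a generator, output is pinyin
utils.py: data reading and processing helpers, called by 2-4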
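
The script list above distinguishes between loading all features into memory at once and feeding them through a generator. The following is a minimal sketch of the generator approach, not the repository's actual code: `pad_batch`, `batch_generator`, and the `load_features` callable are hypothetical names introduced here for illustration, and features are represented as plain nested lists rather than arrays. It yields zero-padded batches together with the true frame counts, since a CTC loss needs the unpadded input lengths.

```python
from typing import Callable, Iterator, List, Sequence, Tuple

Features = List[List[float]]  # one utterance: frames x coefficients (e.g. MFCCs)


def pad_batch(batch: Sequence[Features]) -> Tuple[List[Features], List[int]]:
    """Zero-pad each utterance to the longest one in the batch.

    Returns the padded utterances and their original frame counts.
    Assumes every utterance has at least one frame.
    """
    n_coeffs = len(batch[0][0])
    lengths = [len(utt) for utt in batch]
    max_len = max(lengths)
    padded = [
        list(utt) + [[0.0] * n_coeffs for _ in range(max_len - len(utt))]
        for utt in batch
    ]
    return padded, lengths


def batch_generator(
    items: Sequence[Tuple[str, str]],
    load_features: Callable[[str], Features],  # hypothetical loader: path -> features
    batch_size: int,
) -> Iterator[Tuple[List[Features], List[int], List[str]]]:
    """Yield (padded_features, input_lengths, labels) one batch at a time.

    Features are loaded lazily per batch, so only one batch is held in
    memory instead of the whole corpus.
    """
    for start in range(0, len(items), batch_size):
        chunk = items[start:start + batch_size]
        feats = [load_features(path) for path, _ in chunk]
        labels = [label for _, label in chunk]
        padded, lengths = pad_batch(feats)
        yield padded, lengths, labels


# Usage with an in-memory stand-in for a real feature loader.
fake_store = {
    "a.wav": [[1.0, 2.0]],
    "b.wav": [[3.0, 4.0], [5.0, 6.0]],
    "c.wav": [[7.0, 8.0]],
}
items = [("a.wav", "ni"), ("b.wav", "hao"), ("c.wav", "ma")]
batches = list(batch_generator(items, fake_store.__getitem__, batch_size=2))
```

The trade-off is the usual one: preloading is simpler and faster per epoch when the corpus fits in memory, while a generator keeps memory use bounded by the batch size at the cost of repeated loading.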


EliasCai / speech_recognition_ctc

