An end-to-end system that performs temporal recognition of gesture sequences using speech and skeletal input. The model combines three networks with a CTC output layer that recognises gestures from continuous stream.

Stars: ✭ 25 (-81.48%)

Mutual labels: speech

Listen-Attend-Spell-v2

PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).

Stars: ✭ 29 (-78.52%)

Mutual labels: speech-recognition

lightning-asr

Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.

Stars: ✭ 36 (-73.33%)

Mutual labels: speech-recognition

UHV-OTS-Speech

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Stars: ✭ 94 (-30.37%)

Mutual labels: speech-recognition

hifigan-denoiser

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Stars: ✭ 88 (-34.81%)

Mutual labels: speech

Transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Stars: ✭ 55,742 (+41190.37%)

Mutual labels: speech-recognition

Masr

中文语音识别; Mandarin Automatic Speech Recognition;

Stars: ✭ 1,246 (+822.96%)

Mutual labels: speech-recognition

Speech recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Stars: ✭ 5,999 (+4343.7%)

Mutual labels: speech-recognition

vosk-model-ru-adaptation

No description or website provided.

Stars: ✭ 19 (-85.93%)

Mutual labels: speech-recognition

Dc tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

Stars: ✭ 1,017 (+653.33%)

Mutual labels: speech

browser-apis

🦄 Cool & Fun Browser Web APIs 🥳

Stars: ✭ 21 (-84.44%)

Mutual labels: speech

minutes

🔭 Speaker diarization via transfer learning

Stars: ✭ 25 (-81.48%)

Mutual labels: speech

Automatic speech recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Stars: ✭ 2,751 (+1937.78%)

Mutual labels: speech-recognition

Mongolian Speech Recognition

Mongolian speech recognition with PyTorch

Stars: ✭ 97 (-28.15%)

Mutual labels: speech-recognition

Cn2an

📦 快速转化「中文数字」和「阿拉伯数字」～ (最新特性：分数，日期、温度等转化）

Stars: ✭ 249 (+84.44%)

Mutual labels: speech-recognition

demo vietasr

Vietnamese Speech Recognition

Stars: ✭ 22 (-83.7%)

Mutual labels: speech-recognition

Dialectid e2e

End to End Dialect Identification using Convolutional Neural Network

Stars: ✭ 40 (-70.37%)

Mutual labels: speech

Nemo

NeMo: a toolkit for conversational AI

Stars: ✭ 3,685 (+2629.63%)

Mutual labels: speech-recognition

flite-go

Go bindings for Flite (festival-lite)

Stars: ✭ 14 (-89.63%)

Mutual labels: speech

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Stars: ✭ 196 (+45.19%)

Mutual labels: speech-recognition

quran-align

Word-accurate timestamps for Qur'anic audio.

Stars: ✭ 139 (+2.96%)

Mutual labels: speech-recognition

opensnips

Open source projects related to Snips https://snips.ai/.

Stars: ✭ 50 (-62.96%)

Mutual labels: speech

Voice

🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)

Stars: ✭ 993 (+635.56%)

Mutual labels: speech-recognition

StageMate

StageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.

Stars: ✭ 60 (-55.56%)

Mutual labels: speech-recognition

B.e.n.j.i.

B.E.N.J.I.- The Impossible Missions Force's digital assistant

Stars: ✭ 83 (-38.52%)

Mutual labels: speech-recognition

Libreasr

💬 An On-Premises, Streaming Speech Recognition System

Stars: ✭ 633 (+368.89%)

Mutual labels: speech-recognition

pocketsphinx

Updated ROS bindings to pocketsphinx

Stars: ✭ 36 (-73.33%)

Mutual labels: speech-recognition

Ktspeechcrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

Stars: ✭ 92 (-31.85%)

Mutual labels: speech-recognition

Voice Overlay Android

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 189 (+40%)

Mutual labels: speech-recognition

tt-vae-gan

Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source to that of another.

Stars: ✭ 37 (-72.59%)

Mutual labels: speech

Kaldi Offline Transcriber

Offline transcription system for Estonian using Kaldi

Stars: ✭ 182 (+34.81%)

Mutual labels: speech-recognition

Wsay

Windows "say"

Stars: ✭ 36 (-73.33%)

Mutual labels: speech

Deepspeech German

Automatic Speech Recognition (ASR) - German