Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+457.49%)

Mutual labels: language-model, speech-recognition

mongolian-nlp

Useful resources for Mongolian NLP

Stars: ✭ 119 (-68.18%)

Mutual labels: speech-recognition, language-model

torchain

WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)

Stars: ✭ 20 (-94.65%)

Mutual labels: kaldi, asr

Docker Kaldi Gstreamer Server

Dockerfile for kaldi-gstreamer-server.

Stars: ✭ 266 (-28.88%)

Mutual labels: asr, kaldi

Openasr

A pytorch based end2end speech recognition system.

Stars: ✭ 69 (-81.55%)

Mutual labels: speech-recognition, asr

Wav2letter

Speech Recognition model based off of FAIR research paper built using Pytorch.

Stars: ✭ 78 (-79.14%)

Mutual labels: speech-recognition, asr

leopard

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (-5.35%)

Mutual labels: speech-recognition, asr

syn-speech-samples

An application that demostrate the usage of Syn.Speech library for Speech Recognition

Stars: ✭ 24 (-93.58%)

Mutual labels: speech-recognition, asr

srvk-eesen-offline-transcriber

Top level code to transcribe English audio/video files into text/subtitles

Stars: ✭ 22 (-94.12%)

Mutual labels: speech-recognition, kaldi

speech-to-text

mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras

Stars: ✭ 61 (-83.69%)

Mutual labels: speech-recognition, kaldi

Pocketsphinx

PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop

Stars: ✭ 2,934 (+684.49%)

Mutual labels: speech-recognition

scim

[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.

Stars: ✭ 17 (-95.45%)

Mutual labels: speech-recognition

Speech Aligner

speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription

Stars: ✭ 259 (-30.75%)

Mutual labels: kaldi

VoiceCom

A Simple Voice Command Application powered by Java and Sphinx4 Speech Recognition Library

Stars: ✭ 17 (-95.45%)

Mutual labels: speech-recognition

porfir

Голосовой ассистент Порфирьевич

Stars: ✭ 23 (-93.85%)

Mutual labels: speech-recognition

A Pytorch Tutorial To Sequence Labeling

Empower Sequence Labeling with Task-Aware Neural Language Model | a PyTorch Tutorial to Sequence Labeling

Stars: ✭ 257 (-31.28%)

Mutual labels: language-model

commonvoice-utils

Linguistic processing for Common Voice

Stars: ✭ 32 (-91.44%)

Mutual labels: asr

Brevitas

Brevitas: quantization-aware training in PyTorch

Stars: ✭ 343 (-8.29%)

Mutual labels: speech-recognition

Gpt Neox

An implementation of model parallel GPT-3-like models on GPUs, based on the DeepSpeed library. Designed to be able to train models in the hundreds of billions of parameters or larger.

Stars: ✭ 303 (-18.98%)

Mutual labels: language-model

HotVoice

Adds Speech Recognition support to AutoHotkey, via a C# DLL

Stars: ✭ 41 (-89.04%)

Mutual labels: speech-recognition

SDLM-pytorch

Code accompanying EMNLP 2018 paper Language Modeling with Sparse Product of Sememe Experts

Stars: ✭ 27 (-92.78%)

Mutual labels: language-model

YouTube-Tutorials--Italian

📂 Source Code for (some of) the Programming Tutorials from my Italian YouTube Channel and website ProgrammareInPython.it. This is just a small portion of the content: please visit the website for more.

Stars: ✭ 28 (-92.51%)

Mutual labels: speech-recognition

Listen-Attend-Spell-v2

PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).

Stars: ✭ 29 (-92.25%)

Mutual labels: speech-recognition

CCAligner

🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.

Stars: ✭ 131 (-64.97%)

Mutual labels: speech-recognition

rosecho

Tianbot Rosecho (Tianecho)，中文语音人机交互模块，支持ROS即插即用