Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+40.97%)

Mutual labels: seq2seq, speech-recognition, speaker-verification

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-96.42%)

Mutual labels: nlu, speech-recognition, asr

opensnips

Open source projects related to Snips https://snips.ai/.

Stars: ✭ 50 (-96.62%)

Mutual labels: nlu, speech, asr

Pytorch Asr

ASR with PyTorch

Stars: ✭ 124 (-91.62%)

Mutual labels: speech-recognition, speech, asr

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+653.96%)

Mutual labels: speech-recognition, speech, speaker-verification

Kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition.

Stars: ✭ 190 (-87.15%)

Mutual labels: speech-recognition, seq2seq, asr

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

Stars: ✭ 2,097 (+41.78%)

Mutual labels: speech-recognition, speech, asr

Syn Speech

Syn.Speech is a flexible, speaker-independent continuous speech recognition engine for Mono and the .NET Framework

Stars: ✭ 57 (-96.15%)

Mutual labels: speech-recognition, speech, asr

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (-84.85%)

Mutual labels: speech, speech-recognition, speaker-verification

Openasr

A PyTorch-based end-to-end speech recognition system.

Stars: ✭ 69 (-95.33%)

Mutual labels: speech-recognition, speech, asr

ASR-Audio-Data-Links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (-87.9%)

Mutual labels: speech, speech-recognition, asr

torch-asg

Auto Segmentation Criterion (ASG) implemented in PyTorch

Stars: ✭ 42 (-97.16%)

Mutual labels: speech, seq2seq, asr

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (-91.68%)

Mutual labels: speech, speech-recognition, asr

Rnn For Joint Nlu

TensorFlow implementation of "Attention-Based Recurrent Neural Network Models for Joint Intent Detection and Slot Filling" (https://arxiv.org/abs/1609.01454)

Stars: ✭ 281 (-81%)

Mutual labels: seq2seq, nlu

Vosk Android Demo

Offline speech recognition for Android with the Vosk library.

Stars: ✭ 271 (-81.68%)

Mutual labels: speech-recognition, asr

Vosk Server

WebSocket, gRPC and WebRTC speech recognition server based on the Vosk and Kaldi libraries

Stars: ✭ 277 (-81.27%)

Mutual labels: speech-recognition, asr

Bert seq2seq

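PyTorch implementation of BERT for seq2seq tasks using the UniLM scheme. It now also handles automatic summarization, text classification, sentiment analysis, NER, part-of-speech tagging, and similar tasks, and supports GPT-2 for article continuation.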

Stars: ✭ 298 (-79.85%)

Mutual labels: text-classification, seq2seq

Textbox

TextBox is an open-source library for building text generation systems.

Stars: ✭ 257 (-82.62%)

Mutual labels: text-generation, sequence-to-sequence

Pocketsphinx Python

Python interface to CMU Sphinxbase and Pocketsphinx libraries

Stars: ✭ 298 (-79.85%)

Mutual labels: speech-recognition, speech

Ktspeechcrawler

Automatically constructs a corpus for automatic speech recognition from YouTube videos

Stars: ✭ 92 (-93.78%)

Mutual labels: speech-recognition, asr

Pytorch Chatbot

PyTorch seq2seq chatbot

Stars: ✭ 336 (-77.28%)

Mutual labels: seq2seq, sequence-to-sequence

Snips Nlu Rs

Rust implementation of Snips NLU

Stars: ✭ 315 (-78.7%)

Mutual labels: inference, nlu

Snips Nlu

Snips Python library to extract meaning from text

Stars: ✭ 3,583 (+142.26%)

Mutual labels: text-classification, nlu

Zamia Speech

Open tools and data for cloudless automatic speech recognition

Stars: ✭ 374 (-74.71%)

Mutual labels: speech-recognition, asr

UnityASR

Automatic Speech Recognition in Unity.

Stars: ✭ 14 (-99.05%)

Mutual labels: speech-recognition, asr

Tensorflow end2end speech recognition

End-to-end speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)

Stars: ✭ 305 (-79.38%)

Mutual labels: speech-recognition, asr

Nlp Projects

word2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (e.g., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (e.g., entity, relation and event extraction), knowledge graph, text generation, network embedding

Stars: ✭ 360 (-75.66%)

Mutual labels: text-classification, text-generation

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (-93.04%)

Mutual labels: speech-recognition, nlu

Awesome Kaldi

This is a list of features, scripts, blogs and resources for making better use of Kaldi (http://kaldi-asr.org/)

Stars: ✭ 393 (-73.43%)

Mutual labels: speech-recognition, speech

Audiomate

Python library for handling audio datasets.

Stars: ✭ 99 (-93.31%)

Mutual labels: speech-recognition, speech

Tf Seq2seq

Sequence to sequence learning using TensorFlow.

Stars: ✭ 387 (-73.83%)

Mutual labels: seq2seq, sequence-to-sequence

Specaugment

An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain

Stars: ✭ 408 (-72.41%)

Mutual labels: speech-recognition, speech

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (-64.71%)

Mutual labels: speech-recognition, asr

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (-66.87%)

Mutual labels: speech-recognition, speech

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (-64.03%)

Mutual labels: speech-recognition, speech

demo vietasr

Vietnamese Speech Recognition

Stars: ✭ 22 (-98.51%)

Mutual labels: speech-recognition, asr

Cheetah

On-device streaming speech-to-text engine powered by deep learning