Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Stars: ✭ 2,085 (+963.78%)

Mutual labels: speech-recognition

Espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Stars: ✭ 808 (+312.24%)

Mutual labels: speech-recognition

Kaldi Onnx

Kaldi model converter to ONNX

Stars: ✭ 174 (-11.22%)

Mutual labels: speech-recognition

Speech Recognition Neural Network

This is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.

Stars: ✭ 148 (-24.49%)

Mutual labels: speech-recognition

Ios ml

List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.

Stars: ✭ 1,409 (+618.88%)

Mutual labels: speech-recognition

Sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

Stars: ✭ 764 (+289.8%)

Mutual labels: speech-recognition

Pykaldi

A Python wrapper for Kaldi

Stars: ✭ 756 (+285.71%)

Mutual labels: speech-recognition

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+654.59%)

Mutual labels: speech-recognition

Nonocaptcha

An asynchronous Python library to automate solving ReCAPTCHA v2 using audio

Stars: ✭ 744 (+279.59%)

Mutual labels: speech-to-text

Kaldi Gop

Computes the GMM-based Goodness of Pronunciation (GOP). Based on Kaldi.

Stars: ✭ 104 (-46.94%)

Mutual labels: speech-recognition

Cordova Plugin Speechrecognition

🎤 Cordova Plugin for Speech Recognition

Stars: ✭ 174 (-11.22%)

Mutual labels: speech-recognition

Wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

Stars: ✭ 5,907 (+2913.78%)

Mutual labels: speech-recognition

Awesome Diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Stars: ✭ 673 (+243.37%)

Mutual labels: speech-recognition

Speech Emotion Analyzer

The neural network model is capable of detecting five different male/female emotions from speech audio. (Deep Learning, NLP, Python)

Stars: ✭ 633 (+222.96%)

Mutual labels: speech-recognition

Libreasr

💬 An On-Premises, Streaming Speech Recognition System

Stars: ✭ 633 (+222.96%)

Mutual labels: speech-recognition

Dla

Deep learning for audio processing

Stars: ✭ 142 (-27.55%)

Mutual labels: speech-recognition

Voicy

@voicybot Telegram bot main repository

Stars: ✭ 620 (+216.33%)

Mutual labels: speech-to-text

Vad

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stars: ✭ 622 (+217.35%)

Mutual labels: speech-recognition

Wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 617 (+214.8%)

Mutual labels: speech-recognition

Open stt

Open STT

Stars: ✭ 584 (+197.96%)

Mutual labels: speech-to-text

Nodejs Speech

Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.

Stars: ✭ 545 (+178.06%)

Mutual labels: speech-to-text

Gst Kaldi Nnet2 Online

GStreamer plugin around Kaldi's online neural network decoder

Stars: ✭ 171 (-12.76%)

Mutual labels: speech-recognition

Aimybox Android Assistant

Embeddable custom voice assistant for Android applications

Stars: ✭ 139 (-29.08%)

Mutual labels: speech-recognition

Audiomate

Python library for handling audio datasets.

Stars: ✭ 99 (-49.49%)

Mutual labels: speech-recognition

Athena

An open-source implementation of a sequence-to-sequence based speech processing engine

Stars: ✭ 542 (+176.53%)

Mutual labels: speech-recognition

Factorized Tdnn

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Stars: ✭ 98 (-50%)

Mutual labels: speech-recognition

Speech recognition

Chinese speech recognition

Stars: ✭ 534 (+172.45%)

Mutual labels: speech-recognition

Ctcdecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.

Stars: ✭ 529 (+169.9%)

Mutual labels: speech-recognition

Ai Study

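A comprehensive collection of AI learning materials, covering machine learning (ML) basics, deep learning (DL) basics, computer vision (CV), natural language processing (NLP), recommender systems, speech recognition, graph neural networks, and algorithm engineer interview questions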

Stars: ✭ 93 (-52.55%)

Mutual labels: speech-recognition

Mycroft Precise

A lightweight, simple-to-use, RNN wake word listener

Stars: ✭ 481 (+145.41%)

Mutual labels: speech-recognition

Allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Stars: ✭ 135 (-31.12%)

Mutual labels: speech-recognition

Dexter

Let your talking do the code

Stars: ✭ 93 (-52.55%)

Mutual labels: speech-to-text

Rhasspy

Offline private voice assistant for many human languages

Stars: ✭ 458 (+133.67%)

Mutual labels: speech-recognition

Ktspeechcrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

Stars: ✭ 92 (-53.06%)

Mutual labels: speech-recognition

Uspeech

Speech recognition toolkit for the Arduino

Stars: ✭ 448 (+128.57%)

Mutual labels: speech-recognition

Cross vc

Cross-lingual Voice Conversion

Stars: ✭ 91 (-53.57%)

Mutual labels: speech-recognition

Deep Learning Drizzle

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

Stars: ✭ 9,717 (+4857.65%)

Mutual labels: speech-recognition

Specaugment

An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain

Stars: ✭ 408 (+108.16%)

Mutual labels: speech-recognition

Speaker adapted tts

Making a TTS model with 1 minute of speech samples within 10 minutes

Stars: ✭ 183 (-6.63%)

Mutual labels: speech-to-text

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

Stars: ✭ 2,097 (+969.9%)

Mutual labels: speech-recognition

Voice activity detection

Voice Activity Detection based on Deep Learning & TensorFlow

Stars: ✭ 132 (-32.65%)

Mutual labels: speech-recognition

Speech Transformer Tf2.0

Transformer for ASR systems (via TensorFlow 2.0)

Stars: ✭ 90 (-54.08%)

Mutual labels: speech-recognition

Neural sp

End-to-end ASR/LM implementation with PyTorch