Wav2letterFacebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 5,907 (+5579.81%)
VoiceBridgeVoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit
Stars: ✭ 17 (-83.65%)
Awesome DiarizationA curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (+547.12%)
mixupspeechpro.com/
Stars: ✭ 23 (-77.88%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (-65.38%)
cobraOn-device voice activity detection (VAD) powered by deep learning.
Stars: ✭ 76 (-26.92%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+508.65%)
Libreasr💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (+508.65%)
VadVoice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (+498.08%)
Pytorch Kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+1916.35%)
WenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (+493.27%)
Speech-RecognitionEnd-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-79.81%)
DeltaDELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+1322.12%)
syn-speech-samplesAn application that demostrate the usage of Syn.Speech library for Speech Recognition
Stars: ✭ 24 (-76.92%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+421.15%)
CtcdecoderConnectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Stars: ✭ 529 (+408.65%)
TF-Speech-Recognition-Challenge-SolutionSource code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.
Stars: ✭ 58 (-44.23%)
KaldiioA pure python module for reading and writing kaldi ark files
Stars: ✭ 160 (+53.85%)
Mycroft PreciseA lightweight, simple-to-use, RNN wake word listener
Stars: ✭ 481 (+362.5%)
VoiceDictation迅飞 语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息,让机器能够“听懂”人类语言,相当于给机器安装上“耳朵”,使其具备“能听”的功能。
Stars: ✭ 36 (-65.38%)
KerasdeepspeechA Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+135.58%)
Rnnt Speech RecognitionEnd-to-end speech recognition using RNN Transducers in Tensorflow 2.0
Stars: ✭ 158 (+51.92%)
RhasspyOffline private voice assistant for many human languages
Stars: ✭ 458 (+340.38%)
Go AstibobGolang framework to build an AI that can understand and speak back to you, and everything else you want
Stars: ✭ 222 (+113.46%)
iOSProjectsIt's project that contains different applications developed with Swift 5.7 👨💻👩🏼💻🧑🏿💻
Stars: ✭ 122 (+17.31%)
Py Kaldi AsrSome simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Stars: ✭ 156 (+50%)
UspeechSpeech recognition toolkit for the arduino
Stars: ✭ 448 (+330.77%)
Speaker adapted ttsMaking a TTS model with 1 minute of speech samples within 10 minutes
Stars: ✭ 183 (+75.96%)
Factorized TdnnPyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Stars: ✭ 98 (-5.77%)
Nlp Models TensorflowGathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0
Stars: ✭ 1,603 (+1441.35%)
Ai Study人工智能学习资料超全整理,包含机器学习基础ML、深度学习基础DL、计算机视觉CV、自然语言处理NLP、推荐系统、语音识别、图神经网路、算法工程师面试题
Stars: ✭ 93 (-10.58%)
Casr Demo基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。
Stars: ✭ 76 (-26.92%)
idear🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (-19.23%)
WatbotAn Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-38.46%)
Cross vcCross-lingual Voice Conversion
Stars: ✭ 91 (-12.5%)
NonocaptchaAn asynchronized Python library to automate solving ReCAPTCHA v2 using audio
Stars: ✭ 744 (+615.38%)
ClovacallClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)
Stars: ✭ 151 (+45.19%)
SpecaugmentA Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Stars: ✭ 408 (+292.31%)
Open sttOpen STT
Stars: ✭ 584 (+461.54%)
Autoedit 2Fast text based video editing, node Electron Os X desktop app, with Backbone front end.
Stars: ✭ 343 (+229.81%)
picovoiceThe end-to-end platform for building voice products at scale
Stars: ✭ 316 (+203.85%)
Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+292.31%)
ZerothKaldi-based Korean ASR (한국어 음성인식) open-source project
Stars: ✭ 248 (+138.46%)
TinyCogSmall Robot, Toy Robot platform
Stars: ✭ 29 (-72.12%)
SwiftspeechA speech recognition framework designed for SwiftUI.
Stars: ✭ 149 (+43.27%)
CtcwordbeamsearchConnectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Stars: ✭ 398 (+282.69%)
LipNetAutomated Lip reading from real-time videos in tensorflow in python
Stars: ✭ 113 (+8.65%)
Speech Recognition Neural NetworkThis is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.
Stars: ✭ 148 (+42.31%)