Pncc
An implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-79.59%)
Transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Stars: ✭ 55,742 (+28339.8%)
Bigcidian
Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.
Stars: ✭ 99 (-49.49%)
Speechpy
💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
Stars: ✭ 833 (+325%)
Arvutaja
An Android app for voice actions in Estonian and English
Stars: ✭ 28 (-85.71%)
Kaldi Gstreamer Server
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.
Stars: ✭ 935 (+377.04%)
Pansori
Tools for ASR Corpus Generation from Online Video
Stars: ✭ 106 (-45.92%)
Wavenet Stt
An end-to-end speech recognition system with WaveNet. Built using C++ and Python.
Stars: ✭ 18 (-90.82%)
Espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Stars: ✭ 808 (+312.24%)
Kaldi Onnx
Kaldi model converter to ONNX
Stars: ✭ 174 (-11.22%)
Speech Recognition Neural Network
This is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.
Stars: ✭ 148 (-24.49%)
Ios ml
List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.
Stars: ✭ 1,409 (+618.88%)
Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+289.8%)
Pykaldi
A Python wrapper for Kaldi
Stars: ✭ 756 (+285.71%)
Delta
DELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+654.59%)
Nonocaptcha
An asynchronous Python library to automate solving ReCAPTCHA v2 using audio
Stars: ✭ 744 (+279.59%)
Kaldi Gop
Computes the GMM-based Goodness of Pronunciation (GOP). Based on Kaldi.
Stars: ✭ 104 (-46.94%)
Wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 5,907 (+2913.78%)
Awesome Diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (+243.37%)
Speech Emotion Analyzer
The neural network model is capable of detecting five different male/female emotions from audio speech. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+222.96%)
Libreasr
💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (+222.96%)
Dla
Deep learning for audio processing
Stars: ✭ 142 (-27.55%)
Voicy
@voicybot Telegram bot main repository
Stars: ✭ 620 (+216.33%)
Vad
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (+217.35%)
Wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (+214.8%)
Open stt
Open STT
Stars: ✭ 584 (+197.96%)
Nodejs Speech
Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.
Stars: ✭ 545 (+178.06%)
Audiomate
Python library for handling audio datasets.
Stars: ✭ 99 (-49.49%)
Athena
An open-source implementation of a sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+176.53%)
Factorized Tdnn
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Stars: ✭ 98 (-50%)
Ctcdecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Stars: ✭ 529 (+169.9%)
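Ai Study
A comprehensive collection of AI learning materials, covering machine learning fundamentals (ML), deep learning fundamentals (DL), computer vision (CV), natural language processing (NLP), recommender systems, speech recognition, graph neural networks, and algorithm engineer interview questions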
Stars: ✭ 93 (-52.55%)
Mycroft Precise
A lightweight, simple-to-use, RNN wake word listener
Stars: ✭ 481 (+145.41%)
Allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (-31.12%)
Dexter
Let your talking do the code
Stars: ✭ 93 (-52.55%)
Rhasspy
Offline private voice assistant for many human languages
Stars: ✭ 458 (+133.67%)
Ktspeechcrawler
Automatically constructing a corpus for automatic speech recognition from YouTube videos
Stars: ✭ 92 (-53.06%)
Uspeech
Speech recognition toolkit for the Arduino
Stars: ✭ 448 (+128.57%)
Cross vc
Cross-lingual Voice Conversion
Stars: ✭ 91 (-53.57%)
Deep Learning Drizzle
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Stars: ✭ 9,717 (+4857.65%)
Specaugment
An implementation of SpecAugment with TensorFlow and PyTorch, introduced by Google Brain
Stars: ✭ 408 (+108.16%)
Speaker adapted tts
Making a TTS model with 1 minute of speech samples within 10 minutes
Stars: ✭ 183 (-6.63%)
Pytorch Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.
Stars: ✭ 2,097 (+969.9%)
Neural sp
End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (+108.16%)
Ctcwordbeamsearch
Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Stars: ✭ 398 (+103.06%)
Speechless
Speech-to-text based on wav2letter built for transfer learning
Stars: ✭ 89 (-54.59%)
Persephone
A tool for automatic phoneme transcription
Stars: ✭ 130 (-33.67%)
Nmtpytorch
Sequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (+100%)