Asr
Stars: ✭ 54 (-73.66%)
Wavenet Stt
An end-to-end speech recognition system with Wavenet. Built using C++ and Python.
Stars: ✭ 18 (-91.22%)
Ml Road
Machine Learning Resources, Practice and Research
Stars: ✭ 1,776 (+766.34%)
Speechpy
💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
Stars: ✭ 833 (+306.34%)
porfir
Voice assistant Порфирьевич (Porfirevich)
Stars: ✭ 23 (-88.78%)
Voice Synthesis
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real time. SV2TTS is a three-stage deep learning framework that makes it possible to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generalize to new voices.
Stars: ✭ 51 (-75.12%)
Esp8266sam
Speech synthesis for ESP8266 using S.A.M. port
Stars: ✭ 199 (-2.93%)
Clovacall
ClovaCall dataset and PyTorch LAS baseline code (Interspeech 2020)
Stars: ✭ 151 (-26.34%)
Transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Stars: ✭ 55,742 (+27091.22%)
Nonocaptcha
An asynchronous Python library to automate solving ReCAPTCHA v2 using audio
Stars: ✭ 744 (+262.93%)
YouTube-Tutorials--Italian
📂 Source Code for (some of) the Programming Tutorials from my Italian YouTube Channel and website ProgrammareInPython.it. This is just a small portion of the content: please visit the website for more.
Stars: ✭ 28 (-86.34%)
Parrots
Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) engine for Chinese.
Stars: ✭ 48 (-76.59%)
Project alias
Alias is a teachable “parasite” that is designed to give users more control over their smart assistants, both when it comes to customisation and privacy. Through a simple app, the user can train Alias to react to a custom wake-word/sound, and once trained, Alias can take control over your home assistant by activating it for you.
Stars: ✭ 1,577 (+669.27%)
ttslearn
ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)
Stars: ✭ 158 (-22.93%)
Cortex M Kws
Cortex M KWS example with Tengine Lite.
Stars: ✭ 45 (-78.05%)
Athena
A free and open source replacement for Google Assistant on Android devices, meant to integrate with the Sapphire Framework. It contains both speech-to-text and text-to-speech services. It does not require Google services or network connectivity.
Stars: ✭ 73 (-64.39%)
Jiwer
Evaluate your speech-to-text system with similarity measures such as word error rate (WER)
Stars: ✭ 158 (-22.93%)
Wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
Stars: ✭ 5,907 (+2781.46%)
Formant Analyzer
iOS application for finding formants in spoken sounds
Stars: ✭ 43 (-79.02%)
kaldi-alligner
Scripts to align a given wave to its transcription using models trained with Kaldi
Stars: ✭ 24 (-88.29%)
Sounder
An intent-recognizing algorithm to predict the intent of a given text.
Stars: ✭ 118 (-42.44%)
Praat
Praat: Doing Phonetics By Computer
Stars: ✭ 675 (+229.27%)
Swiftspeech
A speech recognition framework designed for SwiftUI.
Stars: ✭ 149 (-27.32%)
Pansori
Tools for ASR Corpus Generation from Online Video
Stars: ✭ 106 (-48.29%)
Awesome Diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
Stars: ✭ 673 (+228.29%)
KAREN
KAREN: Unifying Hatespeech Detection and Benchmarking
Stars: ✭ 18 (-91.22%)
Dialectid e2e
End to End Dialect Identification using Convolutional Neural Network
Stars: ✭ 40 (-80.49%)
Cidlib
The CIDLib general purpose C++ development environment
Stars: ✭ 179 (-12.68%)
Segan
Speech Enhancement Generative Adversarial Network in TensorFlow
Stars: ✭ 661 (+222.44%)
Voice
🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)
Stars: ✭ 993 (+384.39%)
nlp-class
A Natural Language Processing course taught by Professor Ghassemi
Stars: ✭ 95 (-53.66%)
Wsay
Windows "say"
Stars: ✭ 36 (-82.44%)
Speech Recognition Neural Network
This is an end-to-end speech recognition neural network, deployed in Keras. This was my final project for the Artificial Intelligence Nanodegree at Udacity.
Stars: ✭ 148 (-27.8%)
Ios ml
List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.
Stars: ✭ 1,409 (+587.32%)
Voicy
@voicybot Telegram bot main repository
Stars: ✭ 620 (+202.44%)
Kaldi Gop
Computes the GMM-based Goodness of Pronunciation (GOP). Based on Kaldi.
Stars: ✭ 104 (-49.27%)
Speech Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
Stars: ✭ 565 (+175.61%)
Listen Attend Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
Stars: ✭ 147 (-28.29%)
Arvutaja
An Android app for voice actions in Estonian and English
Stars: ✭ 28 (-86.34%)
Tensorflow-Keyword-Spotting
Keyword spotting using various architectures such as a convolutional VGGNet, a 1D convolutional network, and CTC.
Stars: ✭ 27 (-86.83%)
gtranscribe
Software for interview transcription
Stars: ✭ 12 (-94.15%)
Aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Stars: ✭ 1,942 (+847.32%)
Ctcdecoder
Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Stars: ✭ 529 (+158.05%)
Tacotron
Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Stars: ✭ 493 (+140.49%)
Mycroft Precise
A lightweight, simple-to-use, RNN wake word listener
Stars: ✭ 481 (+134.63%)
Xr3player
🎧 🎼 Advanced JavaFX Media Player
Stars: ✭ 472 (+130.24%)
Rhasspy
Offline private voice assistant for many human languages
Stars: ✭ 458 (+123.41%)