Wer are weAttempt at tracking states of the arts and recent results (bibliography) on speech recognition.
Stars: ✭ 1,684 (+1147.41%)
PyspeechrevThis python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of acoustic impulse responses.
Stars: ✭ 74 (-45.19%)
PansoriTools for ASR Corpus Generation from Online Video
Stars: ✭ 106 (-21.48%)
Asr benchmarkProgram to benchmark various speech recognition APIs
Stars: ✭ 71 (-47.41%)
Project aliasAlias is a teachable “parasite” that is designed to give users more control over their smart assistants, both when it comes to customisation and privacy. Through a simple app the user can train Alias to react on a custom wake-word/sound, and once trained, Alias can take control over your home assistant by activating it for you.
Stars: ✭ 1,577 (+1068.15%)
Speech aiSimple speech linguistic AI with Python
Stars: ✭ 66 (-51.11%)
Spokestack PythonSpokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-23.7%)
PapersA list of paper, books and sites for various different topics related to machine learning and deep learning along with various field in which it is implemented
Stars: ✭ 63 (-53.33%)
PersephoneA tool for automatic phoneme transcription
Stars: ✭ 130 (-3.7%)
Angle⦠ Angle: new speakable syntax for python 💡
Stars: ✭ 61 (-54.81%)
Wav2letter.pytorchA fully convolution-network for speech-to-text, built on pytorch.
Stars: ✭ 104 (-22.96%)
SounderAn intent recognizing algorithm to predict the intent of a given text.
Stars: ✭ 118 (-12.59%)
Speech And TextSpeech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)
Stars: ✭ 102 (-24.44%)
BiglittlenetOfficial repository for Big-Little Net
Stars: ✭ 57 (-57.78%)
Iflytek awaken asruse iflytek's technology to realize awaken and order recognition
Stars: ✭ 53 (-60.74%)
NonautoreggenprogressTracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (-12.59%)
Openseq2seqToolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+920.74%)
SoloudFree, easy, portable audio engine for games
Stars: ✭ 1,048 (+676.3%)
Parrots Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese.
Stars: ✭ 48 (-64.44%)
Vosk ApiOffline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+905.19%)
Keras SincnetKeras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-65.19%)
Py NltoolsA collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-65.93%)
Cortex M KwsCortex M KWS example with Tengine Lite.
Stars: ✭ 45 (-66.67%)
WikipronMassively multilingual pronunciation mining
Stars: ✭ 99 (-26.67%)
StlThe ITU-T Software Tool Library (G.191)
Stars: ✭ 44 (-67.41%)
VocA physical model of the human vocal tract using literate programming, based on Pink Trombone.
Stars: ✭ 129 (-4.44%)
Formant AnalyzeriOS application for finding formants in spoken sounds
Stars: ✭ 43 (-68.15%)
Factorized TdnnPyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Stars: ✭ 98 (-27.41%)
Avsr Deep SpeechGoogle Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab
Stars: ✭ 43 (-68.15%)
Dc ttsA TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: ✭ 1,017 (+653.33%)
Artyom.jsA voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Stars: ✭ 1,011 (+648.89%)
Dialectid e2eEnd to End Dialect Identification using Convolutional Neural Network
Stars: ✭ 40 (-70.37%)
Ai Study人工智能学习资料超全整理,包含机器学习基础ML、深度学习基础DL、计算机视觉CV、自然语言处理NLP、推荐系统、语音识别、图神经网路、算法工程师面试题
Stars: ✭ 93 (-31.11%)
PnccA implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-70.37%)
KtspeechcrawlerAutomatically constructing corpus for automatic speech recognition from YouTube videos
Stars: ✭ 92 (-31.85%)
Voice🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)
Stars: ✭ 993 (+635.56%)
WsayWindows "say"
Stars: ✭ 36 (-73.33%)
Lip Reading Deeplearning🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
Stars: ✭ 1,641 (+1115.56%)
Rnn TransducerMXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
Stars: ✭ 114 (-15.56%)
Cross vcCross-lingual Voice Conversion
Stars: ✭ 91 (-32.59%)
LightspeechLightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Stars: ✭ 31 (-77.04%)
Deep Learning DrizzleDrench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Stars: ✭ 9,717 (+7097.78%)
ArvutajaAn Android app for voice actions in Estonian and English
Stars: ✭ 28 (-79.26%)
RhasspyRhasspy voice assistant for offline home automation
Stars: ✭ 851 (+530.37%)
KontinuousspeechrecognizerA Kotlin Speech Recognizer that runs continuously and is triggered with an activation keyword
Stars: ✭ 113 (-16.3%)
GttsPython library and CLI tool to interface with Google Translate's text-to-speech API
Stars: ✭ 1,303 (+865.19%)
Kaldi Gstreamer ServerReal-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
Stars: ✭ 935 (+592.59%)
Assistant ClientИнструмент для тестирования и отладки СanvasApps c семейством Виртуальных Ассистентов "Салют"
Stars: ✭ 26 (-80.74%)