K6neleAn Android app that offers speech-to-text services and user interfaces to other apps
Stars: ✭ 196 (+176.06%)
Stephanie VaStephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Stars: ✭ 772 (+987.32%)
UnityASRAutomatic Speech Recognition in Unity.
Stars: ✭ 14 (-80.28%)
doctrdocTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Stars: ✭ 1,409 (+1884.51%)
learning invariances in speech recognitionIn this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset…
Stars: ✭ 15 (-78.87%)
DlaDeep learning for audio processing
Stars: ✭ 142 (+100%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+73.24%)
ASR-Audio-Data-LinksA list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+152.11%)
StageMateStageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.
Stars: ✭ 60 (-15.49%)
Go AstideepspeechGolang bindings for Mozilla's DeepSpeech speech-to-text library
Stars: ✭ 137 (+92.96%)
Multi-Hotword SpottingWon't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?
Stars: ✭ 31 (-56.34%)
kaldi ag trainingDocker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-80.28%)
musicologistMusic advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-73.24%)
htkHTK Toolkit with Linux 64 bit and Docker support
Stars: ✭ 14 (-80.28%)
TensorFlow2.0 SSDA tensorflow_2.0 implementation of SSD (Single Shot MultiBox Detector) .
Stars: ✭ 83 (+16.9%)
PersephoneA tool for automatic phoneme transcription
Stars: ✭ 130 (+83.1%)
VoiceComA Simple Voice Command Application powered by Java and Sphinx4 Speech Recognition Library
Stars: ✭ 17 (-76.06%)
Alan Sdk PcfAlan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.
Stars: ✭ 128 (+80.28%)
porfirГолосовой ассистент Порфирьевич
Stars: ✭ 23 (-67.61%)
CaptionThis"Caption This" is an iOS app that adds real-time captions to videos for Instagram Stories
Stars: ✭ 12 (-83.1%)
YouTube-Tutorials--Italian📂 Source Code for (some of) the Programming Tutorials from my Italian YouTube Channel and website ProgrammareInPython.it. This is just a small portion of the content: please visit the website for more.
Stars: ✭ 28 (-60.56%)
Tensorflow Ctc Speech RecognitionApplication of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
Stars: ✭ 127 (+78.87%)
rosechoTianbot Rosecho (Tianecho),中文语音人机交互模块,支持ROS即插即用
Stars: ✭ 28 (-60.56%)
pyspark-ML-in-ColabPyspark in Google Colab: A simple machine learning (Linear Regression) model
Stars: ✭ 32 (-54.93%)
sepia-docsDocumentation and Wiki for SEPIA. Please post your questions and bug-reports here in the issues section! Thank you :-)
Stars: ✭ 160 (+125.35%)
Lip Reading Deeplearning🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
Stars: ✭ 1,641 (+2211.27%)
deepspeechA PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (-36.62%)
wav2vec2-liveA live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+188.73%)
mixupspeechpro.com/
Stars: ✭ 23 (-67.61%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (-49.3%)
opensource-voice-toolsA repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-70.42%)
Dictate.jsA small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Stars: ✭ 195 (+174.65%)
PykaldiA Python wrapper for Kaldi
Stars: ✭ 756 (+964.79%)
quran-alignWord-accurate timestamps for Qur'anic audio.
Stars: ✭ 139 (+95.77%)
Wer are weAttempt at tracking states of the arts and recent results (bibliography) on speech recognition.
Stars: ✭ 1,684 (+2271.83%)
pocketsphinxUpdated ROS bindings to pocketsphinx
Stars: ✭ 36 (-49.3%)
bobBob is a free signal-processing and machine learning toolbox originally developed by the Biometrics group at Idiap Research Institute, in Switzerland. - Mirrored from https://gitlab.idiap.ch/bob/bob
Stars: ✭ 38 (-46.48%)
SounderAn intent recognizing algorithm to predict the intent of a given text.
Stars: ✭ 118 (+66.2%)
ml-with-audioHF's ML for Audio study group
Stars: ✭ 104 (+46.48%)
HolobotHoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.
Stars: ✭ 114 (+60.56%)
Tensorflow-Keyword-SpottingKeyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.
Stars: ✭ 27 (-61.97%)
good-speech-web-clientPractice your speech level in any language using speech recognition
Stars: ✭ 26 (-63.38%)
KontinuousspeechrecognizerA Kotlin Speech Recognizer that runs continuously and is triggered with an activation keyword
Stars: ✭ 113 (+59.15%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+15.49%)
titanium-speechUse the iOS 10 SFSpeechRecognizer API in JavaScript with Appcelerator Hyperloop.
Stars: ✭ 21 (-70.42%)
EesenThe official repository of the Eesen project
Stars: ✭ 738 (+939.44%)
LingvoLingvo
Stars: ✭ 2,361 (+3225.35%)
Annyang💬 Speech recognition for your site
Stars: ✭ 6,216 (+8654.93%)
AdaptAdapt Intent Parser
Stars: ✭ 690 (+871.83%)
KalliopeKalliope is a framework that will help you to create your own personal assistant.
Stars: ✭ 1,509 (+2025.35%)
awesome-speech-enhancementA curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
Stars: ✭ 48 (-32.39%)
VoiceBridgeVoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit
Stars: ✭ 17 (-76.06%)
AmazonSpeechTranslatorEnd-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-29.58%)