StageMateStageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.
Stars: ✭ 60 (-91.08%)
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (-47.4%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (-20.95%)
Speech-RecognitionEnd-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-96.88%)
download audioset📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (-92.12%)
BrevitasBrevitas: quantization-aware training in PyTorch
Stars: ✭ 343 (-49.03%)
leon🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+1171.92%)
Voice Overlay Ios🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (-34.62%)
sepia-stt-serverSEPIA server to support open-source speech recognition via WebSocket connection.
Stars: ✭ 45 (-93.31%)
J.a.r.v.i.spython powered Intelligent System
Stars: ✭ 325 (-51.71%)
speech-to-textmixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-90.94%)
VadVoice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (-7.58%)
speech-to-text-code-patternReact app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-94.5%)
SurfboardNovoic's audio feature extraction library
Stars: ✭ 318 (-52.75%)
kosrKorean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)
Stars: ✭ 25 (-96.29%)
SpecaugmentA Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Stars: ✭ 408 (-39.38%)
NnmnkwiiLibrary to build speech synthesis systems designed for easy and fast prototyping.
Stars: ✭ 308 (-54.23%)
YouTube-Tutorials--Italian📂 Source Code for (some of) the Programming Tutorials from my Italian YouTube Channel and website ProgrammareInPython.it. This is just a small portion of the content: please visit the website for more.
Stars: ✭ 28 (-95.84%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (-22.44%)
rosechoTianbot Rosecho (Tianecho),中文语音人机交互模块,支持ROS即插即用
Stars: ✭ 28 (-95.84%)
Speech recognitionA Flutter plugin to use speech recognition on iOS & Android (Swift/Java)
Stars: ✭ 302 (-55.13%)
awesome-multimodal-mlReading list for research topics in multimodal machine learning
Stars: ✭ 3,125 (+364.34%)
Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (-39.38%)
sepia-docsDocumentation and Wiki for SEPIA. Please post your questions and bug-reports here in the issues section! Thank you :-)
Stars: ✭ 160 (-76.23%)
PysptkA python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (-55.87%)
deepspeechA PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (-93.31%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (-5.94%)
mixupspeechpro.com/
Stars: ✭ 23 (-96.58%)
Alan Sdk AndroidAlan AI Android SDK adds a voice assistant or chatbot to your app. Supports Java, Kotlin.
Stars: ✭ 278 (-58.69%)
lightning-asrModular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (-94.65%)
CtcwordbeamsearchConnectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Stars: ✭ 398 (-40.86%)
quran-alignWord-accurate timestamps for Qur'anic audio.
Stars: ✭ 139 (-79.35%)
pocketsphinxUpdated ROS bindings to pocketsphinx
Stars: ✭ 36 (-94.65%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-27.19%)
Alan Sdk CordovaAlan AI Cordova SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 269 (-60.03%)
NLP ToolkitLibrary of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (-86.33%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-41.6%)
PocketsphinxPocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop
Stars: ✭ 2,934 (+335.96%)
SpeechEnhancementCombining Weighted Multi-resolution STFT Loss and Distance Fusion to Optimize Speech Enhancement Generative Adversarial Networks
Stars: ✭ 49 (-92.72%)
soxanWav2Vec for speech recognition, classification, and audio classification
Stars: ✭ 113 (-83.21%)
Athenaan open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (-19.47%)
Mycroft PreciseA lightweight, simple-to-use, RNN wake word listener
Stars: ✭ 481 (-28.53%)
NmtpytorchSequence-to-Sequence Framework in PyTorch
Stars: ✭ 392 (-41.75%)
Awesome Speech EnhancementA tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech enhancement and make them universally accessible and useful.
Stars: ✭ 257 (-61.81%)
Tensorflow-Keyword-SpottingKeyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.
Stars: ✭ 27 (-95.99%)
pyssppython speech signal processing library
Stars: ✭ 18 (-97.33%)
SpeechTransProgressTracking the progress in end-to-end speech translation
Stars: ✭ 139 (-79.35%)
A chronology of deep learningTracing back and exposing in chronological order the main ideas in the field of deep learning, to help everyone better understand the current intense research in AI.
Stars: ✭ 47 (-93.02%)
BookLibraryBook Library of P&W Studio
Stars: ✭ 13 (-98.07%)
CheetahOn-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (-43.09%)
HotVoiceAdds Speech Recognition support to AutoHotkey, via a C# DLL
Stars: ✭ 41 (-93.91%)
Listen-Attend-Spell-v2PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
Stars: ✭ 29 (-95.69%)
deepspeech.mxnetA MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-87.82%)