ArvutajaAn Android app for voice actions in Estonian and English
Stars: ✭ 28 (+27.27%)
Kaldi Gstreamer ServerReal-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
Stars: ✭ 935 (+4150%)
sepia-docsDocumentation and Wiki for SEPIA. Please post your questions and bug-reports here in the issues section! Thank you :-)
Stars: ✭ 160 (+627.27%)
Wavenet SttAn end-to-end speech recognition system with Wavenet. Built using C++ and python.
Stars: ✭ 18 (-18.18%)
masr中文语音识别系列,读者可以借助它快速训练属于自己的中文语音识别模型,或直接使用预训练模型测试效果。
Stars: ✭ 179 (+713.64%)
VoiceBridgeVoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit
Stars: ✭ 17 (-22.73%)
KaldiioA pure python module for reading and writing kaldi ark files
Stars: ✭ 160 (+627.27%)
pocketsphinxUpdated ROS bindings to pocketsphinx
Stars: ✭ 36 (+63.64%)
Rnnt Speech RecognitionEnd-to-end speech recognition using RNN Transducers in Tensorflow 2.0
Stars: ✭ 158 (+618.18%)
ClovacallClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)
Stars: ✭ 151 (+586.36%)
ml-with-audioHF's ML for Audio study group
Stars: ✭ 104 (+372.73%)
SwiftspeechA speech recognition framework designed for SwiftUI.
Stars: ✭ 149 (+577.27%)
Speech Recognition Neural NetworkThis is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.
Stars: ✭ 148 (+572.73%)
multilingual kwsFew-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
Stars: ✭ 122 (+454.55%)
StageMateStageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.
Stars: ✭ 60 (+172.73%)
HTR-ctcPytorch implementation of HTR on IAM dataset (word or line level + CTC loss)
Stars: ✭ 15 (-31.82%)
porfirГолосовой ассистент Порфирьевич
Stars: ✭ 23 (+4.55%)
Speech-RecognitionEnd-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
Stars: ✭ 21 (-4.55%)
DlaDeep learning for audio processing
Stars: ✭ 142 (+545.45%)
VietSentiWordNet[VietSentiWordNet] A quick and simple method to find Opinion for Vietnamese text.
Stars: ✭ 26 (+18.18%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+918.18%)
apiSpeechly public API definitions and generated code
Stars: ✭ 15 (-31.82%)
AllosaurusAllosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (+513.64%)
PersephoneA tool for automatic phoneme transcription
Stars: ✭ 130 (+490.91%)
NLP ToolkitLibrary of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (+318.18%)
Alan Sdk PcfAlan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.
Stars: ✭ 128 (+481.82%)
KoLMKorean text normalization and language preparation package for LM in Kaldi-based ASR system
Stars: ✭ 46 (+109.09%)
salutejsSmartApp Framework для создания навыков семейства Виртуальных Ассистентов "Салют" на языке JavaScript
Stars: ✭ 35 (+59.09%)
Lip Reading Deeplearning🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
Stars: ✭ 1,641 (+7359.09%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (+831.82%)
Keras KaldiKeras Interface for Kaldi ASR
Stars: ✭ 124 (+463.64%)
Android-TTS-STTOne line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem
Stars: ✭ 77 (+250%)
Wer are weAttempt at tracking states of the arts and recent results (bibliography) on speech recognition.
Stars: ✭ 1,684 (+7554.55%)
SubsyncSubtitle Speech Synchronizer
Stars: ✭ 379 (+1622.73%)
Project aliasAlias is a teachable “parasite” that is designed to give users more control over their smart assistants, both when it comes to customisation and privacy. Through a simple app the user can train Alias to react on a custom wake-word/sound, and once trained, Alias can take control over your home assistant by activating it for you.
Stars: ✭ 1,577 (+7068.18%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+20504.55%)
SounderAn intent recognizing algorithm to predict the intent of a given text.
Stars: ✭ 118 (+436.36%)
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (+1509.09%)
idear🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (+281.82%)
mixupspeechpro.com/
Stars: ✭ 23 (+4.55%)
NonautoreggenprogressTracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (+436.36%)
OCROptical character recognition Using Deep Learning
Stars: ✭ 25 (+13.64%)
telltimeiOS application to tell the time in the British way 🇬🇧⏰
Stars: ✭ 49 (+122.73%)
HolobotHoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.
Stars: ✭ 114 (+418.18%)
vi-rsVietnamese Input Method library
Stars: ✭ 69 (+213.64%)
KontinuousspeechrecognizerA Kotlin Speech Recognizer that runs continuously and is triggered with an activation keyword
Stars: ✭ 113 (+413.64%)
Ml RoadMachine Learning Resources, Practice and Research
Stars: ✭ 1,776 (+7972.73%)
KhronosThe open source intelligent personal assistant
Stars: ✭ 25 (+13.64%)