awesome-keyword-spottingThis repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (+22.95%)
PorcupineOn-device wake word detection powered by deep learning.
Stars: ✭ 2,606 (+2036.07%)
web-voice-processorA library for real-time voice processing in web browsers
Stars: ✭ 69 (-43.44%)
Multi-Hotword SpottingWon't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?
Stars: ✭ 31 (-74.59%)
Tensorflow Speech Recognition🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Stars: ✭ 2,118 (+1636.07%)
CidlibThe CIDLib general purpose C++ development environment
Stars: ✭ 179 (+46.72%)
TF-Speech-Recognition-Challenge-SolutionSource code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.
Stars: ✭ 58 (-52.46%)
ZerothKaldi-based Korean ASR (한국어 음성인식) open-source project
Stars: ✭ 248 (+103.28%)
LingvoLingvo
Stars: ✭ 2,361 (+1835.25%)
idear🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (-31.15%)
Asr EvaluationPython module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate).
Stars: ✭ 190 (+55.74%)
obviA Polymer 3+ webcomponent / button for doing speech recognition
Stars: ✭ 54 (-55.74%)
VoskVOSK Speech Recognition Toolkit
Stars: ✭ 182 (+49.18%)
Speech recognition with tensorflowImplementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Stars: ✭ 253 (+107.38%)
leopardOn-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+190.16%)
NaomiThe Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (+40.16%)
Tacotron asrSpeech Recognition Using Tacotron
Stars: ✭ 165 (+35.25%)
KaldiioA pure python module for reading and writing kaldi ark files
Stars: ✭ 160 (+31.15%)
LibFewShotLibFewShot: A Comprehensive Library for Few-shot Learning.
Stars: ✭ 629 (+415.57%)
Rnn ctcRecurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (+80.33%)
Rnnt Speech RecognitionEnd-to-end speech recognition using RNN Transducers in Tensorflow 2.0
Stars: ✭ 158 (+29.51%)
Dictate.jsA small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Stars: ✭ 195 (+59.84%)
UHV-OTS-SpeechA data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (-22.95%)
KospeechOpen-Source Toolkit for End-to-End Korean Automatic Speech Recognition.
Stars: ✭ 190 (+55.74%)
SCL📄 Spatial Contrastive Learning for Few-Shot Classification (ECML/PKDD 2021).
Stars: ✭ 42 (-65.57%)
Voice Overlay Android🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+54.92%)
few-shot-segmentationPyTorch implementation of 'Squeeze and Excite' Guided Few Shot Segmentation of Volumetric Scans
Stars: ✭ 78 (-36.07%)
anycontrolVoice control for your websites and applications
Stars: ✭ 53 (-56.56%)
Deepspeech ServerA testing server for a speech to text service based on mozilla deepspeech
Stars: ✭ 176 (+44.26%)
Cn2an📦 快速转化「中文数字」和「阿拉伯数字」~ (最新特性:分数,日期、温度等转化)
Stars: ✭ 249 (+104.1%)
Kaldi OnnxKaldi model converter to ONNX
Stars: ✭ 174 (+42.62%)
CaptionThis"Caption This" is an iOS app that adds real-time captions to videos for Instagram Stories
Stars: ✭ 12 (-90.16%)
Speechbrain.github.ioThe SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+98.36%)
Pytorch Kaldipytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+1618.85%)
Hey JetsonDeep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Stars: ✭ 161 (+31.97%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+2920.49%)
good-speech-web-clientPractice your speech level in any language using speech recognition
Stars: ✭ 26 (-78.69%)
Py Kaldi AsrSome simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Stars: ✭ 156 (+27.87%)
DragonflySpeech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx
Stars: ✭ 209 (+71.31%)
ClovacallClovaCall dataset and Pytorch LAS baseline code (Interspeech 2020)
Stars: ✭ 151 (+23.77%)
Speech To Text RussianПроект для распознавания речи на русском языке на основе pykaldi.
Stars: ✭ 151 (+23.77%)
megsA merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-82.79%)
EdgedictWorking online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+68.03%)
SwiftspeechA speech recognition framework designed for SwiftUI.
Stars: ✭ 149 (+22.13%)
Speech Recognition Neural NetworkThis is the end-to-end Speech Recognition neural network, deployed in Keras. This was my final project for Artificial Intelligence Nanodegree @Udacity.
Stars: ✭ 148 (+21.31%)
Kaldi Active GrammarPython Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (+60.66%)