speechreca simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-97.33%)
open-speech-corpora💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+24.96%)
SincnetSincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+13.52%)
Speechbrain.github.ioThe SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (-64.04%)
Keras SincnetKeras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-93.02%)
UspeechSpeech recognition toolkit for the arduino
Stars: ✭ 448 (-33.43%)
UniSpeechUniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (-66.72%)
torchsubbandPytorch implementation of subband decomposition
Stars: ✭ 63 (-90.64%)
NonautoreggenprogressTracking the progress in non-autoregressive generation (translation, transcription, etc.)
Stars: ✭ 118 (-82.47%)
awesome-keyword-spottingThis repository is a curated list of awesome Speech Keyword Spotting (Wake-Up Word Detection).
Stars: ✭ 150 (-77.71%)
Speech-BackbonesThis is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stars: ✭ 205 (-69.54%)
QuantumSpeech-QCNNIEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition
Stars: ✭ 71 (-89.45%)
Formant AnalyzeriOS application for finding formants in spoken sounds
Stars: ✭ 43 (-93.61%)
Zzz Retired opensttRETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (-78.31%)
PnccA implementation of Power Normalized Cepstral Coefficients: PNCC
Stars: ✭ 40 (-94.06%)
UHV-OTS-SpeechA data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
Stars: ✭ 94 (-86.03%)
spokestack-iosSpokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-95.99%)
scim[wip]Speech recognition tool-box written by Nim. Based on Arraymancer.
Stars: ✭ 17 (-97.47%)
Libfaceidlibfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.
Stars: ✭ 354 (-47.4%)
PaseProblem Agnostic Speech Encoder
Stars: ✭ 348 (-48.29%)
DeepspeechDeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+2675.63%)
CtcdecoderConnectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Stars: ✭ 529 (-21.4%)
Asrt speechrecognitionA Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (+634.47%)
Alan Sdk IosAlan AI iOS SDK adds a voice assistant or chatbot to your app. Supports Swift, Objective-C.
Stars: ✭ 318 (-52.75%)
EspnetEnd-to-End Speech Processing Toolkit
Stars: ✭ 4,533 (+573.55%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (-20.95%)
Alan Sdk FlutterAlan AI Flutter SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 309 (-54.09%)
BrevitasBrevitas: quantization-aware training in PyTorch
Stars: ✭ 343 (-49.03%)
Voice Overlay Ios🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (-34.62%)
J.a.r.v.i.spython powered Intelligent System
Stars: ✭ 325 (-51.71%)
VadVoice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (-7.58%)
SurfboardNovoic's audio feature extraction library
Stars: ✭ 318 (-52.75%)
SpecaugmentA Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
Stars: ✭ 408 (-39.38%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (-22.44%)
NnmnkwiiLibrary to build speech synthesis systems designed for easy and fast prototyping.
Stars: ✭ 308 (-54.23%)
RhinoOn-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (-39.67%)
Speech recognitionA Flutter plugin to use speech recognition on iOS & Android (Swift/Java)
Stars: ✭ 302 (-55.13%)
Neural spEnd-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (-39.38%)
Pocketsphinx PythonPython interface to CMU Sphinxbase and Pocketsphinx libraries
Stars: ✭ 298 (-55.72%)
PysptkA python wrapper for Speech Signal Processing Toolkit (SPTK).
Stars: ✭ 297 (-55.87%)
Speech Emotion AnalyzerThe neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (-5.94%)
WenetProduction First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (-8.32%)
Tensorflowasr⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (-40.56%)
Alan Sdk IonicAlan AI Ionic SDK adds a voice assistant or chatbot to your app. Supports React, Angular.
Stars: ✭ 287 (-57.36%)
Alan Sdk AndroidAlan AI Android SDK adds a voice assistant or chatbot to your app. Supports Java, Kotlin.
Stars: ✭ 278 (-58.69%)
CtcwordbeamsearchConnectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Stars: ✭ 398 (-40.86%)
Vosk ServerWebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Stars: ✭ 277 (-58.84%)
Java Speech ApiThe J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-27.19%)
Vosk Android DemoOffline speech recognition for Android with Vosk library.
Stars: ✭ 271 (-59.73%)
Alan Sdk CordovaAlan AI Cordova SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 269 (-60.03%)
Awesome KaldiThis is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-41.6%)
PocketsphinxPocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop
Stars: ✭ 2,934 (+335.96%)