Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient. It is written in Java and includes a recognizer, a synthesizer, and a microphone capture utility. The recognizer and synthesizer rely on Google services; while this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-91.7%)
Recording-Bot
A bot built to record and transcribe audio fragments from Discord.
Stars: ✭ 22 (-99.63%)
Cheetah
On-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (-93.52%)
Athena
An open-source implementation of a sequence-to-sequence based speech processing engine
Stars: ✭ 542 (-90.82%)
download audioset
📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (-99.1%)
Subsync
Subtitle Speech Synchronizer
Stars: ✭ 379 (-93.58%)
ocaml-otr
Off-the-record (OTR) messaging protocol, purely in OCaml
Stars: ✭ 39 (-99.34%)
htk
HTK Toolkit with Linux 64-bit and Docker support
Stars: ✭ 14 (-99.76%)
speech-to-text-code-pattern
React app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-99.37%)
Libreasr
💬 An On-Premises, Streaming Speech Recognition System
Stars: ✭ 633 (-89.28%)
spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-99.12%)
Libfaceid
libfaceid is a research framework for prototyping face recognition solutions. It seamlessly integrates multiple detection, recognition, and liveness models with speech synthesis and speech recognition.
Stars: ✭ 354 (-94.01%)
Rhasspy
Offline private voice assistant for many human languages
Stars: ✭ 458 (-92.25%)
Rhino
On-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (-93.13%)
soxan
Wav2Vec for speech recognition, classification, and audio classification
Stars: ✭ 113 (-98.09%)
SOLQ
"SOLQ: Segmenting Objects by Learning Queries". SOLQ is an end-to-end instance segmentation framework with Transformer.
Stars: ✭ 159 (-97.31%)
Deepspeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+216.23%)
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (-90.99%)
CCAligner
🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.
Stars: ✭ 131 (-97.78%)
voce-browser
Voice Controlled Chromium Web Browser
Stars: ✭ 34 (-99.42%)
Alan Sdk Flutter
Alan AI Flutter SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 309 (-94.77%)
Speech recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+1.56%)
Speech recognition
A Flutter plugin to use speech recognition on iOS & Android (Swift/Java)
Stars: ✭ 302 (-94.89%)
lightning-asr
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Stars: ✭ 36 (-99.39%)
Voice Overlay Ios
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (-92.55%)
quran-align
Word-accurate timestamps for Qur'anic audio.
Stars: ✭ 139 (-97.65%)
Alan Sdk Ionic
Alan AI Ionic SDK adds a voice assistant or chatbot to your app. Supports React, Angular.
Stars: ✭ 287 (-95.14%)
speech to text
How to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-99.41%)
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (-91.16%)
Alan Sdk Android
Alan AI Android SDK adds a voice assistant or chatbot to your app. Supports Java, Kotlin.
Stars: ✭ 278 (-95.29%)
speech-transformer
Transformer implementation specialized in speech recognition tasks using PyTorch.
Stars: ✭ 40 (-99.32%)
Specaugment
An implementation of SpecAugment (introduced by Google Brain) with TensorFlow & PyTorch
Stars: ✭ 408 (-93.09%)
NLP Toolkit
Library of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (-98.44%)
Neuraldialog Cvae
TensorFlow implementation of Knowledge-Guided CVAE for dialog generation (ACL 2017). Released by Tiancheng Zhao (Tony) from the Dialog Research Center, LTI, CMU.
Stars: ✭ 279 (-95.28%)
Wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Stars: ✭ 617 (-89.55%)
Tensorflow-Keyword-Spotting
Keyword spotting using various architectures, such as a convolutional VGGNet, a 1D convolutional network, and CTC.
Stars: ✭ 27 (-99.54%)
Vosk Android Demo
Offline speech recognition for Android with Vosk library.
Stars: ✭ 271 (-95.41%)
Neural sp
End-to-end ASR/LM implementation with PyTorch
Stars: ✭ 408 (-93.09%)
Alan Sdk Cordova
Alan AI Cordova SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 269 (-95.45%)
ASVspoof PA
No description or website provided.
Stars: ✭ 22 (-99.63%)
A chronology of deep learning
Tracing back and exposing in chronological order the main ideas in the field of deep learning, to help everyone better understand the current intense research in AI.
Stars: ✭ 47 (-99.2%)
Pocketsphinx
PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop
Stars: ✭ 2,934 (-50.33%)
postchildren-desktop
👨👦👦 An E2E test visualization tool (gets along with Postman and Postwoman)
Stars: ✭ 23 (-99.61%)
Inspectit
inspectIT is the leading Open Source APM (Application Performance Management) tool for analyzing your Java (EE) applications.
Stars: ✭ 513 (-91.32%)
Tensorflowasr
⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in TensorFlow 2. Supports languages that can use characters or subwords
Stars: ✭ 400 (-93.23%)
HotVoice
Adds Speech Recognition support to AutoHotkey, via a C# DLL
Stars: ✭ 41 (-99.31%)
deepspeech.mxnet
An MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-98.61%)