The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (-91.7%)

Mutual labels: speech-recognition

Cheetah

On-device streaming speech-to-text engine powered by deep learning

Stars: ✭ 383 (-93.52%)

Mutual labels: speech-recognition

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

Stars: ✭ 542 (-90.82%)

Mutual labels: speech-recognition

Subsync

Subtitle Speech Synchronizer

Stars: ✭ 379 (-93.58%)

Mutual labels: speech-recognition

Speech To Text Benchmark

speech to text benchmark framework

Stars: ✭ 481 (-91.86%)

Mutual labels: speech-recognition

Libreasr

💬 An On-Premises, Streaming Speech Recognition System

Stars: ✭ 633 (-89.28%)

Mutual labels: speech-recognition

Libfaceid

libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.

Stars: ✭ 354 (-94.01%)

Mutual labels: speech-recognition

Rhasspy

Offline private voice assistant for many human languages

Stars: ✭ 458 (-92.25%)

Mutual labels: speech-recognition

Deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Stars: ✭ 18,680 (+216.23%)

Mutual labels: speech-recognition

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (-90.99%)

Mutual labels: speech-recognition

Speech Demo

语音api示例

Stars: ✭ 454 (-92.31%)

Mutual labels: speech-recognition

Textspotter

Stars: ✭ 323 (-94.53%)

Mutual labels: end-to-end

Alan Sdk Ios

Alan AI iOS SDK adds a voice assistant or chatbot to your app. Supports Swift, Objective-C.

Stars: ✭ 318 (-94.62%)

Mutual labels: speech-recognition

Speech recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Stars: ✭ 5,999 (+1.56%)

Mutual labels: speech-recognition

Vad

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stars: ✭ 622 (-89.47%)

Mutual labels: speech-recognition

Ctcdecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.

Stars: ✭ 529 (-91.04%)

Mutual labels: speech-recognition

Uspeech

Speech recognition toolkit for the arduino

Stars: ✭ 448 (-92.42%)

Mutual labels: speech-recognition

Alan Sdk Flutter

Alan AI Flutter SDK adds a voice assistant or chatbot to your app.

Stars: ✭ 309 (-94.77%)

Mutual labels: speech-recognition

Speech recognition

A Flutter plugin to use speech recognition on iOS & Android (Swift/Java)

Stars: ✭ 302 (-94.89%)

Mutual labels: speech-recognition

Voice Overlay Ios

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 440 (-92.55%)

Mutual labels: speech-recognition

Pocketsphinx Python

Python interface to CMU Sphinxbase and Pocketsphinx libraries

Stars: ✭ 298 (-94.96%)

Mutual labels: speech-recognition

Alan Sdk Ionic

Alan AI Ionic SDK adds a voice assistant or chatbot to your app. Supports React, Angular.

Stars: ✭ 287 (-95.14%)

Mutual labels: speech-recognition

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (-91.16%)

Mutual labels: speech-recognition

Asrt speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Stars: ✭ 4,943 (-16.32%)

Mutual labels: speech-recognition

Wire Ios

📱 Wire for iOS (iPhone and iPad)

Stars: ✭ 3,079 (-47.88%)

Mutual labels: end-to-end

Alan Sdk Android

Alan AI Android SDK adds a voice assistant or chatbot to your app. Supports Java, Kotlin.

Stars: ✭ 278 (-95.29%)

Mutual labels: speech-recognition

Specaugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

Stars: ✭ 408 (-93.09%)

Mutual labels: speech-recognition

Vosk Server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

Stars: ✭ 277 (-95.31%)

Mutual labels: speech-recognition

Neuraldialog Cvae

Tensorflow Implementation of Knowledge-Guided CVAE for dialog generation ACL 2017. It is released by Tiancheng Zhao (Tony) from Dialog Research Center, LTI, CMU

Stars: ✭ 279 (-95.28%)

Mutual labels: end-to-end

Wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 617 (-89.55%)

Mutual labels: speech-recognition

Speech Denoising Wavenet

A neural network for end-to-end speech denoising

Stars: ✭ 516 (-91.26%)

Mutual labels: end-to-end

Rhino

On-device speech-to-intent engine powered by deep learning

Stars: ✭ 406 (-93.13%)

Mutual labels: speech-recognition

1-60 of 367 similar projects

›

next*5