All Projects → Speech_recognition → Similar Projects or Alternatives

984 Open source projects that are alternatives of or similar to Speech_recognition

Audio Pretrained Model
A collection of Audio and Speech pre-trained models.
Stars: ✭ 61 (-98.98%)
deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-98.63%)
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (-91.13%)
Deep-learning-And-Paper
【仅作为交流学习使用】机器智能--相关书目及经典论文包括AutoML、情感分类、语音识别、声纹识别、语音合成实验代码等
Stars: ✭ 62 (-98.97%)
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (-98.52%)
rnnt decoder cuda
An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (-99%)
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (-91.3%)
Unity live caption
Use Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Stars: ✭ 26 (-99.57%)
demo vietasr
Vietnamese Speech Recognition
Stars: ✭ 22 (-99.63%)
htk
HTK Toolkit with Linux 64 bit and Docker support
Stars: ✭ 14 (-99.77%)
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (-91.83%)
octopus
On-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-99.5%)
React.ai
It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (-99.37%)
speechrec
a simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-99.7%)
speech-recognition-evaluation
Evaluate results from ASR/Speech-to-Text quickly
Stars: ✭ 25 (-99.58%)
scripty
Speech to text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (-99.77%)
Chinese-automatic-speech-recognition
Chinese speech recognition
Stars: ✭ 147 (-97.55%)
deep avsr
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (-98.27%)
DeepSpeech-API
The code enables users to use Mozilla's Deep Speech model over the Web Browser.
Stars: ✭ 31 (-99.48%)
speech-recognition
SDKs and docs for Skit's speech to text service
Stars: ✭ 20 (-99.67%)
speech-to-text
mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-98.98%)
kim-voice-assistant
Kim,你的私人语音助理。
Stars: ✭ 70 (-98.83%)
leon
🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+42.69%)
Tensorflowasr
⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (-93.33%)
Free Spoken Digit Dataset
A free audio dataset of spoken digits. Think MNIST for audio.
Stars: ✭ 396 (-93.4%)
Mutual labels:  speech-recognition, audio
Speech To Text Benchmark
speech to text benchmark framework
Stars: ✭ 481 (-91.98%)
web-voice-processor
A library for real-time voice processing in web browsers
Stars: ✭ 69 (-98.85%)
revai-java-sdk
Rev.ai Java SDK
Stars: ✭ 16 (-99.73%)
KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-99.65%)
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (-99.12%)
web-speech-cognitive-services
Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-99.42%)
speechmatics-python
Python library and CLI for Speechmatics
Stars: ✭ 24 (-99.6%)
AmazonSpeechTranslator
End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-99.17%)
ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (-97.02%)
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-99.77%)
PCPM
Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-99.65%)
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (-85.98%)
Inimesed
An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (-98.92%)
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-99.55%)
kaldi-long-audio-alignment
Long audio alignment using Kaldi
Stars: ✭ 21 (-99.65%)
revai-node-sdk
Node.js SDK for the Rev AI API
Stars: ✭ 21 (-99.65%)
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (-96.58%)
voce-browser
Voice Controlled Chromium Web Browser
Stars: ✭ 34 (-99.43%)
deepspeech
A PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (-99.25%)
SpeechToText
Speech To Text in Android
Stars: ✭ 53 (-99.12%)
speech to text
how to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-99.42%)
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-97.95%)
musicologist
Music advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-99.68%)
Phonetisaurus
Phonetisaurus G2P
Stars: ✭ 277 (-95.38%)
vosk-asterisk
Speech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-99.13%)
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (-93.45%)
Cheetah
On-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (-93.62%)
Rhino
On-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (-93.23%)
Deepspeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+211.39%)
Voice Overlay Ios
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (-92.67%)
revai-python-sdk
Rev AI Python SDK
Stars: ✭ 35 (-99.42%)
leopard
On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (-94.1%)
speech-to-text-code-pattern
React app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-99.38%)
Tensorflow end2end speech recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Stars: ✭ 305 (-94.92%)
Asrt speechrecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (-17.6%)
1-60 of 984 similar projects