All Projects → speech_to_text → Similar Projects or Alternatives

1505 Open source projects that are alternatives of or similar to speech_to_text

anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (+51.43%)
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (+51.43%)
Edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+485.71%)
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+154.29%)
Tacotron asr
Speech Recognition Using Tacotron
Stars: ✭ 165 (+371.43%)
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (+265.71%)
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+485.71%)
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+591.43%)
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-60%)
speechrec
a simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-48.57%)
spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (+48.57%)
Mutual labels:  speech, speech-recognition, speech-api
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+1300%)
Openasr
A pytorch based end2end speech recognition system.
Stars: ✭ 69 (+97.14%)
Lingvo
Lingvo
Stars: ✭ 2,361 (+6645.71%)
Discordspeechbot
A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (+0%)
deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (+134.29%)
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+31760%)
ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+411.43%)
Annyang
💬 Speech recognition for your site
Stars: ✭ 6,216 (+17660%)
KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-40%)
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+1022.86%)
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-22.86%)
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+1420%)
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+251.43%)
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (+62.86%)
Deepspeech
A PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+3382.86%)
Voice activity detection
Voice Activity Detection based on Deep Learning & TensorFlow
Stars: ✭ 132 (+277.14%)
Mutual labels:  speech, speech-recognition
Allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (+285.71%)
Mutual labels:  speech, speech-recognition
UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
Stars: ✭ 224 (+540%)
Mutual labels:  speech, speech-recognition
Pytorch Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+5891.43%)
Mutual labels:  speech, speech-recognition
vosk-asterisk
Speech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (+48.57%)
Speechtotext Websockets Javascript
SDK & Sample to do speech recognition using websockets in Javascript
Stars: ✭ 191 (+445.71%)
Mutual labels:  speech, speech-recognition
End2end Asr Pytorch
End-to-End Automatic Speech Recognition on PyTorch
Stars: ✭ 175 (+400%)
Mutual labels:  speech, speech-recognition
Kerasdeepspeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+600%)
Mutual labels:  speech, speech-to-text
Aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Stars: ✭ 1,942 (+5448.57%)
Mutual labels:  ffmpeg, speech
idear
🎙️ Handsfree Audio Development Interface
Stars: ✭ 84 (+140%)
Mutual labels:  speech, speech-recognition
Thumbnail
Thumbnail for a given video using FFMpeg
Stars: ✭ 96 (+174.29%)
Mutual labels:  composer, ffmpeg
megs
A merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-40%)
Airflow Toolkit
Any Airflow project day 1, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested data pipelines(DAGs) 🖥 >> [ 🚀, 🚢 ]
Stars: ✭ 51 (+45.71%)
Mutual labels:  composer, google-cloud
TF-Speech-Recognition-Challenge-Solution
Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.
Stars: ✭ 58 (+65.71%)
Mutual labels:  speech, speech-recognition
revai-python-sdk
Rev AI Python SDK
Stars: ✭ 35 (+0%)
speech-to-text-code-pattern
React app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (+5.71%)
Pytorch Asr
ASR with PyTorch
Stars: ✭ 124 (+254.29%)
Mutual labels:  speech, speech-recognition
Php Ffmpeg Video Streaming
📼 Package media content for online streaming(DASH and HLS) using FFmpeg
Stars: ✭ 246 (+602.86%)
Mutual labels:  ffmpeg, google-cloud
leopard
On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+911.43%)
revai-java-sdk
Rev.ai Java SDK
Stars: ✭ 16 (-54.29%)
web-voice-processor
A library for real-time voice processing in web browsers
Stars: ✭ 69 (+97.14%)
speech-recognition-evaluation
Evaluate results from ASR/Speech-to-Text quickly
Stars: ✭ 25 (-28.57%)
React.ai
It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (+8.57%)
speechmatics-python
Python library and CLI for Speechmatics
Stars: ✭ 24 (-31.43%)
web-speech-cognitive-services
Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (+0%)
DeepSpeech-API
The code enables users to use Mozilla's Deep Speech model over the Web Browser.
Stars: ✭ 31 (-11.43%)
AmazonSpeechTranslator
End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (+42.86%)
Inimesed
An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (+85.71%)
octopus
On-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-14.29%)
rnnt decoder cuda
An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (+71.43%)
PCPM
Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (-40%)
scripty
Speech to text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (-60%)
kaldi-long-audio-alignment
Long audio alignment using Kaldi
Stars: ✭ 21 (-40%)
Unity live caption
Use Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Stars: ✭ 26 (-25.71%)
1-60 of 1505 similar projects