All Projects β†’ simple-obs-stt β†’ Similar Projects or Alternatives

964 Open source projects that are alternatives of or similar to simple-obs-stt

deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-7.87%)
open-speech-corpora
πŸ’Ž A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+844.94%)
Mutual labels:  tts, speech-recognition, speech-to-text, stt
opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
Stars: ✭ 21 (-76.4%)
Mutual labels:  speech, tts, speech-recognition, stt
Lingvo
Lingvo
Stars: ✭ 2,361 (+2552.81%)
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+38.2%)
Openasr
A pytorch based end2end speech recognition system.
Stars: ✭ 69 (-22.47%)
Edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+130.34%)
Dc tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: ✭ 1,017 (+1042.7%)
Mutual labels:  speech, tts, speech-to-text
scripty
Speech to text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (-84.27%)
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+341.57%)
anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (-40.45%)
Discordspeechbot
A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-60.67%)
KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-76.4%)
Deepspeech
A PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+1269.66%)
ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+101.12%)
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+450.56%)
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (+43.82%)
revai-node-sdk
Node.js SDK for the Rev AI API
Stars: ✭ 21 (-76.4%)
Tacotron asr
Speech Recognition Using Tacotron
Stars: ✭ 165 (+85.39%)
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+12429.21%)
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+171.91%)
demo vietasr
Vietnamese Speech Recognition
Stars: ✭ 22 (-75.28%)
bingspeech-api-client
Microsoft Bing Speech API client in node.js
Stars: ✭ 32 (-64.04%)
Mutual labels:  tts, speech-to-text, stt
kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-84.27%)
Spokestack Python
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (+15.73%)
speech to text
how to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-60.67%)
spokestack-android
Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!
Stars: ✭ 52 (-41.57%)
Mutual labels:  speech, tts, speech-recognition
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-35.96%)
revai-java-sdk
Rev.ai Java SDK
Stars: ✭ 16 (-82.02%)
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (-40.45%)
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+130.34%)
revai-python-sdk
Rev AI Python SDK
Stars: ✭ 35 (-60.67%)
Tensorflow Speech Recognition
πŸŽ™Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Stars: ✭ 2,118 (+2279.78%)
speech-recognition-evaluation
Evaluate results from ASR/Speech-to-Text quickly
Stars: ✭ 25 (-71.91%)
Annyang
πŸ’¬ Speech recognition for your site
Stars: ✭ 6,216 (+6884.27%)
Sonus
πŸ’¬ /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+497.75%)
leopard
On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+297.75%)
Watbot
An Android ChatBot powered by IBM Watson Services (Assistant V1, Text-to-Speech, and Speech-to-Text with Speaker Recognition) on IBM Cloud.
Stars: ✭ 64 (-28.09%)
Mutual labels:  speech, speech-to-text
Soloud
Free, easy, portable audio engine for games
Stars: ✭ 1,048 (+1077.53%)
Mutual labels:  speech, speech-to-text
Tts
Tools to convert text to speech πŸ“šπŸ’¬
Stars: ✭ 84 (-5.62%)
Mutual labels:  speech, tts
Audiomate
Python library for handling audio datasets.
Stars: ✭ 99 (+11.24%)
Mutual labels:  speech, speech-recognition
Gtts
Python library and CLI tool to interface with Google Translate's text-to-speech API
Stars: ✭ 1,303 (+1364.04%)
Mutual labels:  speech, tts
React.ai
It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (-57.3%)
Python Speech recognition
A simple example for use speech recognition baidu api with python.
Stars: ✭ 106 (+19.1%)
Mutual labels:  speech, speech-recognition
Julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Stars: ✭ 1,258 (+1313.48%)
Mutual labels:  speech, speech-recognition
Delta
DELTA is a deep learning based natural language and speech processing platform.
Stars: ✭ 1,479 (+1561.8%)
Mutual labels:  speech, speech-recognition
Durian
Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.
Stars: ✭ 111 (+24.72%)
Mutual labels:  speech, tts
Pytorch Asr
ASR with PyTorch
Stars: ✭ 124 (+39.33%)
Mutual labels:  speech, speech-recognition
Tts
Text-to-Speech for Arduino
Stars: ✭ 118 (+32.58%)
Mutual labels:  speech, tts
Voice activity detection
Voice Activity Detection based on Deep Learning & TensorFlow
Stars: ✭ 132 (+48.31%)
Mutual labels:  speech, speech-recognition
Allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (+51.69%)
Mutual labels:  speech, speech-recognition
Aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
Stars: ✭ 1,942 (+2082.02%)
Mutual labels:  speech, tts
Tts Papers
🐸 collection of TTS papers
Stars: ✭ 160 (+79.78%)
Mutual labels:  speech, tts
octopus
On-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-66.29%)
Holobot
HoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.
Stars: ✭ 114 (+28.09%)
Mutual labels:  speech, speech-recognition
Tacotron
A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model
Stars: ✭ 1,756 (+1873.03%)
Mutual labels:  speech, tts
Pytorch Kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
Stars: ✭ 2,097 (+2256.18%)
Mutual labels:  speech, speech-recognition
web-voice-processor
A library for real-time voice processing in web browsers
Stars: ✭ 69 (-22.47%)
Kerasdeepspeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+175.28%)
Mutual labels:  speech, speech-to-text
XION-ChaseCam
This is a free-to-use HTML/javascript based overlay for roleplay streamers. Basically it mimics the overlay of the AXON bodycam, but since most folks play in 3rd person, it's a ChaseCam. I've included a logo, and the html file. The html file has the css, html, and javascript all in one file for ease of editing. Goto line 81 of the html file to c…
Stars: ✭ 27 (-69.66%)
Mutual labels:  twitch, obs
1-60 of 964 similar projects