All Projects → speech-to-text → Similar Projects or Alternatives

172 Open source projects that are alternatives of or similar to speech-to-text

speech-to-text-code-pattern
React app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (+164.29%)
kaldi-long-audio-alignment
Long audio alignment using Kaldi
Stars: ✭ 21 (+50%)
Mutual labels:  speech-to-text, transcription
speechmatics-python
Python library and CLI for Speechmatics
Stars: ✭ 24 (+71.43%)
Mutual labels:  speech-to-text, transcription
scription
An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech
Stars: ✭ 46 (+228.57%)
Mutual labels:  speech-to-text, transcription
leopard
On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+2428.57%)
Mutual labels:  speech-to-text, transcription
kaldi helpers
🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
Stars: ✭ 13 (-7.14%)
Mutual labels:  speech-to-text, transcription
simple diarizer
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
Stars: ✭ 26 (+85.71%)
Mutual labels:  speech-to-text, transcription
revai-python-sdk
Rev AI Python SDK
Stars: ✭ 35 (+150%)
Mutual labels:  speech-to-text
benchmarkstt
Open Source AI Benchmarking toolkit for benchmarking speech to text services
Stars: ✭ 43 (+207.14%)
Mutual labels:  speech-to-text
watson-multimedia-analyzer
WARNING: This repository is no longer maintained ⚠️ This repository will not be updated. The repository will be kept available in read-only mode. A Node app that use Watson Visual Recognition, Speech to Text, Natural Language Understanding, and Tone Analyzer to enrich media files.
Stars: ✭ 23 (+64.29%)
Mutual labels:  watson-speech
Kerasdeepspeech
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
Stars: ✭ 245 (+1650%)
Mutual labels:  speech-to-text
wiki2ssml
Wiki2SSML provides the WikiVoice markup language used for fine-tuning synthesised voice.
Stars: ✭ 31 (+121.43%)
Mutual labels:  ibm-watson-speech
PCPM
Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
Stars: ✭ 21 (+50%)
Mutual labels:  speech-to-text
megs
A merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (+50%)
Mutual labels:  speech-to-text
Inimesed
An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (+364.29%)
Mutual labels:  speech-to-text
Speech recognition with tensorflow
Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Stars: ✭ 253 (+1707.14%)
Mutual labels:  speech-to-text
KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (+50%)
Mutual labels:  speech-to-text
scripty
Speech to text bot for Discord using Mozilla's DeepSpeech
Stars: ✭ 14 (+0%)
Mutual labels:  speech-to-text
Go Astibob
Golang framework to build an AI that can understand and speak back to you, and everything else you want
Stars: ✭ 222 (+1485.71%)
Mutual labels:  speech-to-text
AmazonSpeechTranslator
End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (+257.14%)
Mutual labels:  speech-to-text
octopus
On-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (+114.29%)
Mutual labels:  speech-to-text
Edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+1364.29%)
Mutual labels:  speech-to-text
K6nele
An Android app that offers speech-to-text services and user interfaces to other apps
Stars: ✭ 196 (+1300%)
Mutual labels:  speech-to-text
revai-java-sdk
Rev.ai Java SDK
Stars: ✭ 16 (+14.29%)
Mutual labels:  speech-to-text
Lingvo
Lingvo
Stars: ✭ 2,361 (+16764.29%)
Mutual labels:  speech-to-text
Automatic Speech Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
Stars: ✭ 192 (+1271.43%)
Mutual labels:  speech-to-text
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+1364.29%)
Mutual labels:  speech-to-text
Watson-Unity-ARKit
# WARNING: This repository is no longer maintained ⚠️ This repository will not be updated. The repository will be kept available in read-only mode.
Stars: ✭ 24 (+71.43%)
Mutual labels:  watson-speech
speechrec
a simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (+28.57%)
Mutual labels:  speech-to-text
parlatype
GNOME audio player for transcription
Stars: ✭ 151 (+978.57%)
Mutual labels:  transcription
Tensorflow Speech Recognition
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Stars: ✭ 2,118 (+15028.57%)
Mutual labels:  speech-to-text
anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (+278.57%)
Mutual labels:  speech-to-text
speech-recognition-evaluation
Evaluate results from ASR/Speech-to-Text quickly
Stars: ✭ 25 (+78.57%)
Mutual labels:  speech-to-text
Chinese-automatic-speech-recognition
Chinese speech recognition
Stars: ✭ 147 (+950%)
Mutual labels:  speech-to-text
LastSecondSlides
Use the Google speech-to-text API to generate presentation slides as you talk!
Stars: ✭ 32 (+128.57%)
Mutual labels:  google-speech
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+535.71%)
Mutual labels:  speech-to-text
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+1628.57%)
Mutual labels:  speech-to-text
DeepSpeech-API
The code enables users to use Mozilla's Deep Speech model over the Web Browser.
Stars: ✭ 31 (+121.43%)
Mutual labels:  speech-to-text
Stt
🐸STT - a deep learning toolkit for Speech-to-Text, battle-tested in research and production
Stars: ✭ 197 (+1307.14%)
Mutual labels:  speech-to-text
React.ai
It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (+171.43%)
Mutual labels:  speech-to-text
Rnn ctc
Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (+1471.43%)
Mutual labels:  speech-to-text
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+5907.14%)
Mutual labels:  speech-to-text
Kaldi Active Grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (+1300%)
Mutual labels:  speech-to-text
web-voice-processor
A library for real-time voice processing in web browsers
Stars: ✭ 69 (+392.86%)
Mutual labels:  speech-to-text
Dictate.js
A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Stars: ✭ 195 (+1292.86%)
Mutual labels:  speech-to-text
rnnt decoder cuda
An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (+328.57%)
Mutual labels:  speech-to-text
Expressive tacotron
Tensorflow Implementation of Expressive Tacotron
Stars: ✭ 192 (+1271.43%)
Mutual labels:  speech-to-text
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (+278.57%)
Mutual labels:  speech-to-text
Voice Overlay Android
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+1250%)
Mutual labels:  speech-to-text
glaemscribe
Glaemscribe, the tolkienian languages/writings transcription engine.
Stars: ✭ 29 (+107.14%)
Mutual labels:  transcription
cloud-speech-and-vision-demos
A set of demo applications that make use of google speech, nlp and vision apis based in angular2
Stars: ✭ 35 (+150%)
Mutual labels:  google-speech
Speaker adapted tts
Making a TTS model with 1 minute of speech samples within 10 minutes
Stars: ✭ 183 (+1207.14%)
Mutual labels:  speech-to-text
Vosk
VOSK Speech Recognition Toolkit
Stars: ✭ 182 (+1200%)
Mutual labels:  speech-to-text
Deepspeech Server
A testing server for a speech to text service based on mozilla deepspeech
Stars: ✭ 176 (+1157.14%)
Mutual labels:  speech-to-text
web-speech-cognitive-services
Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (+150%)
Mutual labels:  speech-to-text
ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+1178.57%)
Mutual labels:  speech-to-text
Naomi
The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (+1121.43%)
Mutual labels:  speech-to-text
Tacotron asr
Speech Recognition Using Tacotron
Stars: ✭ 165 (+1078.57%)
Mutual labels:  speech-to-text
asr24
24-hour Automatic Speech Recognition
Stars: ✭ 27 (+92.86%)
Mutual labels:  transcription
Unity live caption
Use Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!
Stars: ✭ 26 (+85.71%)
Mutual labels:  speech-to-text
1-60 of 172 similar projects