All Projects → rnnt_decoder_cuda → Similar Projects or Alternatives

414 Open source projects that are alternatives of or similar to rnnt_decoder_cuda

Tensorflow end2end speech recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Stars: ✭ 305 (+408.33%)
web-speech-cognitive-services
Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-41.67%)
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (+48.33%)
leopard
On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+490%)
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (-11.67%)
Vosk Api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+2161.67%)
Tensorflow Ctc Speech Recognition
Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
Stars: ✭ 127 (+111.67%)
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (+113.33%)
Voice Overlay Android
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 189 (+215%)
anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (-11.67%)
megs
A merged version of multiple open-source German speech datasets.
Stars: ✭ 21 (-65%)
revai-java-sdk
Rev.ai Java SDK
Stars: ✭ 16 (-73.33%)
Nativescript Speech Recognition
💬 Speech to text, using the awesome engines readily available on the device.
Stars: ✭ 72 (+20%)
Mongolian Speech Recognition
Mongolian speech recognition with PyTorch
Stars: ✭ 97 (+61.67%)
Deepspeech
A PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+1931.67%)
ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+198.33%)
Kalliope
Kalliope is a framework that will help you to create your own personal assistant.
Stars: ✭ 1,509 (+2415%)
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+241.67%)
Speech And Text
Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)
Stars: ✭ 102 (+70%)
revai-python-sdk
Rev AI Python SDK
Stars: ✭ 35 (-41.67%)
Tacotron asr
Speech Recognition Using Tacotron
Stars: ✭ 165 (+175%)
Tensorflow Speech Recognition
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
Stars: ✭ 2,118 (+3430%)
Naomi
The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!
Stars: ✭ 171 (+185%)
Rnn ctc
Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.
Stars: ✭ 220 (+266.67%)
Edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+241.67%)
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+303.33%)
web-voice-processor
A library for real-time voice processing in web browsers
Stars: ✭ 69 (+15%)
speech-recognition-evaluation
Evaluate results from ASR/Speech-to-Text quickly
Stars: ✭ 25 (-58.33%)
Wav2letter
Speech Recognition model based off of FAIR research paper built using Pytorch.
Stars: ✭ 78 (+30%)
Patter
speech-to-text in pytorch
Stars: ✭ 71 (+18.33%)
B.e.n.j.i.
B.E.N.J.I.- The Impossible Missions Force's digital assistant
Stars: ✭ 83 (+38.33%)
Deepspeech Websocket Server
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
Stars: ✭ 79 (+31.67%)
Openseq2seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+2196.67%)
Openasr
A pytorch based end2end speech recognition system.
Stars: ✭ 69 (+15%)
Self Supervised Speech Recognition
speech to text with self-supervised learning based on wav2vec 2.0 framework
Stars: ✭ 106 (+76.67%)
Wav2letter.pytorch
A fully convolution-network for speech-to-text, built on pytorch.
Stars: ✭ 104 (+73.33%)
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+18485%)
Spokestack Python
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (+71.67%)
Go Astideepspeech
Golang bindings for Mozilla's DeepSpeech speech-to-text library
Stars: ✭ 137 (+128.33%)
Awesome Ai Services
An overview of the AI-as-a-service landscape
Stars: ✭ 133 (+121.67%)
KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-65%)
Dragonfire
the open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (+1766.67%)
React.ai
It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux
Stars: ✭ 38 (-36.67%)
Hey Jetson
Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.
Stars: ✭ 161 (+168.33%)
octopus
On-device speech-to-index engine powered by deep learning.
Stars: ✭ 30 (-50%)
Speech To Text Russian
Проект для распознавания речи на русском языке на основе pykaldi.
Stars: ✭ 151 (+151.67%)
Vosk
VOSK Speech Recognition Toolkit
Stars: ✭ 182 (+203.33%)
Deepspeech Server
A testing server for a speech to text service based on mozilla deepspeech
Stars: ✭ 176 (+193.33%)
Automatic Speech Recognition
🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)
Stars: ✭ 192 (+220%)
Zzz Retired openstt
RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (+143.33%)
Kaldi Active Grammar
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
Stars: ✭ 196 (+226.67%)
K6nele
An Android app that offers speech-to-text services and user interfaces to other apps
Stars: ✭ 196 (+226.67%)
Nemo
NeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+6041.67%)
Dictate.js
A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.
Stars: ✭ 195 (+225%)
speechrec
a simple speech recognition app using the Web Speech API Interfaces
Stars: ✭ 18 (-70%)
Audio Pretrained Model
A collection of Audio and Speech pre-trained models.
Stars: ✭ 61 (+1.67%)
Angle
⦠ Angle: new speakable syntax for python 💡
Stars: ✭ 61 (+1.67%)
Speechrecognizerbutton
UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.
Stars: ✭ 144 (+140%)
Lingvo
Lingvo
Stars: ✭ 2,361 (+3835%)
Speech recognition with tensorflow
Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.
Stars: ✭ 253 (+321.67%)
1-60 of 414 similar projects