986 open-source projects that are alternatives to, or similar to, Automatic Speech Recognition

KeenASR-Android-PoC
A proof-of-concept app using the KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-89.06%)
react-native-spokestack
Spokestack: give your React Native app a voice interface!
Stars: ✭ 53 (-72.4%)
web-speech-cognitive-services
Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-81.77%)
speechmatics-python
Python library and CLI for Speechmatics
Stars: ✭ 24 (-87.5%)
AmazonSpeechTranslator
End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.
Stars: ✭ 50 (-73.96%)
ASR-Audio-Data-Links
A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (-6.77%)
kaldi ag training
Docker image and scripts for training fine-tuned or completely personal Kaldi speech models, particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-92.71%)
open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Stars: ✭ 841 (+338.02%)
Inimesed
An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (-66.15%)
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-85.94%)
kaldi-long-audio-alignment
Long audio alignment using Kaldi
Stars: ✭ 21 (-89.06%)
revai-node-sdk
Node.js SDK for the Rev AI API
Stars: ✭ 21 (-89.06%)
wav2vec2-live
Live speech recognition using Facebook's wav2vec 2.0 model.
Stars: ✭ 205 (+6.77%)
voce-browser
Voice Controlled Chromium Web Browser
Stars: ✭ 34 (-82.29%)
deepspeech
A PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (-76.56%)
speech-to-text
Mixed-lingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-68.23%)
speech to text
How to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-81.77%)
leon
🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+4358.33%)
htk
HTK Toolkit with Linux 64 bit and Docker support
Stars: ✭ 14 (-92.71%)
Speech To Text Russian
A project for Russian speech recognition based on pykaldi.
Stars: ✭ 151 (-21.35%)
vosk-asterisk
Speech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-72.92%)
Wav2letter.pytorch
A fully convolutional network for speech-to-text, built on PyTorch.
Stars: ✭ 104 (-45.83%)
Tensorflow end2end speech recognition
End-to-end speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)
Stars: ✭ 305 (+58.85%)
Cppflow
Run TensorFlow models in C++ without installation and without Bazel
Stars: ✭ 357 (+85.94%)
Phonetisaurus
Phonetisaurus G2P
Stars: ✭ 277 (+44.27%)
Ctcwordbeamsearch
Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.
Stars: ✭ 398 (+107.29%)
Awesome Kaldi
This is a list of features, scripts, blogs, and resources for making better use of Kaldi (http://kaldi-asr.org/)
Stars: ✭ 393 (+104.69%)
Rhino
On-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (+111.46%)
leopard
On-device speech-to-text engine powered by deep learning
Stars: ✭ 354 (+84.38%)
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+171.88%)
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+155.21%)
Awesome Bert Nlp
A curated list of NLP resources focused on BERT, attention mechanism, Transformer networks, and transfer learning.
Stars: ✭ 567 (+195.31%)
Mutual labels:  neural-networks, language-model
Speech To Text Benchmark
Speech-to-text benchmark framework
Stars: ✭ 481 (+150.52%)
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (+284.38%)
Annyang
💬 Speech recognition for your site
Stars: ✭ 6,216 (+3137.5%)
Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+297.92%)
Speech Demo
Speech API examples
Stars: ✭ 454 (+136.46%)
Tacotron asr
Speech Recognition Using Tacotron
Stars: ✭ 165 (-14.06%)
Artyom.js
A JavaScript library for voice control, voice commands, speech recognition, and speech synthesis. Create your own Siri, Google Now, or Cortana with Google Chrome within your website.
Stars: ✭ 1,011 (+426.56%)
Transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Stars: ✭ 55,742 (+28932.29%)
Self Supervised Speech Recognition
Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework
Stars: ✭ 106 (-44.79%)
Kalliope
Kalliope is a framework that will help you to create your own personal assistant.
Stars: ✭ 1,509 (+685.94%)
Discordspeechbot
A speech-to-text bot for Discord with music commands and more, built with Node.js. Ideal for controlling your Discord server using voice commands; it can also be useful for hearing-impaired people.
Stars: ✭ 35 (-81.77%)
Openasr
A PyTorch-based end-to-end speech recognition system.
Stars: ✭ 69 (-64.06%)
Dragonfire
The open-source virtual assistant for Ubuntu-based Linux distributions
Stars: ✭ 1,120 (+483.33%)
Voice Overlay Ios
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (+129.17%)
B.e.n.j.i.
B.E.N.J.I.: the Impossible Missions Force's digital assistant
Stars: ✭ 83 (-56.77%)
Deepspeech Websocket Server
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
Stars: ✭ 79 (-58.85%)
Factorized Tdnn
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi
Stars: ✭ 98 (-48.96%)
Deepspeech
A PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+534.9%)
Speech And Text
Speech to text (PocketSphinx, iFlytek API, Baidu API) and text to speech (pyttsx3)
Stars: ✭ 102 (-46.87%)
Asr audio data links
A list of publicly available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-33.33%)
Tensorflow Ctc Speech Recognition
Application of Connectionist Temporal Classification (CTC) for Speech Recognition (TensorFlow 1.0 but compatible with 2.0).
Stars: ✭ 127 (-33.85%)
Persephone
A tool for automatic phoneme transcription
Stars: ✭ 130 (-32.29%)
Go Astideepspeech
Golang bindings for Mozilla's DeepSpeech speech-to-text library
Stars: ✭ 137 (-28.65%)
anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (-72.4%)
revai-python-sdk
Rev AI Python SDK
Stars: ✭ 35 (-81.77%)
Asrt speechrecognition
A Deep-Learning-Based Chinese Speech Recognition System
Stars: ✭ 4,943 (+2474.48%)
Nativescript Speech Recognition
💬 Speech to text, using the awesome engines readily available on the device.
Stars: ✭ 72 (-62.5%)
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+5707.81%)