All Projects → Automatic Speech Recognition → Similar Projects or Alternatives

986 Open source projects that are alternatives of or similar to Automatic Speech Recognition

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-89.06%)

Mutual labels: speech-recognition, speech-to-text

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-72.4%)

Mutual labels: speech-recognition, speech-to-text

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (-81.77%)

Mutual labels: speech-recognition, speech-to-text

speechmatics-python

Python library and CLI for Speechmatics

Stars: ✭ 24 (-87.5%)

Mutual labels: speech-recognition, speech-to-text

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-73.96%)

Mutual labels: speech-recognition, speech-to-text

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (-6.77%)

Mutual labels: speech-recognition, speech-to-text

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-92.71%)

Mutual labels: speech-recognition, speech-to-text

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+338.02%)

Mutual labels: speech-recognition, speech-to-text

Inimesed

An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.

Stars: ✭ 65 (-66.15%)

Mutual labels: speech-recognition, speech-to-text

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-85.94%)

Mutual labels: speech-recognition, speech-to-text

kaldi-long-audio-alignment

Long audio alignment using Kaldi

Stars: ✭ 21 (-89.06%)

Mutual labels: speech-recognition, speech-to-text

revai-node-sdk

Node.js SDK for the Rev AI API

Stars: ✭ 21 (-89.06%)

Mutual labels: speech-recognition, speech-to-text

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (+6.77%)

Mutual labels: speech-recognition, speech-to-text

voce-browser

Voice Controlled Chromium Web Browser

Stars: ✭ 34 (-82.29%)

Mutual labels: speech-recognition, speech-to-text

deepspeech

A PyTorch implementation of DeepSpeech and DeepSpeech2.

Stars: ✭ 45 (-76.56%)

Mutual labels: speech-recognition, speech-to-text

speech-to-text

mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras

Stars: ✭ 61 (-68.23%)

Mutual labels: speech-recognition, speech-to-text

speech to text

how to use the Google Cloud Speech API to transcribe audio/video files.

Stars: ✭ 35 (-81.77%)

Mutual labels: speech-recognition, speech-to-text

leon

🧠 Leon is your open-source personal assistant.

Stars: ✭ 8,560 (+4358.33%)

Mutual labels: speech-recognition, speech-to-text

htk

HTK Toolkit with Linux 64 bit and Docker support

Stars: ✭ 14 (-92.71%)

Mutual labels: speech-recognition, speech-to-text

Speech To Text Russian

Проект для распознавания речи на русском языке на основе pykaldi.

Stars: ✭ 151 (-21.35%)

Mutual labels: speech-recognition, speech-to-text

vosk-asterisk

Speech Recognition in Asterisk with Vosk Server

Stars: ✭ 52 (-72.92%)

Mutual labels: speech-recognition, speech-to-text

Wav2letter.pytorch

A fully convolution-network for speech-to-text, built on pytorch.

Stars: ✭ 104 (-45.83%)

Mutual labels: speech-recognition, speech-to-text

Tensorflow end2end speech recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

Stars: ✭ 305 (+58.85%)

Mutual labels: speech-recognition, speech-to-text

Cppflow

Run TensorFlow models in C++ without installation and without Bazel

Stars: ✭ 357 (+85.94%)

Mutual labels: neural-networks, tensorflow-models

Phonetisaurus

Phonetisaurus G2P

Stars: ✭ 277 (+44.27%)

Mutual labels: speech-recognition, speech-to-text

Ctcwordbeamsearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.

Stars: ✭ 398 (+107.29%)

Mutual labels: speech-recognition, language-model

Awesome Kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Stars: ✭ 393 (+104.69%)

Mutual labels: speech-recognition, speech-to-text

Rhino

On-device speech-to-intent engine powered by deep learning

Stars: ✭ 406 (+111.46%)

Mutual labels: speech-recognition, speech-to-text

leopard

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (+84.38%)

Mutual labels: speech-recognition, speech-to-text

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (+171.88%)

Mutual labels: speech-recognition, speech-to-text

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+155.21%)

Mutual labels: speech-recognition, speech-to-text

Awesome Bert Nlp

A curated list of NLP resources focused on BERT, attention mechanism, Transformer networks, and transfer learning.

Stars: ✭ 567 (+195.31%)

Mutual labels: neural-networks, language-model

Speech To Text Benchmark

speech to text benchmark framework

Stars: ✭ 481 (+150.52%)

Mutual labels: speech-recognition, speech-to-text

Eesen

The official repository of the Eesen project

Stars: ✭ 738 (+284.38%)

Mutual labels: speech-recognition, speech-to-text

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+3137.5%)

Mutual labels: speech-recognition, speech-to-text

Sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

Stars: ✭ 764 (+297.92%)

Mutual labels: neural-networks, speech-recognition

Speech Demo

语音api示例

Stars: ✭ 454 (+136.46%)

Mutual labels: speech-recognition, speech-to-text

Tacotron asr

Speech Recognition Using Tacotron

Stars: ✭ 165 (-14.06%)

Mutual labels: speech-recognition, speech-to-text

Artyom.js

A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.

Stars: ✭ 1,011 (+426.56%)

Mutual labels: speech-recognition, speech-to-text

Transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Stars: ✭ 55,742 (+28932.29%)

Mutual labels: language-model, speech-recognition

Self Supervised Speech Recognition

speech to text with self-supervised learning based on wav2vec 2.0 framework

Stars: ✭ 106 (-44.79%)

Mutual labels: speech-recognition, speech-to-text

Kalliope

Kalliope is a framework that will help you to create your own personal assistant.

Stars: ✭ 1,509 (+685.94%)

Mutual labels: speech-recognition, speech-to-text

Discordspeechbot

A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.

Stars: ✭ 35 (-81.77%)

Mutual labels: speech-recognition, speech-to-text

Openasr

A pytorch based end2end speech recognition system.

Stars: ✭ 69 (-64.06%)

Mutual labels: speech-recognition, speech-to-text

Dragonfire

the open-source virtual assistant for Ubuntu based Linux distributions

Stars: ✭ 1,120 (+483.33%)

Mutual labels: speech-recognition, speech-to-text

Voice Overlay Ios

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI