End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-51.92%)

Mutual labels: speech-recognition, speech-to-text

Chinese-automatic-speech-recognition

Chinese speech recognition

Stars: ✭ 147 (+41.35%)

Mutual labels: speech-recognition, speech-to-text

scripty

Speech to text bot for Discord using Mozilla's DeepSpeech

Stars: ✭ 14 (-86.54%)

Mutual labels: speech-recognition, speech-to-text

K6nele

An Android app that offers speech-to-text services and user interfaces to other apps

Stars: ✭ 196 (+88.46%)

Mutual labels: speech-recognition, speech-to-text

Rnn ctc

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Stars: ✭ 220 (+111.54%)

Mutual labels: speech-recognition, speech-to-text

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-74.04%)

Mutual labels: speech-recognition, speech-to-text

DeepSpeech-API

The code enables users to use Mozilla's Deep Speech model over the Web Browser.

Stars: ✭ 31 (-70.19%)

Mutual labels: speech-recognition, speech-to-text

Speech To Text Russian

Проект для распознавания речи на русском языке на основе pykaldi.

Stars: ✭ 151 (+45.19%)

Mutual labels: speech-recognition, speech-to-text

2018-dlsl

UPC Deep Learning for Speech and Language 2018

Stars: ✭ 18 (-82.69%)

Mutual labels: speech-recognition, automatic-speech-recognition

Tacotron asr

Speech Recognition Using Tacotron

Stars: ✭ 165 (+58.65%)

Mutual labels: speech-recognition, speech-to-text

Lingvo

Stars: ✭ 2,361 (+2170.19%)

Mutual labels: speech-recognition, speech-to-text

Dictate.js

A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

Stars: ✭ 195 (+87.5%)

Mutual labels: speech-recognition, speech-to-text

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (+97.12%)

Mutual labels: speech-recognition, speech-to-text

Tensorflow Speech Recognition

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

Stars: ✭ 2,118 (+1936.54%)

Mutual labels: speech-recognition, speech-to-text

anycontrol

Voice control for your websites and applications

Stars: ✭ 53 (-49.04%)

Mutual labels: speech-recognition, speech-to-text

speechmatics-python

Python library and CLI for Speechmatics

Stars: ✭ 24 (-76.92%)

Mutual labels: speech-recognition, speech-to-text

obvi

A Polymer 3+ webcomponent / button for doing speech recognition

Stars: ✭ 54 (-48.08%)

Mutual labels: speech-recognition, automatic-speech-recognition

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (-14.42%)

Mutual labels: speech-recognition, speech-to-text

revai-python-sdk

Rev AI Python SDK

Stars: ✭ 35 (-66.35%)

Mutual labels: speech-recognition, speech-to-text

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (+72.12%)

Mutual labels: speech-recognition, speech-to-text

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+708.65%)

Mutual labels: speech-recognition, speech-to-text

Zzz Retired openstt

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

Stars: ✭ 146 (+40.38%)

Mutual labels: speech-recognition, speech-to-text

Speechrecognizerbutton

UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.

Stars: ✭ 144 (+38.46%)

Mutual labels: speech-recognition, speech-to-text

Hey Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Stars: ✭ 161 (+54.81%)

Mutual labels: speech-recognition, speech-to-text

Go Astideepspeech

Golang bindings for Mozilla's DeepSpeech speech-to-text library

Stars: ✭ 137 (+31.73%)

Mutual labels: speech-recognition, speech-to-text

Deepspeech Server

A testing server for a speech to text service based on mozilla deepspeech

Stars: ✭ 176 (+69.23%)

Mutual labels: speech-recognition, speech-to-text

Naomi

The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!

Stars: ✭ 171 (+64.42%)

Mutual labels: speech-recognition, speech-to-text

rnnt decoder cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.

Stars: ✭ 60 (-42.31%)

Mutual labels: speech-recognition, speech-to-text

Awesome Ai Services

An overview of the AI-as-a-service landscape

Stars: ✭ 133 (+27.88%)

Mutual labels: speech-recognition, speech-to-text

speechrec

a simple speech recognition app using the Web Speech API Interfaces

Stars: ✭ 18 (-82.69%)

Mutual labels: speech-recognition, speech-to-text

Automatic Speech Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

Stars: ✭ 192 (+84.62%)

Mutual labels: speech-recognition, speech-to-text

Unity live caption

Use Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!

Stars: ✭ 26 (-75%)

Mutual labels: speech-recognition, speech-to-text

Voice Overlay Android

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 189 (+81.73%)

Mutual labels: speech-recognition, speech-to-text

hf-experiments

Experiments with Hugging Face 🔬 🤗

Stars: ✭ 37 (-64.42%)

Mutual labels: speech-recognition, automatic-speech-recognition

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Stars: ✭ 196 (+88.46%)

Mutual labels: speech-recognition, speech-to-text

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (-66.35%)

Mutual labels: speech-recognition, speech-to-text

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (+23.08%)

Mutual labels: speech-recognition, speech-to-text

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (-79.81%)

Mutual labels: speech-recognition, speech-to-text

Automatic speech recognition

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Stars: ✭ 2,751 (+2545.19%)

Mutual labels: speech-recognition, automatic-speech-recognition

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

Stars: ✭ 25 (-75.96%)

Mutual labels: speech-recognition, speech-to-text

Speech recognition with tensorflow

Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.

Stars: ✭ 253 (+143.27%)

Mutual labels: speech-recognition, speech-to-text

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (+97.12%)

Mutual labels: speech-recognition, speech-to-text

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-79.81%)

Mutual labels: speech-recognition, speech-to-text

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-49.04%)

Mutual labels: speech-recognition, speech-to-text

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+132.69%)

Mutual labels: speech-recognition, speech-to-text

kaldi helpers

🙊 A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.

Stars: ✭ 13 (-87.5%)

Mutual labels: automatic-speech-recognition, speech-to-text

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Stars: ✭ 2,384 (+2192.31%)

Mutual labels: speech-recognition, automatic-speech-recognition

web-voice-processor

A library for real-time voice processing in web browsers

Stars: ✭ 69 (-33.65%)

Mutual labels: speech-recognition, speech-to-text

React.ai

It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux

Stars: ✭ 38 (-63.46%)

Mutual labels: speech-recognition, speech-to-text

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-86.54%)

Mutual labels: speech-recognition, speech-to-text

octopus

On-device speech-to-index engine powered by deep learning.

Stars: ✭ 30 (-71.15%)

Mutual labels: speech-recognition, speech-to-text

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.