End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-65.99%)

Mutual labels: speech-recognition, speech-to-text

Speech recognition with tensorflow

Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.

Stars: ✭ 253 (+72.11%)

Mutual labels: speech-recognition, speech-to-text

revai-python-sdk

Rev AI Python SDK

Stars: ✭ 35 (-76.19%)

Mutual labels: speech-recognition, speech-to-text

Dla

Deep learning for audio processing

Stars: ✭ 142 (-3.4%)

Mutual labels: signal-processing, speech-recognition

web-voice-processor

A library for real-time voice processing in web browsers

Stars: ✭ 69 (-53.06%)

Mutual labels: speech-recognition, speech-to-text

revai-java-sdk

Rev.ai Java SDK

Stars: ✭ 16 (-89.12%)

Mutual labels: speech-recognition, speech-to-text

React.ai

Recognizes your speech, and a trained AI bot responds (e.g., customer service, personal assistant), using machine learning APIs (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, and Redux.

Stars: ✭ 38 (-74.15%)

Mutual labels: speech-recognition, speech-to-text

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+7485.71%)

Mutual labels: speech-recognition, speech-to-text

Kalliope

Kalliope is a framework that will help you to create your own personal assistant.

Stars: ✭ 1,509 (+926.53%)

Mutual labels: speech-recognition, speech-to-text

Asr audio data links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-12.93%)

Mutual labels: speech-recognition, speech-to-text

Self Supervised Speech Recognition

Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework

Stars: ✭ 106 (-27.89%)

Mutual labels: speech-recognition, speech-to-text

Speechrecognizerbutton

UIButton subclass with push-to-talk recording, speech recognition, and a Siri-style waveform view.

Stars: ✭ 144 (-2.04%)

Mutual labels: speech-recognition, speech-to-text

Go Astideepspeech

Golang bindings for Mozilla's DeepSpeech speech-to-text library

Stars: ✭ 137 (-6.8%)

Mutual labels: speech-recognition, speech-to-text

Hey Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Stars: ✭ 161 (+9.52%)

Mutual labels: speech-recognition, speech-to-text

Wav2letter.pytorch

A fully convolutional network for speech-to-text, built on PyTorch.

Stars: ✭ 104 (-29.25%)

Mutual labels: speech-recognition, speech-to-text

Vosk

VOSK Speech Recognition Toolkit

Stars: ✭ 182 (+23.81%)

Mutual labels: speech-recognition, speech-to-text

Deepspeech Server

A testing server for a speech-to-text service based on Mozilla DeepSpeech

Stars: ✭ 176 (+19.73%)

Mutual labels: speech-recognition, speech-to-text

Voice Overlay Android

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 189 (+28.57%)

Mutual labels: speech-recognition, speech-to-text

Naomi

The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!

Stars: ✭ 171 (+16.33%)

Mutual labels: speech-recognition, speech-to-text

Dictate.js

A small JavaScript library for browser-based real-time speech recognition. It uses Recorderjs for audio capture and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

Stars: ✭ 195 (+32.65%)

Mutual labels: speech-recognition, speech-to-text

Lingvo

Stars: ✭ 2,361 (+1506.12%)

Mutual labels: speech-recognition, speech-to-text

End-to-End-Mandarin-ASR

End-to-end speech recognition on AISHELL dataset.

Stars: ✭ 20 (-86.39%)

Mutual labels: speech-recognition, chinese-speech-recognition

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (-29.93%)

Mutual labels: speech-recognition, speech-to-text

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

Stars: ✭ 25 (-82.99%)

Mutual labels: speech-recognition, speech-to-text

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+64.63%)

Mutual labels: speech-recognition, speech-to-text

Automatic speech recognition

End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow

Stars: ✭ 2,751 (+1771.43%)

Mutual labels: speech-recognition, chinese-speech-recognition

Nemo

NeMo: a toolkit for conversational AI

Stars: ✭ 3,685 (+2406.8%)

Mutual labels: speech-recognition, speech-to-text

anycontrol

Voice control for your websites and applications

Stars: ✭ 53 (-63.95%)

Mutual labels: speech-recognition, speech-to-text

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (-85.71%)

Mutual labels: speech-recognition, speech-to-text

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (-39.46%)

Mutual labels: speech-recognition, speech-to-text

Rnn ctc

Recurrent Neural Network and Long Short-Term Memory (LSTM) with Connectionist Temporal Classification, implemented in Theano. Includes a toy training example.

Stars: ✭ 220 (+49.66%)

Mutual labels: speech-recognition, speech-to-text

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-63.95%)

Mutual labels: speech-recognition, speech-to-text

torchsubband

PyTorch implementation of subband decomposition

Stars: ✭ 63 (-57.14%)

Mutual labels: signal-processing, speech-recognition

octopus

On-device speech-to-index engine powered by deep learning.