WARNING: This repository is no longer maintained ⚠️ This repository will not be updated. The repository will be kept available in read-only mode. A Node app that use Watson Visual Recognition, Speech to Text, Natural Language Understanding, and Tone Analyzer to enrich media files.

Stars: ✭ 23 (+64.29%)

Mutual labels: watson-speech

Kerasdeepspeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Stars: ✭ 245 (+1650%)

Mutual labels: speech-to-text

wiki2ssml

Wiki2SSML provides the WikiVoice markup language used for fine-tuning synthesised voice.

Stars: ✭ 31 (+121.43%)

Mutual labels: ibm-watson-speech

PCPM

Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.

Stars: ✭ 21 (+50%)

Mutual labels: speech-to-text

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (+50%)

Mutual labels: speech-to-text

Inimesed

An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.

Stars: ✭ 65 (+364.29%)

Mutual labels: speech-to-text

Speech recognition with tensorflow

Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.

Stars: ✭ 253 (+1707.14%)

Mutual labels: speech-to-text

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (+50%)

Mutual labels: speech-to-text

scripty

Speech to text bot for Discord using Mozilla's DeepSpeech

Stars: ✭ 14 (+0%)

Mutual labels: speech-to-text

Go Astibob

Golang framework to build an AI that can understand and speak back to you, and everything else you want

Stars: ✭ 222 (+1485.71%)

Mutual labels: speech-to-text

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (+257.14%)

Mutual labels: speech-to-text

octopus

On-device speech-to-index engine powered by deep learning.

Stars: ✭ 30 (+114.29%)

Mutual labels: speech-to-text

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (+1364.29%)

Mutual labels: speech-to-text

K6nele

An Android app that offers speech-to-text services and user interfaces to other apps

Stars: ✭ 196 (+1300%)

Mutual labels: speech-to-text

revai-java-sdk

Rev.ai Java SDK

Stars: ✭ 16 (+14.29%)

Mutual labels: speech-to-text

Lingvo

Stars: ✭ 2,361 (+16764.29%)

Mutual labels: speech-to-text

Automatic Speech Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

Stars: ✭ 192 (+1271.43%)

Mutual labels: speech-to-text

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (+1364.29%)

Mutual labels: speech-to-text

Watson-Unity-ARKit

# WARNING: This repository is no longer maintained ⚠️ This repository will not be updated. The repository will be kept available in read-only mode.

Stars: ✭ 24 (+71.43%)

Mutual labels: watson-speech

speechrec

a simple speech recognition app using the Web Speech API Interfaces

Stars: ✭ 18 (+28.57%)

Mutual labels: speech-to-text

parlatype

GNOME audio player for transcription

Stars: ✭ 151 (+978.57%)

Mutual labels: transcription

Tensorflow Speech Recognition

🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks

Stars: ✭ 2,118 (+15028.57%)

Mutual labels: speech-to-text

anycontrol

Voice control for your websites and applications

Stars: ✭ 53 (+278.57%)

Mutual labels: speech-to-text

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

Stars: ✭ 25 (+78.57%)

Mutual labels: speech-to-text

Chinese-automatic-speech-recognition

Chinese speech recognition

Stars: ✭ 147 (+950%)

Mutual labels: speech-to-text

LastSecondSlides

Use the Google speech-to-text API to generate presentation slides as you talk!

Stars: ✭ 32 (+128.57%)

Mutual labels: google-speech

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (+535.71%)

Mutual labels: speech-to-text

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+1628.57%)

Mutual labels: speech-to-text

DeepSpeech-API

The code enables users to use Mozilla's Deep Speech model over the Web Browser.

Stars: ✭ 31 (+121.43%)

Mutual labels: speech-to-text

Stt

🐸STT - a deep learning toolkit for Speech-to-Text, battle-tested in research and production

Stars: ✭ 197 (+1307.14%)

Mutual labels: speech-to-text

React.ai

It recognize your speech and trained AI Bot will respond(i.e Customer Service, Personal Assistant) using Machine Learning API (DialogFlow, apiai), Speech Recognition, GraphQL, Next.js, React, redux

Stars: ✭ 38 (+171.43%)

Mutual labels: speech-to-text

Rnn ctc

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Stars: ✭ 220 (+1471.43%)

Mutual labels: speech-to-text

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+5907.14%)

Mutual labels: speech-to-text

Kaldi Active Grammar

Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time

Stars: ✭ 196 (+1300%)

Mutual labels: speech-to-text

web-voice-processor

A library for real-time voice processing in web browsers

Stars: ✭ 69 (+392.86%)

Mutual labels: speech-to-text

Dictate.js

A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

Stars: ✭ 195 (+1292.86%)

Mutual labels: speech-to-text

rnnt decoder cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.

Stars: ✭ 60 (+328.57%)

Mutual labels: speech-to-text

Expressive tacotron

Tensorflow Implementation of Expressive Tacotron

Stars: ✭ 192 (+1271.43%)

Mutual labels: speech-to-text

react-native-spokestack

Spokestack: give your React Native app a voice interface!