End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (-18.03%)

Mutual labels: speech-recognition, speech-to-text

Eesen

The official repository of the Eesen project

Stars: ✭ 738 (+1109.84%)

Mutual labels: speech-recognition, speech-to-text

DeepSpeech-API

The code enables users to use Mozilla's Deep Speech model over the Web Browser.

Stars: ✭ 31 (-49.18%)

Mutual labels: speech-recognition, speech-to-text

Chinese-automatic-speech-recognition

Chinese speech recognition

Stars: ✭ 147 (+140.98%)

Mutual labels: speech-recognition, speech-to-text

PCPM

Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.

Stars: ✭ 21 (-65.57%)

Mutual labels: speech-recognition, speech-to-text

Deep-learning-And-Paper

【仅作为交流学习使用】机器智能--相关书目及经典论文包括AutoML、情感分类、语音识别、声纹识别、语音合成实验代码等

Stars: ✭ 62 (+1.64%)

Mutual labels: speech-recognition, speech-to-text

speech-to-text-code-pattern

React app using the Watson Speech to Text service to transform voice audio into written text.

Stars: ✭ 37 (-39.34%)

Mutual labels: speech-recognition, speech-to-text

Artyom.js

A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.

Stars: ✭ 1,011 (+1557.38%)

Mutual labels: speech-recognition, speech-to-text

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-55.74%)

Mutual labels: speech-recognition, speech-to-text

Phonetisaurus

Phonetisaurus G2P

Stars: ✭ 277 (+354.1%)

Mutual labels: speech-recognition, speech-to-text

Tensorflow end2end speech recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

Stars: ✭ 305 (+400%)

Mutual labels: speech-recognition, speech-to-text

Asrt speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Stars: ✭ 4,943 (+8003.28%)

Mutual labels: speech-recognition, speech-to-text

Awesome Kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Stars: ✭ 393 (+544.26%)

Mutual labels: speech-recognition, speech-to-text

Discordspeechbot

A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.

Stars: ✭ 35 (-42.62%)

Mutual labels: speech-recognition, speech-to-text

Adapt

Adapt Intent Parser

Stars: ✭ 690 (+1031.15%)

Mutual labels: speech-recognition, speech-to-text

rnnt decoder cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.

Stars: ✭ 60 (-1.64%)

Mutual labels: speech-recognition, speech-to-text

Kur

Descriptive Deep Learning

Stars: ✭ 811 (+1229.51%)

Mutual labels: speech-recognition, speech-to-text

speechrec

a simple speech recognition app using the Web Speech API Interfaces

Stars: ✭ 18 (-70.49%)

Mutual labels: speech-recognition, speech-to-text

Inimesed

An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.

Stars: ✭ 65 (+6.56%)

Mutual labels: speech-recognition, speech-to-text

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (-42.62%)

Mutual labels: speech-recognition, speech-to-text

scripty

Speech to text bot for Discord using Mozilla's DeepSpeech

Stars: ✭ 14 (-77.05%)

Mutual labels: speech-recognition, speech-to-text

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-77.05%)

Mutual labels: speech-recognition, speech-to-text

Unity live caption

Use Google Speech-to-Text API to do real-time live stream caption on Unity! Best when combined with your virtual character!

Stars: ✭ 26 (-57.38%)

Mutual labels: speech-recognition, speech-to-text

speechmatics-python

Python library and CLI for Speechmatics

Stars: ✭ 24 (-60.66%)

Mutual labels: speech-recognition, speech-to-text

revai-node-sdk

Node.js SDK for the Rev AI API

Stars: ✭ 21 (-65.57%)

Mutual labels: speech-recognition, speech-to-text

deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (+34.43%)

Mutual labels: speech-recognition, speech-to-text

vosk-asterisk

Speech Recognition in Asterisk with Vosk Server

Stars: ✭ 52 (-14.75%)

Mutual labels: speech-recognition, speech-to-text

deep avsr

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

Stars: ✭ 104 (+70.49%)

Mutual labels: speech-recognition, speech-to-text

speech-recognition

SDKs and docs for Skit's speech to text service

Stars: ✭ 20 (-67.21%)

Mutual labels: speech-recognition, speech-to-text

voce-browser

Voice Controlled Chromium Web Browser

Stars: ✭ 34 (-44.26%)

Mutual labels: speech-recognition, speech-to-text

SpeechToText

Speech To Text in Android

Stars: ✭ 53 (-13.11%)

Mutual labels: speech-recognition, speech-to-text

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

Stars: ✭ 25 (-59.02%)

Mutual labels: speech-recognition, speech-to-text

demo vietasr

Vietnamese Speech Recognition

Stars: ✭ 22 (-63.93%)

Mutual labels: speech-recognition, speech-to-text

kim-voice-assistant

Kim，你的私人语音助理。

Stars: ✭ 70 (+14.75%)

Mutual labels: speech-recognition, speech-to-text

Deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Stars: ✭ 18,680 (+30522.95%)

Mutual labels: speech-recognition, speech-to-text

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (+101.64%)

Mutual labels: speech-recognition, speech-to-text

Rhino

On-device speech-to-intent engine powered by deep learning

Stars: ✭ 406 (+565.57%)

Mutual labels: speech-recognition, speech-to-text

Tensorflowasr

⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

Stars: ✭ 400 (+555.74%)

Mutual labels: speech-recognition, speech-to-text

Speech Demo

语音api示例

Stars: ✭ 454 (+644.26%)

Mutual labels: speech-recognition, speech-to-text

musicologist

Music advice from a conversational interface powered by Algolia

Stars: ✭ 19 (-68.85%)

Mutual labels: speech-recognition, speech-to-text

Speech recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Stars: ✭ 5,999 (+9734.43%)

Mutual labels: speech-recognition, speech-to-text

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (+772.13%)

Mutual labels: speech-recognition, speech-to-text

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+10090.16%)

Mutual labels: speech-recognition, speech-to-text

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (+755.74%)

Mutual labels: speech-recognition, speech-to-text

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-65.57%)

Mutual labels: speech-recognition, speech-to-text

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (+45.9%)

Mutual labels: speech-recognition, speech-to-text

leon

🧠 Leon is your open-source personal assistant.

Stars: ✭ 8,560 (+13932.79%)

Mutual labels: speech-recognition, speech-to-text

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+703.28%)

Mutual labels: speech-recognition, speech-to-text

Stephanie Va

Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.

Stars: ✭ 772 (+1165.57%)

Mutual labels: speech-recognition, speech-to-text

Audio Pretrained Model

A collection of Audio and Speech pre-trained models.

Stars: ✭ 61 (+0%)

Mutual labels: speech-recognition, speech-to-text

1-60 of 846 similar projects

›

next*5