Top 151 speech-to-text open source projects

kaldi-long-audio-alignment

Long audio alignment using Kaldi

✭ 21

shell python speech-recognition automatic-speech-recognition speech-to-text kaldi transcription asr speechrecognition split-audio longaudio-alignment audio-segments speech-transcription

Python helper for Google and IBM Watson speech-to-text cloud APIs.

✭ 14

python ibm-watson-speech google-speech speech-to-text watson-speech-sdk transcription dictation watson-speech

Unity live caption

Use the Google Speech-to-Text API for real-time live-stream captioning in Unity! Works best when combined with your virtual character!

✭ 26

python C# stream unity speech-recognition google-api speech-to-text vtuber youtuber automatic-caption live-caption

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

✭ 841

text-to-speech tts speech-synthesis voice-recognition speech-recognition speech-to-text stt speech-processing voice-activity-detection speech-separation speech-emotion-recognition voice-cloning

📢 Complete V bindings for Mozilla's DeepSpeech TensorFlow based Speech-to-Text library. 📜

✭ 38

V AMPL machine-learning tensorflow mozilla speech-to-text v deepspeech

Chinese-automatic-speech-recognition

Chinese speech recognition

✭ 147

Jupyter Notebook python machine-learning deep-learning signal-processing speech-recognition chinese-nlp speech-to-text chinese-speech-recognition chinese-speech-to-text

Speech to text bot for Discord using Mozilla's DeepSpeech

✭ 14

rust discord discord-bot speech-recognition speech-to-text stt

kaldi ag training

Docker image and scripts for training fine-tuned or completely personal Kaldi speech models. Intended particularly for use with kaldi-active-grammar.

✭ 14

shell python training custom personal speech speech-recognition speech-to-text kaldi fine-tuning kaldi-asr

A collection of pretrained models. Links to pretrained models in NLP and voice.

✭ 21

sentiment speech-recognition speech-to-text pretrained-models language-model asr pretrained

An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.

✭ 65

java c HTML Makefile CSS android speech-recognition speech-to-text pocketsphinx android-ndk estonian

Lets users run Mozilla's DeepSpeech model in the web browser.

✭ 31

typescript python HTML javascript CSS speech-recognition speech-to-text mozilla-deepspeech

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

✭ 50

swift ruby text-to-speech interpreter translation youtube-video speech-synthesis voice-recognition speech-recognition speech-to-text amazon-polly amazon-cognito mobile-development speech-recognizer translation-api aws-sdk-ios aws-mobilehub amazon-translate

rnnt decoder cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.

✭ 60

Cuda C++ python Makefile cuda speech-recognition beam-search speech-to-text transducer handwriting-recognition prefix-search rnnt

A simple speech recognition app using the Web Speech API interfaces

✭ 18

javascript CSS HTML speech-synthesis speech-recognition speech-to-text speech-processing speech-api

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

✭ 35

javascript HTML shell text-to-speech azure speech-synthesis speech-recognition speech-to-text cognitive-services

speechmatics-python

Python library and CLI for Speechmatics

✭ 24

python Makefile cli speech-recognition speech-to-text transcription

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

✭ 25

javascript diff statistics evaluation comparison speech-recognition accuracy words speech-to-text stt difference asr mismatches wer word-error-rate transcriptions punctuations insertions

Open Source AI Benchmarking toolkit for benchmarking speech to text services

✭ 43

python HTML benchmark machine-learning speech-to-text benchmarking-suite asr-benchmark stt-benchmark

Speech-to-text and keyboard input captions for OBS.

✭ 89

typescript HTML rust CSS javascript twitch angular azure webrtc speech captions tts subtitles speech-recognition speech-to-text obs stt text-animation tauri akita stt-plugins

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

✭ 21

java offline voice-commands speech voice-recognition speech-recognition voice-chat speech-to-text voice-control voice-assistant speech-to-text-android on-device

Recognizes your speech so that a trained AI bot can respond (e.g., customer service, personal assistant), using a machine learning API (DialogFlow, apiai), speech recognition, GraphQL, Next.js, React, and Redux

On-device speech-to-index engine powered by deep learning.

✭ 30

python typescript swift javascript java c audio voice-recognition speech-recognition speech-to-text voice-search speech-to-index

web-voice-processor

A library for real-time voice processing in web browsers

✭ 69

typescript javascript HTML python real-time browser worker realtime voice-commands microphone speech-recognition webaudio-api pcm web-browser speech-to-text audio-processing wake-word-detection downsampling voice-processing

Rev.ai Java SDK

✭ 16

java sdk captions speech-recognition speech-to-text rev revai transcription-job

react-native-spokestack

Spokestack: give your React Native app a voice interface!

ASR-Audio-Data-Links

A list of publicly available audio data that anyone can download for ASR or other speech activities

✭ 179

shell data speech speech-recognition audio-data speech-to-text asr speech-activities

Live speech recognition using Facebook's wav2vec 2.0 model.

✭ 205

python pyaudio speech speech-recognition speech-to-text asr wav2vec wav2vec2

On-device speech-to-text engine powered by deep learning

✭ 354

python java C# typescript rust go voice-recognition speech-recognition automatic-speech-recognition speech-to-text transcription stt asr voice-to-text on-device

revai-python-sdk

Rev AI Python SDK

✭ 35

python Makefile Dockerfile sdk realtime captions speech-recognition speech-to-text rev transcription-job

Voice control for your websites and applications

✭ 53

javascript voice speech speech-recognition speech-to-text voice-control voice-assistant speech-api anycontrol

A merged version of multiple open-source German speech datasets.

✭ 21

Jupyter Notebook python shell corpus dataset speech-recognition speech-to-text asr
