All Categories → Machine Learning → speech-to-text

Top 151 speech-to-text open source projects

A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.

✭ 35

javascript speech-recognition discord-bot speech speech-to-text voice-commands

Botium Speech Processing

✭ 908

javascript text-to-speech speech-to-text

Speechtotext Websockets Java

SDK & Sample to do speech recognition using websockets in Java

✭ 11

java websockets speech-to-text cognitive-services

Kur

Descriptive Deep Learning

✭ 811

python deep-learning machine-learning neural-network deep-neural-networks neural-networks speech-recognition speech-to-text image-recognition deep-learning-tutorial

Stephanie Va

Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.

✭ 772

python speech-recognition speech-to-text personal-assistant

Nonocaptcha

An asynchronized Python library to automate solving ReCAPTCHA v2 using audio

✭ 744

python asyncio speech-to-text recaptcha

Eesen

The official repository of the Eesen project

✭ 738

tensorflow speech-recognition speech-to-text asr kaldi ctc

Annyang

💬 Speech recognition for your site

✭ 6,216

javascript HTML hacktoberfest speech-recognition speech speech-to-text voice

Adapt

Adapt Intent Parser

✭ 690

python open-source opensource speech-recognition speech-to-text

Speech recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

✭ 5,999

python shell audio speech-recognition speech-to-text

Voicy

@voicybot Telegram bot main repository

✭ 620

javascript bot telegram-bot speech-to-text

Open stt

Open STT

✭ 584

python dataset speech-to-text russian asr

Nodejs Speech

Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.

✭ 545

typescript nodejs machine-learning speech speech-to-text

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

✭ 532

javascript node speech-recognition speech speech-to-text alexa voice-recognition voice-control

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

✭ 522

jupyter-notebook pytorch speech-recognition speech-to-text pretrained-models english asr

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

✭ 490

java api google speech-recognition speech speech-to-text speech-synthesis recognition

Speech To Text Benchmark

speech to text benchmark framework

✭ 481

python deep-learning deep-neural-networks privacy speech-recognition offline speech-to-text voice-recognition

Speech Demo

语音api示例

✭ 454

java rest-api speech-recognition speech-to-text baidu

Voice Overlay Ios

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

✭ 440

swift ios search speech-recognition input speech-to-text voice overlay permissions chatbots conversation voice-recognition conversational-ui

Asrt speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

✭ 4,943

python tensorflow keras cnn speech-recognition speech-to-text ctc chinese-speech-recognition asrt

Rhino

On-device speech-to-intent engine powered by deep learning

✭ 406

javascript python c android deep-learning nodejs ios raspberry-pi iot speech-recognition speech-to-text natural-language-understanding voice-recognition voice-commands cortex-m voice-control

Tensorflowasr

⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

✭ 400

python tensorflow speech-recognition speech-to-text ctc

Awesome Kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

✭ 393

awesome-list speech-recognition speech speech-to-text kaldi

Cheetah

On-device streaming speech-to-text engine powered by deep learning

✭ 383

python c android deep-learning ios machine-learning raspberry-pi iot webassembly arm speech-recognition offline speech-to-text asr voice-recognition

Autoedit 2

Fast text based video editing, node Electron Os X desktop app, with Backbone front end.

✭ 343

javascript electron osx desktop mac speech-to-text backbone video-editing backbonejs watson

Deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

✭ 18,680

C++python c shell C#swift deep-learning machine-learning tensorflow neural-networks embedded speech-recognition offline speech-to-text deepspeech on-device

React Mic

Record audio from a user's microphone and display a cool visualization.

✭ 323

javascript reactjs speech-to-text voice microphone voice-recognition audio-visualizer

Tensorflow end2end speech recognition

End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)

✭ 305

python tensorflow speech-recognition attention-mechanism speech-to-text asr end-to-end ctc beam-search

Css10

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

✭ 302

html dataset speech speech-to-text

Phonetisaurus

Phonetisaurus G2P

✭ 277

shell speech-recognition speech-to-text

Nemo

NeMo: a toolkit for conversational AI

✭ 3,685

Jupyter Notebook python Roff shell C++HTML deep-learning nlp neural-network speech-recognition nlp-machine-learning text-to-speech machine-translation speech-synthesis speech-to-text nmt

demo vietasr

Vietnamese Speech Recognition

✭ 22

C++python Makefile Cuda shell Jupyter Notebook speech-recognition automatic-speech-recognition speech-to-text stt asr vietnamese-nlp ctc-loss vietnamese-language ctc-decode vietnamese-speech-recognition

kim-voice-assistant

Kim，你的私人语音助理。

✭ 70

python Dockerfile dockerfile iot pip speech-recognition speech-to-text mit-license aliyun snowboy speakers kim-voice-assistant

sova-asr

SOVA ASR (Automatic Speech Recognition)

✭ 123

python javascript CSS HTML Dockerfile speech speech-recognition automatic-speech-recognition speech-to-text stt asr wav2letter asr-model

BangalASR

Transformer based Bangla Speech Recognition

✭ 20

Jupyter Notebook python transformers bangla speech-to-text attention-is-all-you-need bangla-nlp bangla-asr bangla-speech-recognition bangla-speech-to-text bangla-automatic-speech-recognition

musicologist

Music advice from a conversational interface powered by Algolia

leon

🧠 Leon is your open-source personal assistant.

htk

HTK Toolkit with Linux 64 bit and Docker support

✭ 14

c TeX machine-learning speech-recognition speech-to-text htk

SpeechToText

Speech To Text in Android

✭ 53

java android android-application speech-recognition android-studio speech-to-text

s3-lambda-transcribe-audio-to-text-s3

Transcribe your audio to text with this serverless component

✭ 84

javascript audio lambda serverless s3 speech-to-text transcribe transcribe-audio-files

speech-to-text

mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras

✭ 61

Jupyter Notebook python shell perl cnn dnn speech-recognition speech-to-text kaldi

speech-recognition

SDKs and docs for Skit's speech to text service

✭ 20

python java speech-recognition speech-to-text asr multilingual-speech-recognition speech-recognition-api

simple diarizer

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

✭ 26

python speech-to-text transcription asr speaker-diarization colab-notebook diarization

bingspeech-api-client

Microsoft Bing Speech API client in node.js

✭ 32

typescript text-to-speech tts speech-to-text bing-speech stt

Athena

A free and open source replacement for Google Assistant on Android devices, meant to integrate with the Sapphire Framework. It contains both speech-to-text and text-to-speech services. It does not require Google services or network connectivity

✭ 73

kotlin java android text-to-speech foss tasker assistant speech-to-text termux google-assistant

voce-browser

Voice Controlled Chromium Web Browser

✭ 34