The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain, users can easily create speech processing systems for speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many other tasks.

Stars: ✭ 242 (+591.43%)

Mutual labels: speech, speech-recognition, speech-to-text

kaldi ag training

Docker image and scripts for training fine-tuned or completely personal Kaldi speech models, particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-60%)

Mutual labels: speech, speech-recognition, speech-to-text

speechrec

A simple speech recognition app using the Web Speech API interfaces

Stars: ✭ 18 (-48.57%)

Mutual labels: speech-recognition, speech-to-text, speech-api

spokestack-android

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

Stars: ✭ 52 (+48.57%)

Mutual labels: speech, speech-recognition, speech-api

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+1300%)

Mutual labels: speech, speech-recognition, speech-to-text

Openasr

A PyTorch-based end-to-end speech recognition system.

Stars: ✭ 69 (+97.14%)

Mutual labels: speech, speech-recognition, speech-to-text

Lingvo

Stars: ✭ 2,361 (+6645.71%)

Mutual labels: speech, speech-recognition, speech-to-text

Discordspeechbot

A speech-to-text bot for Discord with music commands and more, built with Node.js. Ideal for controlling your Discord server using voice commands; it can also be useful for hearing-impaired people.

Stars: ✭ 35 (+0%)

Mutual labels: speech, speech-recognition, speech-to-text

deepspeech.mxnet

An MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (+134.29%)

Mutual labels: speech, speech-recognition, speech-to-text

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+31760%)

Mutual labels: speech, speech-recognition, speech-to-text

ASR-Audio-Data-Links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (+411.43%)

Mutual labels: speech, speech-recognition, speech-to-text

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+17660%)

Mutual labels: speech, speech-recognition, speech-to-text

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-40%)

Mutual labels: speech, speech-recognition, speech-to-text

Awesome Kaldi

This is a list of features, scripts, blogs, and resources for making better use of Kaldi (http://kaldi-asr.org/)

Stars: ✭ 393 (+1022.86%)

Mutual labels: speech, speech-recognition, speech-to-text

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-22.86%)

Mutual labels: speech-recognition, speech-to-text, speech-api

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (+1420%)

Mutual labels: speech, speech-recognition, speech-to-text

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (+251.43%)

Mutual labels: speech, speech-recognition, speech-to-text

Syn Speech

Syn.Speech is a flexible, speaker-independent continuous speech recognition engine for Mono and the .NET Framework

Stars: ✭ 57 (+62.86%)

Mutual labels: speech, speech-recognition, speech-to-text

Deepspeech

A PaddlePaddle implementation of ASR.

Stars: ✭ 1,219 (+3382.86%)

Mutual labels: speech, speech-recognition, speech-to-text

Voice activity detection

Voice Activity Detection based on Deep Learning & TensorFlow

Stars: ✭ 132 (+277.14%)

Mutual labels: speech, speech-recognition

Allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Stars: ✭ 135 (+285.71%)

Mutual labels: speech, speech-recognition

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (+540%)

Mutual labels: speech, speech-recognition

Pytorch Kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

Stars: ✭ 2,097 (+5891.43%)

Mutual labels: speech, speech-recognition

vosk-asterisk

Speech Recognition in Asterisk with Vosk Server

Stars: ✭ 52 (+48.57%)

Mutual labels: speech-recognition, speech-to-text

Speechtotext Websockets Javascript

SDK and sample for speech recognition using WebSockets in JavaScript

Stars: ✭ 191 (+445.71%)

Mutual labels: speech, speech-recognition

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (+400%)

Mutual labels: speech, speech-recognition

Kerasdeepspeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Stars: ✭ 245 (+600%)

Mutual labels: speech, speech-to-text

Aeneas

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)

Stars: ✭ 1,942 (+5448.57%)

Mutual labels: ffmpeg, speech

idear

🎙️ Handsfree Audio Development Interface

Stars: ✭ 84 (+140%)

Mutual labels: speech, speech-recognition

Thumbnail

Thumbnail for a given video using FFmpeg

Stars: ✭ 96 (+174.29%)

Mutual labels: composer, ffmpeg

megs

A merged version of multiple open-source German speech datasets.

Stars: ✭ 21 (-40%)

Mutual labels: speech-recognition, speech-to-text

Airflow Toolkit

On day 1 of any Airflow project, you can spin up a local desktop Kubernetes Airflow environment and one in Google Cloud Composer with tested data pipelines (DAGs) 🖥 >> [ 🚀, 🚢 ]

Stars: ✭ 51 (+45.71%)

Mutual labels: composer, google-cloud

TF-Speech-Recognition-Challenge-Solution

Source code of the model used in the TensorFlow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in the top 5% on the private leaderboard.

Stars: ✭ 58 (+65.71%)

Mutual labels: speech, speech-recognition

revai-python-sdk

Rev AI Python SDK

Stars: ✭ 35 (+0%)

Mutual labels: speech-recognition, speech-to-text

speech-to-text-code-pattern

React app using the Watson Speech to Text service to transform voice audio into written text.

Stars: ✭ 37 (+5.71%)

Mutual labels: speech-recognition, speech-to-text

Pytorch Asr

ASR with PyTorch

Stars: ✭ 124 (+254.29%)

Mutual labels: speech, speech-recognition

Php Ffmpeg Video Streaming

📼 Package media content for online streaming (DASH and HLS) using FFmpeg

Stars: ✭ 246 (+602.86%)

Mutual labels: ffmpeg, google-cloud

leopard

On-device speech-to-text engine powered by deep learning

Stars: ✭ 354 (+911.43%)

Mutual labels: speech-recognition, speech-to-text

revai-java-sdk

Rev.ai Java SDK

Stars: ✭ 16 (-54.29%)

Mutual labels: speech-recognition, speech-to-text

web-voice-processor

A library for real-time voice processing in web browsers

Stars: ✭ 69 (+97.14%)

Mutual labels: speech-recognition, speech-to-text

speech-recognition-evaluation

Evaluate results from ASR/Speech-to-Text quickly

Stars: ✭ 25 (-28.57%)

Mutual labels: speech-recognition, speech-to-text

React.ai

It recognizes your speech, and a trained AI bot responds (e.g., customer service, personal assistant) using a machine learning API (DialogFlow, apiai), speech recognition, GraphQL, Next.js, React, and Redux

Stars: ✭ 38 (+8.57%)

Mutual labels: speech-recognition, speech-to-text

speechmatics-python

Python library and CLI for Speechmatics

Stars: ✭ 24 (-31.43%)

Mutual labels: speech-recognition, speech-to-text

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (+0%)

Mutual labels: speech-recognition, speech-to-text

DeepSpeech-API

This code enables users to use Mozilla's DeepSpeech model through a web browser.

Stars: ✭ 31 (-11.43%)

Mutual labels: speech-recognition, speech-to-text

AmazonSpeechTranslator

End-to-end Solution for Speech Recognition, Text Translation, and Text-to-Speech for iOS using Amazon Translate and Amazon Polly as AWS Machine Learning managed services.

Stars: ✭ 50 (+42.86%)

Mutual labels: speech-recognition, speech-to-text

Inimesed

An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.

Stars: ✭ 65 (+85.71%)

Mutual labels: speech-recognition, speech-to-text

octopus

On-device speech-to-index engine powered by deep learning.

Stars: ✭ 30 (-14.29%)

Mutual labels: speech-recognition, speech-to-text

rnnt decoder cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.