All Projects → Speechbrain.github.io → Similar Projects or Alternatives

1312 Open source projects that are alternatives of or similar to Speechbrain.github.io

Kerasdeepspeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Stars: ✭ 245 (+1.24%)

Mutual labels: neural-networks, deeplearning, speech, speech-to-text

KeenASR-Android-PoC

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Stars: ✭ 21 (-91.32%)

Mutual labels: speech, speech-recognition, speech-to-text

Deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Stars: ✭ 18,680 (+7619.01%)

Mutual labels: neural-networks, speech-recognition, speech-to-text

Zzz Retired openstt

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

Stars: ✭ 146 (-39.67%)

Mutual labels: speech-recognition, speech-to-text, speech-processing

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+4507.85%)

Mutual labels: speech-recognition, speech, speech-to-text

kaldi ag training

Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-94.21%)

Mutual labels: speech, speech-recognition, speech-to-text

Voice activity detection

Voice Activity Detection based on Deep Learning & TensorFlow

Stars: ✭ 132 (-45.45%)

Mutual labels: deeplearning, speech-recognition, speech

anycontrol

Voice control for your websites and applications

Stars: ✭ 53 (-78.1%)

Mutual labels: speech, speech-recognition, speech-to-text

Speech Denoising Wavenet

A neural network for end-to-end speech denoising

Stars: ✭ 516 (+113.22%)

Mutual labels: neural-networks, speech, speech-processing

Tacotron asr

Speech Recognition Using Tacotron

Stars: ✭ 165 (-31.82%)

Mutual labels: speech-recognition, speech, speech-to-text

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+2468.6%)

Mutual labels: speech-recognition, speech, speech-to-text

speechrec

a simple speech recognition app using the Web Speech API Interfaces

Stars: ✭ 18 (-92.56%)

Mutual labels: speech-recognition, speech-to-text, speech-processing

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (-26.03%)

Mutual labels: speech, speech-recognition, speech-to-text

speech to text

how to use the Google Cloud Speech API to transcribe audio/video files.

Stars: ✭ 35 (-85.54%)

Mutual labels: speech, speech-recognition, speech-to-text

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (+119.83%)

Mutual labels: speech-recognition, speech, speech-to-text

Discordspeechbot

A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.

Stars: ✭ 35 (-85.54%)

Mutual labels: speech-recognition, speech, speech-to-text

Openasr

A pytorch based end2end speech recognition system.

Stars: ✭ 69 (-71.49%)

Mutual labels: speech-recognition, speech, speech-to-text

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Stars: ✭ 53 (-78.1%)

Mutual labels: speech-recognition, speech-to-text, speech-processing

Asr audio data links

A list of publically available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (-47.11%)

Mutual labels: speech-recognition, speech, speech-to-text

deepspeech.mxnet

A MXNet implementation of Baidu's DeepSpeech architecture

Stars: ✭ 82 (-66.12%)

Mutual labels: speech, speech-recognition, speech-to-text

UniSpeech

UniSpeech - Large Scale Self-Supervised Learning for Speech

Stars: ✭ 224 (-7.44%)

Mutual labels: speech, speech-recognition, speech-processing

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+102.48%)

Mutual labels: speech-recognition, speech, speech-to-text

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (-49.17%)

Mutual labels: speech, speech-recognition, speech-to-text

Syn Speech

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

Stars: ✭ 57 (-76.45%)

Mutual labels: speech-recognition, speech, speech-to-text

Automatic Speech Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

Stars: ✭ 192 (-20.66%)

Mutual labels: neural-networks, speech-recognition, speech-to-text

Edgedict

Working online speech recognition based on RNN Transducer. ( Trained model release available in release )

Stars: ✭ 205 (-15.29%)

Mutual labels: speech-recognition, speech, speech-to-text

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (-57.44%)

Mutual labels: neural-networks, speech-recognition, speech-to-text

Kur

Descriptive Deep Learning

Stars: ✭ 811 (+235.12%)

Mutual labels: neural-networks, speech-recognition, speech-to-text

wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

Stars: ✭ 205 (-15.29%)

Mutual labels: speech, speech-recognition, speech-to-text

Lingvo

Stars: ✭ 2,361 (+875.62%)

Mutual labels: speech-recognition, speech, speech-to-text

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (+247.52%)

Mutual labels: speech-recognition, speech-to-text, speech-processing

spokestack-ios

Spokestack: give your iOS app a voice interface!

Stars: ✭ 27 (-88.84%)

Mutual labels: speech-recognition, speech-to-text, speech-processing

Awesome Kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Stars: ✭ 393 (+62.4%)

Mutual labels: speech-recognition, speech, speech-to-text

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (-63.22%)

Mutual labels: speech, speech-recognition, speech-to-text

Sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

Stars: ✭ 764 (+215.7%)

Mutual labels: neural-networks, speech-recognition, speech-processing

Wav2letter

Speech Recognition model based off of FAIR research paper built using Pytorch.

Stars: ✭ 78 (-67.77%)

Mutual labels: neural-networks, speech-recognition, speech-to-text

Deepspeech

A PaddlePaddle implementation of ASR.

Stars: ✭ 1,219 (+403.72%)

Mutual labels: speech-recognition, speech, speech-to-text

Speech And Text

Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字（PocketSphinx、百度 API、科大讯飞 API）和文字转语音（pyttsx3）

Stars: ✭ 102 (-57.85%)

Mutual labels: speech-recognition, speech-to-text

Wav2letter.pytorch

A fully convolution-network for speech-to-text, built on pytorch.

Stars: ✭ 104 (-57.02%)

Mutual labels: speech-recognition, speech-to-text

Self Supervised Speech Recognition

speech to text with self-supervised learning based on wav2vec 2.0 framework

Stars: ✭ 106 (-56.2%)

Mutual labels: speech-recognition, speech-to-text

Openseq2seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Stars: ✭ 1,378 (+469.42%)

Mutual labels: speech-recognition, speech-to-text

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+511.16%)

Mutual labels: speech-recognition, speech

Python Speech recognition

A simple example for use speech recognition baidu api with python.

Stars: ✭ 106 (-56.2%)

Mutual labels: speech-recognition, speech

Kalliope

Kalliope is a framework that will help you to create your own personal assistant.

Stars: ✭ 1,509 (+523.55%)

Mutual labels: speech-recognition, speech-to-text

Faceswap

Deepfakes Software For All

Stars: ✭ 39,911 (+16392.15%)

Mutual labels: neural-networks, deeplearning

Holobot

HoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.

Stars: ✭ 114 (-52.89%)

Mutual labels: speech-recognition, speech

Nonautoreggenprogress

Tracking the progress in non-autoregressive generation (translation, transcription, etc.)

Stars: ✭ 118 (-51.24%)

Mutual labels: speech-recognition, speech-processing

Vosk Api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Stars: ✭ 1,357 (+460.74%)

Mutual labels: speech-recognition, speech-to-text

Ssd Pytorch

SSD: Single Shot MultiBox Detector pytorch implementation focusing on simplicity

Stars: ✭ 107 (-55.79%)

Mutual labels: neural-networks, deeplearning

Tfg Voice Conversion

Deep Learning-based Voice Conversion system

Stars: ✭ 115 (-52.48%)

Mutual labels: speech, speech-processing

Pytorch Asr

ASR with PyTorch

Stars: ✭ 124 (-48.76%)

Mutual labels: speech-recognition, speech

Tensorflow Ctc Speech Recognition

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).

Stars: ✭ 127 (-47.52%)

Mutual labels: speech-recognition, speech-to-text

Persephone

A tool for automatic phoneme transcription

Stars: ✭ 130 (-46.28%)

Mutual labels: neural-networks, speech-recognition

Fasttext.js

FastText for Node.js

Stars: ✭ 127 (-47.52%)

Mutual labels: neural-networks, deeplearning

Awesome Ai Services

An overview of the AI-as-a-service landscape

Stars: ✭ 133 (-45.04%)

Mutual labels: speech-recognition, speech-to-text

Allosaurus

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Stars: ✭ 135 (-44.21%)

Mutual labels: speech-recognition, speech

Rnn ctc

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Stars: ✭ 220 (-9.09%)

Mutual labels: speech-recognition, speech-to-text

Paddlex

PaddlePaddle End-to-End Development Toolkit（『飞桨』深度学习全流程开发工具）

Stars: ✭ 3,399 (+1304.55%)

Mutual labels: neural-networks, deeplearning

Wavenet vocoder

WaveNet vocoder

Stars: ✭ 1,926 (+695.87%)

Mutual labels: speech, speech-processing

Mit Deep Learning Book Pdf

MIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville

Stars: ✭ 9,859 (+3973.97%)

Mutual labels: neural-networks, deeplearning

1-60 of 1312 similar projects

›

next*5