The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain, users can easily create speech processing systems for tasks including speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+356.6%)

Mutual labels: speech, speech-recognition, speech-to-text

sova-asr

SOVA ASR (Automatic Speech Recognition)

Stars: ✭ 123 (+132.08%)

Mutual labels: speech, speech-recognition, speech-to-text

voce-browser

Voice Controlled Chromium Web Browser

Stars: ✭ 34 (-35.85%)

Mutual labels: speech-recognition, speech-to-text, voice-assistant

speechrec

A simple speech recognition app using the Web Speech API interfaces

Stars: ✭ 18 (-66.04%)

Mutual labels: speech-recognition, speech-to-text, speech-api

leon

🧠 Leon is your open-source personal assistant.

Stars: ✭ 8,560 (+16050.94%)

Mutual labels: speech-recognition, speech-to-text, voice-assistant

voice gender detection

♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).

Stars: ✭ 51 (-3.77%)

Mutual labels: voice, voice-control, voice-assistant

Rhino

On-device speech-to-intent engine powered by deep learning

Stars: ✭ 406 (+666.04%)

Mutual labels: speech-recognition, speech-to-text, voice-control

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (+67.92%)

Mutual labels: speech, speech-recognition, speech-to-text

opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in the voice stack

Stars: ✭ 21 (-60.38%)

Mutual labels: voice, speech, speech-recognition

Voice Overlay Android

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 189 (+256.6%)

Mutual labels: voice, speech-recognition, speech-to-text

Awesome Kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

Stars: ✭ 393 (+641.51%)

Mutual labels: speech, speech-recognition, speech-to-text

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

Stars: ✭ 490 (+824.53%)

Mutual labels: speech, speech-recognition, speech-to-text

Alan Sdk Flutter

Alan AI Flutter SDK adds a voice assistant or chatbot to your app.

Stars: ✭ 309 (+483.02%)

Mutual labels: voice, speech-recognition, voice-control

Alan Sdk Ios

Alan AI iOS SDK adds a voice assistant or chatbot to your app. Supports Swift, Objective-C.

Stars: ✭ 318 (+500%)

Mutual labels: voice, speech-recognition, voice-control

Edgedict

Working online speech recognition based on RNN Transducer. (Trained model available in the releases.)

Stars: ✭ 205 (+286.79%)

Mutual labels: speech, speech-recognition, speech-to-text

Alan Sdk Android

Alan AI Android SDK adds a voice assistant or chatbot to your app. Supports Java, Kotlin.

Stars: ✭ 278 (+424.53%)

Mutual labels: voice, speech-recognition, voice-control

Deepspeech

A PaddlePaddle implementation of ASR.

Stars: ✭ 1,219 (+2200%)

Mutual labels: speech, speech-recognition, speech-to-text

Voice Overlay Ios

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 440 (+730.19%)

Mutual labels: voice, speech-recognition, speech-to-text

Alan Sdk Ionic

Alan AI Ionic SDK adds a voice assistant or chatbot to your app. Supports React, Angular.

Stars: ✭ 287 (+441.51%)

Mutual labels: voice, speech-recognition, voice-control

Discordspeechbot

A speech-to-text bot for Discord with music commands and more, built with NodeJS. Ideal for controlling your Discord server using voice commands; it can also be useful for hearing-impaired people.

Stars: ✭ 35 (-33.96%)

Mutual labels: speech, speech-recognition, speech-to-text

Naomi

The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!

Stars: ✭ 171 (+222.64%)

Mutual labels: voice, speech-recognition, speech-to-text

Alan Sdk Pcf

Alan AI Power Apps SDK adds a voice assistant or chatbot to your Microsoft Power Apps project.

Stars: ✭ 128 (+141.51%)

Mutual labels: voice, speech-recognition, voice-control

Speech Emotion Analyzer

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

Stars: ✭ 633 (+1094.34%)

Mutual labels: voice, speech, speech-recognition

Alan Sdk Web

Alan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.

Stars: ✭ 368 (+594.34%)

Mutual labels: voice, speech-recognition, voice-control

Zzz Retired openstt

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

Stars: ✭ 146 (+175.47%)

Mutual labels: voice, speech-recognition, speech-to-text

ASR-Audio-Data-Links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 179 (+237.74%)

Mutual labels: speech, speech-recognition, speech-to-text

wav2vec2-live

Live speech recognition using Facebook's wav2vec 2.0 model.

Stars: ✭ 205 (+286.79%)

Mutual labels: speech, speech-recognition, speech-to-text

Asr audio data links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Stars: ✭ 128 (+141.51%)

Mutual labels: speech, speech-recognition, speech-to-text

kaldi ag training

Docker image and scripts for training fine-tuned or completely personal Kaldi speech models, particularly for use with kaldi-active-grammar.

Stars: ✭ 14 (-73.58%)

Mutual labels: speech, speech-recognition, speech-to-text

Syn Speech

Syn.Speech is a flexible, speaker-independent continuous speech recognition engine for Mono and the .NET Framework

Stars: ✭ 57 (+7.55%)

Mutual labels: speech, speech-recognition, speech-to-text

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project.

Stars: ✭ 11,151 (+20939.62%)

Mutual labels: speech, speech-recognition, speech-to-text

Tacotron asr

Speech Recognition Using Tacotron

Stars: ✭ 165 (+211.32%)

Mutual labels: speech, speech-recognition, speech-to-text

Aimybox Android Assistant

Embeddable custom voice assistant for Android applications

Stars: ✭ 139 (+162.26%)

Mutual labels: voice, speech-recognition

Speechtotext Websockets Javascript

SDK & Sample to do speech recognition using websockets in Javascript

Stars: ✭ 191 (+260.38%)

Mutual labels: speech, speech-recognition

Alan Sdk Reactnative

Alan React Native SDK adds a voice assistant or chatbot to your app.

Stars: ✭ 138 (+160.38%)

Mutual labels: voice, voice-control

Siricontrol System

Control anything with Siri voice commands.

Stars: ✭ 180 (+239.62%)

Mutual labels: speech, voice-control

Voice Gender

Gender recognition by voice and speech analysis

Stars: ✭ 248 (+367.92%)

Mutual labels: voice, speech

karen

open-source voice assistant

Stars: ✭ 19 (-64.15%)

Mutual labels: voice, voice-assistant

Caster

Dragonfly-Based Voice Programming and Accessibility Toolkit

Stars: ✭ 242 (+356.6%)

Mutual labels: voice, voice-control

End2end Asr Pytorch

End-to-End Automatic Speech Recognition on PyTorch

Stars: ✭ 175 (+230.19%)

Mutual labels: speech, speech-recognition

Kerasdeepspeech

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

Stars: ✭ 245 (+362.26%)

Mutual labels: speech, speech-to-text

SignDetect

This application is developed to help speechless people interact with others with ease. It detects voice and converts the input speech into a sign language based video.

Stars: ✭ 21 (-60.38%)

Mutual labels: voice, speech

Avpi

Open source voice command macro software

Stars: ✭ 130 (+145.28%)

Mutual labels: voice, speech

picovoice

The end-to-end platform for building voice products at scale

Stars: ✭ 316 (+496.23%)

Mutual labels: voice, speech-recognition

Voicebook

🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).

Stars: ✭ 236 (+345.28%)

Mutual labels: voice, voice-control

brasiltts

Brasil TTS is a set of Brazilian Portuguese speech synthesizers that read screens aloud for people with visual impairments. It converts text to audio, giving blind or low-vision people access to the content displayed on the screen. Although the main target audience of text-to-speech systems such as Brasil TTS is made up of…

Stars: ✭ 34 (-35.85%)

Mutual labels: voice, voice-assistant

1-60 of 667 similar projects