All Categories → Machine Learning → speech-recognition

Top 326 speech-recognition open source projects

On-device speech-to-index engine powered by deep learning.

✭ 30

python typescript swift javascript java c audio voice-recognition speech-recognition speech-to-text voice-search speech-to-index

awesome-end2end-speech-recognition

💬 A list of End-to-End speech recognition, including papers, codes and other materials

✭ 49

code speech-recognition awesome-list toolkits papers curated-list end-to-end-speech-recognition

It's project that contains different applications developed with Swift 5.7 👨‍💻👩🏼‍💻🧑🏿‍💻

The end-to-end platform for building voice products at scale

✭ 316

typescript java javascript C#swift python nodejs android ios machine-learning microcontroller embedded ai deep-learning offline dotnet voice voice-commands voice-recognition speech-recognition neural-networks

opensource-voice-tools

A repo listing known open source voice tools, ordered by where they sit in the voice stack

✭ 21

TeX chatbot voice corpus speech conversational-ui tts speech-recognition stt asr

web-voice-processor

A library for real-time voice processing in web browsers

✭ 69

typescript javascript HTML python real-time browser worker realtime voice-commands microphone speech-recognition webaudio-api pcm web-browser speech-to-text audio-processing wake-word-detection downsampling voice-processing

Rev.ai Java SDK

✭ 16

java sdk captions speech-recognition speech-to-text rev revai transcription-job

CEP is a software platform designed for users that want to learn or rapidly prototype using standard A.I. components.

✭ 140

javascript python CSS HTML SCSS Less raspberry-pi opencv iot natural-language-processing computer-vision deep-learning smarthome artificial-intelligence speech-recognition edge-computing lego-mindstorms oak-d speech-generation

react-native-spokestack

Spokestack: give your React Native app a voice interface!

Tensor2tensor experiment with SpecAugment

✭ 46

python speech-recognition data-augmentation tensor2tensor specaugment

中文语音识别系列，读者可以借助它快速训练属于自己的中文语音识别模型，或直接使用预训练模型测试效果。

✭ 179

python javascript HTML deep-learning pytorch speech-recognition pretrained-models

Pytorch implementation of subband decomposition

✭ 63

HTML python deep-learning signal-processing speech-recognition speech-processing music-source-separation speech-enhancement

ASR-Audio-Data-Links

A list of publically available audio data that anyone can download for ASR or other speech activities

✭ 179

shell data speech speech-recognition audio-data speech-to-text asr speech-activities

multilingual kws

Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus

✭ 122

Jupyter Notebook python speech-recognition keyword-spotting wake-word-detection query-by-example kws keyword-search few-shot-learning

"Caption This" is an iOS app that adds real-time captions to videos for Instagram Stories

✭ 12

javascript objective c c swift ruby java ios react-native speech-recognition

A live speech recognition using Facebooks wav2vec 2.0 model.

✭ 205

python pyaudio speech speech-recognition speech-to-text asr wav2vec wav2vec2

On-device speech-to-text engine powered by deep learning

✭ 354

python java C#typescript rust go voice-recognition speech-recognition automatic-speech-recognition speech-to-text transcription stt asr voice-to-text on-device

good-speech-web-client

Practice your speech level in any language using speech recognition

✭ 26

javascript HTML CSS react redux pronunciation speech-recognition

revai-python-sdk

Rev AI Python SDK

✭ 35

python Makefile Dockerfile sdk realtime captions speech-recognition speech-to-text rev transcription-job

A Polymer 3+ webcomponent / button for doing speech recognition

✭ 54

javascript HTML polymer button speech-recognition automatic-speech-recognition polymer2 webcomponent

Voice control for your websites and applications

✭ 53

javascript voice speech speech-recognition speech-to-text voice-control voice-assistant speech-api anycontrol

TF-Speech-Recognition-Challenge-Solution

Source code of the model used in Tensorflow Speech Recognition Challenge (https://www.kaggle.com/c/tensorflow-speech-recognition-challenge). The solution ranked in top 5% in private leaderboard.

✭ 58

Jupyter Notebook python shell raspberry-pi deep-learning neural-network tensorflow scikit-learn speech recurrent-neural-networks speech-recognition ensemble-learning convolutional-neural-networks audio-recognition

TextNormalizationCoveringGrammars

Covering grammars for English and Russian text normalization

✭ 60

Makefile nlp text-to-speech speech-recognition

A merged version of multiple open-source German speech datasets.

✭ 21

Jupyter Notebook python shell corpus dataset speech-recognition speech-to-text asr

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

✭ 94

forth python shell Dockerfile TeX Jupyter Notebook speech-recognition speech-processing audio-segmentation gender-classification speaker-diarization synthetic-speech-detection topic-detection speech-seperation speaker-identification accent-detection speech-transcription speech-annotation

🎙️ Handsfree Audio Development Interface

✭ 84

kotlin shell accessibility intellij speech speech-synthesis speech-recognition intellij-plugin voice-control voice-assistant vosk-api

301-326 of 326 speech-recognition projects