All Categories → Machine Learning → speech-recognition

Top 326 speech-recognition open source projects

Cortex M Kws

Cortex M KWS example with Tengine Lite.

✭ 45

c tensorflow cnn speech-recognition cortex-m

Formant Analyzer

iOS application for finding formants in spoken sounds

✭ 43

swift language ios app speech-recognition application speech-processing

Avsr Deep Speech

Google Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab

✭ 43

python deep-learning audio lstm speech-recognition

Artyom.js

A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.

✭ 1,011

javascript speech-recognition speech-to-text speech-synthesis recognition voice-commands

Pncc

A implementation of Power Normalized Cepstral Coefficients: PNCC

✭ 40

python deep-learning speech-recognition speech-processing

Voice

🎤 React Native Voice Recognition library for iOS and Android (Online and Offline Support)

✭ 993

android ios react-native speech-recognition voice-recognition

Discordspeechbot

A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.

✭ 35

javascript speech-recognition discord-bot speech speech-to-text voice-commands

Itri Speech Recognition Dataset Generation

Automatic Speech Recognition Dataset Generation

✭ 32

jupyter-notebook speech-recognition automatic

Arvutaja

An Android app for voice actions in Estonian and English

✭ 28

java android speech-recognition

Rhasspy

Rhasspy voice assistant for offline home automation

✭ 851

html home-assistant speech-recognition russian

Kaldi Gstreamer Server

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.

✭ 935

python speech-recognition

Assistant Client

Инструмент для тестирования и отладки СanvasApps c семейством Виртуальных Ассистентов "Салют"

✭ 26

javascript speech-recognition

Wavenet Stt

An end-to-end speech recognition system with Wavenet. Built using C++ and python.

✭ 18

python python3 tensorflow speech-recognition wavenet

Speechpy

💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/

✭ 833

python speech-recognition feature-extraction

Kur

Descriptive Deep Learning

✭ 811

python deep-learning machine-learning neural-network deep-neural-networks neural-networks speech-recognition speech-to-text image-recognition deep-learning-tutorial

Espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

✭ 808

python pytorch speech-recognition asr kaldi end-to-end

Stephanie Va

Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.

✭ 772

python speech-recognition speech-to-text personal-assistant

Sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

✭ 764

python deep-learning pytorch audio artificial-intelligence convolutional-neural-networks neural-networks cnn speech-recognition audio-processing signal-processing asr speech-processing filtering waveform

Pykaldi

A Python wrapper for Kaldi

✭ 756

python numpy wrapper speech-recognition speech language-model feature-extraction asr kaldi

Eesen

The official repository of the Eesen project

✭ 738

tensorflow speech-recognition speech-to-text asr kaldi ctc

Annyang

💬 Speech recognition for your site

✭ 6,216

javascript HTML hacktoberfest speech-recognition speech speech-to-text voice

Adapt

Adapt Intent Parser

✭ 690

python open-source opensource speech-recognition speech-to-text

Wav2letter

Facebook AI Research's Automatic Speech Recognition Toolkit

✭ 5,907

C++python shell CMake perl c Dockerfile deep-learning speech-recognition end-to-end wav2letter

Awesome Diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

✭ 673

deep-learning machine-learning awesome awesome-list speech-recognition speech-processing

Speech recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

✭ 5,999

python shell audio speech-recognition speech-to-text

Speech Emotion Analyzer

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

✭ 633

python3 jupyter-notebook deep-learning data-science keras neural-network natural-language-processing deep-neural-networks speech-recognition speech voice natural-language-understanding emotion

Libreasr

💬 An On-Premises, Streaming Speech Recognition System

✭ 633

python deep-learning pytorch speech-recognition asr

Vad

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

✭ 622

matlab data lstm speech-recognition attention speech dnn

Wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

✭ 617

python pytorch speech-recognition transformer asr

Athena

an open-source implementation of sequence-to-sequence based speech processing engine

✭ 542

python tensorflow deployment speech-recognition transformer unsupervised-learning tts speech-synthesis asr sequence-to-sequence ctc

Speech recognition

中文语音识别

✭ 534

python tensorflow chinese speech-recognition

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

✭ 532

javascript node speech-recognition speech speech-to-text alexa voice-recognition voice-control

Ctcdecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.

✭ 529

python speech-recognition recurrent-neural-networks opencl language-model ctc beam-search

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

✭ 522

jupyter-notebook pytorch speech-recognition speech-to-text pretrained-models english asr

Java Speech Api

The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.

✭ 490

java api google speech-recognition speech speech-to-text speech-synthesis recognition

Mycroft Precise

A lightweight, simple-to-use, RNN wake word listener

✭ 481

python raspberry-pi speech-recognition embedded-systems voice-recognition voice-control

Speech To Text Benchmark

speech to text benchmark framework

✭ 481

python deep-learning deep-neural-networks privacy speech-recognition offline speech-to-text voice-recognition

Rhasspy

Offline private voice assistant for many human languages

✭ 458

shell privacy home-assistant speech-recognition voice-commands

Speech Demo

语音api示例

✭ 454

java rest-api speech-recognition speech-to-text baidu

Uspeech

Speech recognition toolkit for the arduino

✭ 448

python arduino speech-recognition speech-processing signal

Voice Overlay Ios

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

✭ 440

swift ios search speech-recognition input speech-to-text voice overlay permissions chatbots conversation voice-recognition conversational-ui

Asrt speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

✭ 4,943

python tensorflow keras cnn speech-recognition speech-to-text ctc chinese-speech-recognition asrt

Specaugment

A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain

✭ 408

python pytorch tensorflow speech-recognition speech data-augmentation

Rhino

On-device speech-to-intent engine powered by deep learning

✭ 406

javascript python c android deep-learning nodejs ios raspberry-pi iot speech-recognition speech-to-text natural-language-understanding voice-recognition voice-commands cortex-m voice-control

Neural sp

End-to-end ASR/LM implementation with PyTorch

✭ 408

python pytorch streaming speech-recognition transformer attention-mechanism attention seq2seq speech language-model asr sequence-to-sequence ctc

Tensorflowasr

⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

✭ 400

python tensorflow speech-recognition speech-to-text ctc

Ctcwordbeamsearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.

✭ 398

tensorflow algorithm speech-recognition recurrent-neural-networks language-model decoder text-recognition ctc

Free Spoken Digit Dataset

A free audio dataset of spoken digits. Think MNIST for audio.

✭ 396

python machine-learning audio dataset speech-recognition mnist

Awesome Kaldi

This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )

✭ 393

awesome-list speech-recognition speech speech-to-text kaldi

Nmtpytorch

Sequence-to-Sequence Framework in PyTorch

✭ 392

jupyter-notebook deep-learning pytorch cnn speech-recognition seq2seq asr neural-machine-translation nmt

Cheetah

On-device streaming speech-to-text engine powered by deep learning

✭ 383

python c android deep-learning ios machine-learning raspberry-pi iot webassembly arm speech-recognition offline speech-to-text asr voice-recognition

Zamia Speech

Open tools and data for cloudless automatic speech recognition

✭ 374

python speech-recognition language-model asr kaldi

Subsync

Subtitle Speech Synchronizer

✭ 379

speech-recognition synchronization subtitles

Alan Sdk Web

Alan AI Web SDK adds a voice assistant or chatbot to your app. Supports React, Angular, Vue, Ember, JavaScript, Electron.

✭ 368

machine-learning sdk chatbot speech-recognition text-to-speech voice voice-commands voice-control

Espnet

End-to-End Speech Processing Toolkit

✭ 4,533

python shell perl matlab Dockerfile M deep-learning pytorch speech-recognition speech-synthesis machine-translation chainer kaldi end-to-end voice-conversion speech-separation speech-enhancement speech-translation

Deepspeech Examples

Examples of how to use or integrate DeepSpeech

✭ 356

python nodejs machine-learning dotnet examples speech-recognition

Libfaceid

libfaceid is a research framework for prototyping of face recognition solutions. It seamlessly integrates multiple detection, recognition and liveness models w/ speech synthesis and speech recognition.

✭ 354

python deep-learning machine-learning tensorflow keras raspberry-pi opencv scikit-learn speech-recognition face-detection face-recognition pose-estimation speech-synthesis dlib

Brevitas

Brevitas: quantization-aware training in PyTorch

✭ 343

python pytorch neural-networks speech-recognition image-classification fpga text-to-speech quantization

Deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

✭ 18,680

C++python c shell C#swift deep-learning machine-learning tensorflow neural-networks embedded speech-recognition offline speech-to-text deepspeech on-device

J.a.r.v.i.s

python powered Intelligent System

✭ 325

python python3 hacktoberfest youtube speech-recognition requests

121-180 of 326 speech-recognition projects

first

‹

›