PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop

Stars: ✭ 2,934 (+263.12%)

Mutual labels: speech-recognition

revai-node-sdk

Node.js SDK for the Rev AI API

Stars: ✭ 21 (-97.4%)

Mutual labels: speech-recognition

Asrt speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

Stars: ✭ 4,943 (+511.76%)

Mutual labels: speech-recognition

deep avsr

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

Stars: ✭ 104 (-87.13%)

Mutual labels: speech-recognition

HotVoice

Adds Speech Recognition support to AutoHotkey, via a C# DLL

Stars: ✭ 41 (-94.93%)

Mutual labels: speech-recognition

Ctcwordbeamsearch

Connectionist Temporal Classification (CTC) decoder with dictionary and language model for TensorFlow.

Stars: ✭ 398 (-50.74%)

Mutual labels: speech-recognition

leopard-chat-ui-teneo

Leopard Chat UI - A Teneo Chat Client based on Vue and Vuetify

Stars: ✭ 65 (-91.96%)

Mutual labels: asr

pytorch audio

audio processing module for pytorch:stft, istft

Stars: ✭ 33 (-95.92%)

Mutual labels: speech-recognition

cobra

On-device voice activity detection (VAD) powered by deep learning.

Stars: ✭ 76 (-90.59%)

Mutual labels: speech-recognition

Speech-Command-Recognition-with-Capsule-Network

Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.

Stars: ✭ 20 (-97.52%)

Mutual labels: speech-recognition

Chinese-automatic-speech-recognition

Chinese speech recognition

Stars: ✭ 147 (-81.81%)

Mutual labels: speech-recognition

Recording-Bot

A bot built to record and transcribe audio fragments from Discord.

Stars: ✭ 22 (-97.28%)

Mutual labels: speech-recognition

scripty

Speech to text bot for Discord using Mozilla's DeepSpeech

Stars: ✭ 14 (-98.27%)

Mutual labels: speech-recognition

timit-preprocessor

Extract mfcc vectors and phones from TIMIT dataset

Stars: ✭ 14 (-98.27%)

Mutual labels: speech-recognition

Awesome Diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Stars: ✭ 673 (-16.71%)

Mutual labels: speech-recognition

Ctcdecoder

Connectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.

Stars: ✭ 529 (-34.53%)

Mutual labels: speech-recognition

dropclass speaker

DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020

Stars: ✭ 20 (-97.52%)

Mutual labels: kaldi

speech-recognition-transfer-learning

Speech command recognition DenseNet transfer learning from UrbanSound8k in keras tensorflow

Stars: ✭ 18 (-97.77%)

Mutual labels: speech-recognition

mongolian-nlp

Useful resources for Mongolian NLP

Stars: ✭ 119 (-85.27%)

Mutual labels: speech-recognition

learning invariances in speech recognition

In this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses convolutional neural networks (CNN) because of their intrinsic nature of learning correlated represen- tations as is the speech. In particular I develop different CNNs trained on the Google Speech Command Dataset…

Stars: ✭ 15 (-98.14%)

Mutual labels: speech-recognition

Tensorflowasr

⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords

Stars: ✭ 400 (-50.5%)

Mutual labels: speech-recognition

klaam

Arabic speech recognition, classification and text-to-speech.

Stars: ✭ 151 (-81.31%)

Mutual labels: asr

Sonus

💬 /so.nus/ STT (speech to text) for Node with offline hotword detection

Stars: ✭ 532 (-34.16%)

Mutual labels: speech-recognition

favorite-research-papers

Listing my favorite research papers 📝 from different fields as I read them.

Stars: ✭ 12 (-98.51%)

Mutual labels: speech-recognition

Ajax-Chat

Ajax Chat is a complete web chat in javascript, ajax, php and mysql compatible with Phonegap

Stars: ✭ 19 (-97.65%)

Mutual labels: end-to-end

VoiceDictation

迅飞语音听写 WebAPI - 把语音(≤60秒)转换成对应的文字信息，让机器能够“听懂”人类语言，相当于给机器安装上“耳朵”，使其具备“能听”的功能。

Stars: ✭ 36 (-95.54%)

Mutual labels: speech-recognition

Free Spoken Digit Dataset

A free audio dataset of spoken digits. Think MNIST for audio.

Stars: ✭ 396 (-50.99%)

Mutual labels: speech-recognition

QuantumSpeech-QCNN

IEEE ICASSP 21 - Quantum Convolution Neural Networks for Speech Processing and Automatic Speech Recognition

Stars: ✭ 71 (-91.21%)

Mutual labels: speech-recognition

StageMate

StageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.

Stars: ✭ 60 (-92.57%)

Mutual labels: speech-recognition

voicekit-examples

Examples on how to use Tinkoff Voicekit

Stars: ✭ 35 (-95.67%)

Mutual labels: speech-recognition

VoiceBridge

VoiceBridge - an AI-TOOLKIT Open Source C++ Speech Recognition Toolkit

Stars: ✭ 17 (-97.9%)

Mutual labels: speech-recognition

Multi-Hotword Spotting

Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?

Stars: ✭ 31 (-96.16%)

Mutual labels: speech-recognition

download audioset

📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).

Stars: ✭ 53 (-93.44%)

Mutual labels: speech-recognition

spokestack-tray-android

A UI component that makes it easy to add voice interaction to your app.

Stars: ✭ 13 (-98.39%)

Mutual labels: asr

kaldi-readers-for-tensorflow

readers that enable reading kaldi ark in tensorflow

Stars: ✭ 16 (-98.02%)

Mutual labels: asr

wave2vec-recognize-docker

Wave2vec 2.0 Recognize pipeline

Stars: ✭ 30 (-96.29%)

Mutual labels: asr

musicologist

Music advice from a conversational interface powered by Algolia

Stars: ✭ 19 (-97.65%)

Mutual labels: speech-recognition

ocaml-otr

Off-the-record (OTR) messaging protocol, purely in OCaml

Stars: ✭ 39 (-95.17%)

Mutual labels: end-to-end

Stephanie Va

Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.

Stars: ✭ 772 (-4.46%)

Mutual labels: speech-recognition

leon

🧠 Leon is your open-source personal assistant.

Stars: ✭ 8,560 (+959.41%)

Mutual labels: speech-recognition

Speech recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Stars: ✭ 5,999 (+642.45%)

Mutual labels: speech-recognition

Speech Denoising Wavenet

A neural network for end-to-end speech denoising