Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Stars: ✭ 104 (-99.07%)

Mutual labels: speech-recognition, kaldi

open-speech-corpora

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

Stars: ✭ 841 (-92.46%)

Mutual labels: speech-recognition, speech-to-text

kim-voice-assistant

Kim, your personal voice assistant.

Stars: ✭ 70 (-99.37%)

Mutual labels: speech-recognition, speech-to-text

demo vietasr

Vietnamese Speech Recognition

Stars: ✭ 22 (-99.8%)

Mutual labels: speech-recognition, speech-to-text

srvk-eesen-offline-transcriber

Top level code to transcribe English audio/video files into text/subtitles

Stars: ✭ 22 (-99.8%)

Mutual labels: speech-recognition, kaldi

Vosk Server

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

Stars: ✭ 277 (-97.52%)

Mutual labels: speech-recognition, kaldi

revai-node-sdk

Node.js SDK for the Rev AI API

Stars: ✭ 21 (-99.81%)

Mutual labels: speech-recognition, speech-to-text

SpeechToText

Speech To Text in Android

Stars: ✭ 53 (-99.52%)

Mutual labels: speech-recognition, speech-to-text

Speech Aligner

speech-aligner is a tool that generates phoneme-level time alignments between human speech and its transcription.

Stars: ✭ 259 (-97.68%)

Mutual labels: speech, kaldi

Pocketsphinx Python

Python interface to CMU Sphinxbase and Pocketsphinx libraries

Stars: ✭ 298 (-97.33%)

Mutual labels: speech-recognition, speech

Self Supervised Speech Recognition

Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework

Stars: ✭ 106 (-99.05%)

Mutual labels: speech-recognition, speech-to-text

speech-to-text-code-pattern

React app using the Watson Speech to Text service to transform voice audio into written text.

Stars: ✭ 37 (-99.67%)

Mutual labels: speech-recognition, speech-to-text

KaldiBasedSpeakerVerification

Kaldi based speaker verification

Stars: ✭ 43 (-99.61%)

Mutual labels: kaldi, speaker-verification

voce-browser

Voice Controlled Chromium Web Browser

Stars: ✭ 34 (-99.7%)

Mutual labels: speech-recognition, speech-to-text

kaldi-timit-sre-ivector

Develop speaker recognition model based on i-vector using TIMIT database

Stars: ✭ 17 (-99.85%)

Mutual labels: kaldi, speaker-verification

musicologist

Music advice from a conversational interface powered by Algolia

Stars: ✭ 19 (-99.83%)

Mutual labels: speech-recognition, speech-to-text

leon

🧠 Leon is your open-source personal assistant.

Stars: ✭ 8,560 (-23.24%)

Mutual labels: speech-recognition, speech-to-text

htk

HTK Toolkit with Linux 64 bit and Docker support

Stars: ✭ 14 (-99.87%)

Mutual labels: speech-recognition, speech-to-text

Julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

Stars: ✭ 1,258 (-88.72%)

Mutual labels: speech-recognition, speech

Phonetisaurus

Phonetisaurus G2P

Stars: ✭ 277 (-97.52%)

Mutual labels: speech-recognition, speech-to-text

Vosk Android Demo

Offline speech recognition for Android with Vosk library.

Stars: ✭ 271 (-97.57%)

Mutual labels: speech-recognition, kaldi

Css10

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

Stars: ✭ 302 (-97.29%)

Mutual labels: speech, speech-to-text

revai-java-sdk

Rev.ai Java SDK

Stars: ✭ 16 (-99.86%)

Mutual labels: speech-recognition, speech-to-text

Tensorflowasr

⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in TensorFlow 2. Supports languages that can use characters or subwords

Stars: ✭ 400 (-96.41%)

Mutual labels: speech-recognition, speech-to-text

Neural sp

End-to-end ASR/LM implementation with PyTorch

Stars: ✭ 408 (-96.34%)

Mutual labels: speech-recognition, speech

Cheetah

On-device streaming speech-to-text engine powered by deep learning

Stars: ✭ 383 (-96.57%)

Mutual labels: speech-recognition, speech-to-text

Zamia Speech

Open tools and data for cloudless automatic speech recognition

Stars: ✭ 374 (-96.65%)

Mutual labels: speech-recognition, kaldi

Specaugment

An implementation of SpecAugment (introduced by Google Brain) with TensorFlow & PyTorch

Stars: ✭ 408 (-96.34%)

Mutual labels: speech-recognition, speech

Kaldi Gop

Computes the GMM-based Goodness of Pronunciation (GOP). Based on Kaldi.

Stars: ✭ 104 (-99.07%)

Mutual labels: speech-recognition, kaldi

B.e.n.j.i.

B.E.N.J.I.: The Impossible Missions Force's digital assistant

Stars: ✭ 83 (-99.26%)

Mutual labels: speech-recognition, speech-to-text

Voice Overlay Ios

🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI

Stars: ✭ 440 (-96.05%)

Mutual labels: speech-recognition, speech-to-text

Espnet

End-to-End Speech Processing Toolkit

Stars: ✭ 4,533 (-59.35%)

Mutual labels: speech-recognition, kaldi

Rhino

On-device speech-to-intent engine powered by deep learning

Stars: ✭ 406 (-96.36%)

Mutual labels: speech-recognition, speech-to-text

Asrt speechrecognition

A Deep-Learning-Based Chinese Speech Recognition System

Stars: ✭ 4,943 (-55.67%)

Mutual labels: speech-recognition, speech-to-text

Holobot

HoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.

Stars: ✭ 114 (-98.98%)

Mutual labels: speech-recognition, speech

Wav2letter.pytorch

A fully convolutional network for speech-to-text, built on PyTorch.

Stars: ✭ 104 (-99.07%)

Mutual labels: speech-recognition, speech-to-text

Deepspeech Websocket Server

Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments

Stars: ✭ 79 (-99.29%)

Mutual labels: speech-recognition, speech-to-text

Speech To Text Benchmark

speech to text benchmark framework

Stars: ✭ 481 (-95.69%)

Mutual labels: speech-recognition, speech-to-text

Silero Models

Silero Models: pre-trained STT models and benchmarks made embarrassingly simple

Stars: ✭ 522 (-95.32%)

Mutual labels: speech-recognition, speech-to-text

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (-99.08%)

Mutual labels: speech-recognition, speech-to-text

Vad

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stars: ✭ 622 (-94.42%)

Mutual labels: speech-recognition, speech

Nodejs Speech

Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.

Stars: ✭ 545 (-95.11%)

Mutual labels: speech, speech-to-text

Speech And Text

Speech to text (PocketSphinx, iFlytek API, Baidu API) and text to speech (pyttsx3)

Stars: ✭ 102 (-99.09%)

Mutual labels: speech-recognition, speech-to-text

Deepspeech

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

Stars: ✭ 18,680 (+67.52%)

Mutual labels: speech-recognition, speech-to-text

Speech Demo

Speech API examples

Stars: ✭ 454 (-95.93%)

Mutual labels: speech-recognition, speech-to-text

Openseq2seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Stars: ✭ 1,378 (-87.64%)

Mutual labels: speech-recognition, speech-to-text

Wav2letter

Speech recognition model based on the FAIR research paper, built using PyTorch.

Stars: ✭ 78 (-99.3%)

Mutual labels: speech-recognition, speech-to-text

Adapt

Adapt Intent Parser

Stars: ✭ 690 (-93.81%)

Mutual labels: speech-recognition, speech-to-text

Stephanie Va

Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks, imitating much of a virtual assistant's work.

Stars: ✭ 772 (-93.08%)

Mutual labels: speech-recognition, speech-to-text

Espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

Stars: ✭ 808 (-92.75%)

Mutual labels: speech-recognition, kaldi

Artyom.js

A voice control, voice command, speech recognition, and speech synthesis JavaScript library. Create your own Siri, Google Now, or Cortana with Google Chrome within your website.

Stars: ✭ 1,011 (-90.93%)

Mutual labels: speech-recognition, speech-to-text

Dc tts

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

Stars: ✭ 1,017 (-90.88%)

Mutual labels: speech, speech-to-text

Speech recognition

Speech recognition module for Python, supporting several engines and APIs, online and offline.

Stars: ✭ 5,999 (-46.2%)

Mutual labels: speech-recognition, speech-to-text

Kur

Descriptive Deep Learning

Stars: ✭ 811 (-92.73%)

Mutual labels: speech-recognition, speech-to-text
