438 open source projects that are alternatives to or similar to kaldi-long-audio-alignment

Angle

⦠ Angle: new speakable syntax for python 💡

Stars: ✭ 61 (+190.48%)

Mutual labels: speech-recognition, speech-to-text

syn-speech-samples

An application that demonstrates the usage of the Syn.Speech library for speech recognition

Stars: ✭ 24 (+14.29%)

Mutual labels: speech-recognition, asr

Discordspeechbot

A speech-to-text bot for Discord with music commands and more, built with Node.js. Ideal for controlling your Discord server using voice commands; it can also be useful for hearing-impaired people.

Stars: ✭ 35 (+66.67%)

Mutual labels: speech-recognition, speech-to-text

2018-dlsl

UPC Deep Learning for Speech and Language 2018

Stars: ✭ 18 (-14.29%)

Mutual labels: speech-recognition, automatic-speech-recognition

Asr benchmark

Program to benchmark various speech recognition APIs

Stars: ✭ 71 (+238.1%)

Mutual labels: speech-recognition, asr

rnnt decoder cuda

An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.

Stars: ✭ 60 (+185.71%)

Mutual labels: speech-recognition, speech-to-text

Artyom.js

A voice control, voice commands, speech recognition and speech synthesis JavaScript library. Create your own Siri, Google Now or Cortana with Google Chrome within your website.

Stars: ✭ 1,011 (+4714.29%)

Mutual labels: speech-recognition, speech-to-text

Unity live caption

Use the Google Speech-to-Text API for real-time live stream captioning in Unity! Best when combined with your virtual character!

Stars: ✭ 26 (+23.81%)

Mutual labels: speech-recognition, speech-to-text

Nativescript Speech Recognition

💬 Speech to text, using the awesome engines readily available on the device.

Stars: ✭ 72 (+242.86%)

Mutual labels: speech-recognition, speech-to-text

speechrec

A simple speech recognition app using the Web Speech API interfaces

Stars: ✭ 18 (-14.29%)

Mutual labels: speech-recognition, speech-to-text

Keras Sincnet

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Stars: ✭ 47 (+123.81%)

Mutual labels: speech-recognition, asr

Patter

speech-to-text in pytorch

Stars: ✭ 71 (+238.1%)

Mutual labels: speech-recognition, speech-to-text

Annyang

💬 Speech recognition for your site

Stars: ✭ 6,216 (+29500%)

Mutual labels: speech-recognition, speech-to-text

B.e.n.j.i.

B.E.N.J.I. - The Impossible Missions Force's digital assistant

Stars: ✭ 83 (+295.24%)

Mutual labels: speech-recognition, speech-to-text

Ktspeechcrawler

Automatically constructing corpus for automatic speech recognition from YouTube videos

Stars: ✭ 92 (+338.1%)

Mutual labels: speech-recognition, asr

Deepspeech Websocket Server

Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments

Stars: ✭ 79 (+276.19%)

Mutual labels: speech-recognition, speech-to-text

Factorized Tdnn

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Stars: ✭ 98 (+366.67%)

Mutual labels: speech-recognition, kaldi

Speech And Text

Speech to text (PocketSphinx, iFlytek API, Baidu API) and text to speech (pyttsx3)

Stars: ✭ 102 (+385.71%)

Mutual labels: speech-recognition, speech-to-text

Deepspeech

A PaddlePaddle implementation of ASR.

Stars: ✭ 1,219 (+5704.76%)

Mutual labels: speech-recognition, speech-to-text

Openseq2seq

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

Stars: ✭ 1,378 (+6461.9%)

Mutual labels: speech-recognition, speech-to-text

hf-experiments

Experiments with Hugging Face 🔬 🤗

Stars: ✭ 37 (+76.19%)

Mutual labels: speech-recognition, automatic-speech-recognition

Delta

DELTA is a deep learning based natural language and speech processing platform.

Stars: ✭ 1,479 (+6942.86%)

Mutual labels: speech-recognition, asr

Kaldi Gop

Computes the GMM-based Goodness of Pronunciation (GOP). Based on Kaldi.

Stars: ✭ 104 (+395.24%)

Mutual labels: speech-recognition, kaldi

Wav2letter.pytorch

A fully convolutional network for speech-to-text, built on PyTorch.

Stars: ✭ 104 (+395.24%)

Mutual labels: speech-recognition, speech-to-text

revai-python-sdk

Rev AI Python SDK

Stars: ✭ 35 (+66.67%)

Mutual labels: speech-recognition, speech-to-text

Self Supervised Speech Recognition

Speech-to-text with self-supervised learning, based on the wav2vec 2.0 framework

Stars: ✭ 106 (+404.76%)

Mutual labels: speech-recognition, speech-to-text

obvi

A Polymer 3+ webcomponent / button for doing speech recognition

Stars: ✭ 54 (+157.14%)

Mutual labels: speech-recognition, automatic-speech-recognition

anycontrol

Voice control for your websites and applications

Stars: ✭ 53 (+152.38%)

Mutual labels: speech-recognition, speech-to-text

E2e Asr

PyTorch Implementations for End-to-End Automatic Speech Recognition

Stars: ✭ 106 (+404.76%)

Mutual labels: speech-recognition, asr

web-speech-cognitive-services

Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.

Stars: ✭ 35 (+66.67%)

Mutual labels: speech-recognition, speech-to-text

Adapt

Adapt Intent Parser

Stars: ✭ 690 (+3185.71%)

Mutual labels: speech-recognition, speech-to-text

Spokestack Python

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.

Stars: ✭ 103 (+390.48%)

Mutual labels: speech-recognition, speech-to-text

Bigcidian

Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.

Stars: ✭ 99 (+371.43%)

Mutual labels: speech-recognition, asr

Deepspeechrecognition

A Chinese deep speech recognition system, including a deep-learning-based acoustic model and a deep-learning-based language model

Stars: ✭ 1,421 (+6666.67%)

Mutual labels: speech-recognition, asr

Kalliope

Kalliope is a framework that will help you to create your own personal assistant.

Stars: ✭ 1,509 (+7085.71%)

Mutual labels: speech-recognition, speech-to-text

UHV-OTS-Speech

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Stars: ✭ 94 (+347.62%)

Mutual labels: speech-recognition, speech-transcription

Awesome Ai Services

An overview of the AI-as-a-service landscape

Stars: ✭ 133 (+533.33%)

Mutual labels: speech-recognition, speech-to-text

Speechrecognizerbutton

UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.

Stars: ✭ 144 (+585.71%)

Mutual labels: speech-recognition, speech-to-text

Tensorflow Ctc Speech Recognition

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).

Stars: ✭ 127 (+504.76%)

Mutual labels: speech-recognition, speech-to-text

Go Astideepspeech

Golang bindings for Mozilla's DeepSpeech speech-to-text library

Stars: ✭ 137 (+552.38%)

Mutual labels: speech-recognition, speech-to-text

Automatic speech recognition

End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow

Stars: ✭ 2,751 (+13000%)

Mutual labels: speech-recognition, automatic-speech-recognition

Speech recognition with tensorflow

Implementation of a seq2seq model for Speech Recognition using the latest version of TensorFlow. Architecture similar to Listen, Attend and Spell.

Stars: ✭ 253 (+1104.76%)

Mutual labels: speech-recognition, speech-to-text

ctc-asr

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Stars: ✭ 112 (+433.33%)

Mutual labels: speech-recognition, asr

Cn2an

📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)

Stars: ✭ 249 (+1085.71%)

Mutual labels: speech-recognition, asr

wave2vec-recognize-docker

wav2vec 2.0 recognition pipeline

Stars: ✭ 30 (+42.86%)

Mutual labels: automatic-speech-recognition, asr

kospeech

Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.

Stars: ✭ 456 (+2071.43%)

Mutual labels: speech-recognition, asr

Speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

Stars: ✭ 242 (+1052.38%)

Mutual labels: speech-recognition, speech-to-text

Chinese text normalization

Chinese text normalization for speech processing

Stars: ✭ 242 (+1052.38%)

Mutual labels: speech-recognition, asr

Nemo

NeMo: a toolkit for conversational AI

Stars: ✭ 3,685 (+17447.62%)

Mutual labels: speech-recognition, speech-to-text

Rnn ctc

Recurrent Neural Network and Long Short Term Memory (LSTM) with Connectionist Temporal Classification implemented in Theano. Includes a Toy training example.

Stars: ✭ 220 (+947.62%)

Mutual labels: speech-recognition, speech-to-text

Hey Jetson

Deep Learning based Automatic Speech Recognition with attention for the Nvidia Jetson.

Stars: ✭ 161 (+666.67%)

Mutual labels: speech-recognition, speech-to-text

Zzz Retired openstt

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

Stars: ✭ 146 (+595.24%)

Mutual labels: speech-recognition, speech-to-text

Kaldiio

A pure python module for reading and writing kaldi ark files

Stars: ✭ 160 (+661.9%)

Mutual labels: speech-recognition, kaldi

K6nele

An Android app that offers speech-to-text services and user interfaces to other apps

Stars: ✭ 196 (+833.33%)

Mutual labels: speech-recognition, speech-to-text

Dictate.js

A small Javascript library for browser-based real-time speech recognition, which uses Recorderjs for audio capture, and a WebSocket connection to the Kaldi GStreamer server for speech recognition.

Stars: ✭ 195 (+828.57%)

Mutual labels: speech-recognition, speech-to-text

simple-obs-stt

Speech-to-text and keyboard input captions for OBS.

Stars: ✭ 89 (+323.81%)

Mutual labels: speech-recognition, speech-to-text

Automatic Speech Recognition

🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)

Stars: ✭ 192 (+814.29%)

Mutual labels: speech-recognition, speech-to-text

Naomi

The Naomi Project is an open source, technology agnostic platform for developing always-on, voice-controlled applications!