All Projects → Tacotron_asr → Similar Projects or Alternatives

518 Open source projects that are alternatives of or similar to Tacotron_asr

kaldi ag training
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-91.52%)
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-65.45%)
Annyang
💬 Speech recognition for your site
Stars: ✭ 6,216 (+3667.27%)
KeenASR-Android-PoC
A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html
Stars: ✭ 21 (-87.27%)
Openasr
A pytorch based end2end speech recognition system.
Stars: ✭ 69 (-58.18%)
simple-obs-stt
Speech-to-text and keyboard input captions for OBS.
Stars: ✭ 89 (-46.06%)
Kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
Stars: ✭ 11,151 (+6658.18%)
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+196.97%)
ASR-Audio-Data-Links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 179 (+8.48%)
Lingvo
Lingvo
Stars: ✭ 2,361 (+1330.91%)
wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
Stars: ✭ 205 (+24.24%)
deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-50.3%)
anycontrol
Voice control for your websites and applications
Stars: ✭ 53 (-67.88%)
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (-25.45%)
Edgedict
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
Stars: ✭ 205 (+24.24%)
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+222.42%)
Asr audio data links
A list of publically available audio data that anyone can download for ASR or other speech activities
Stars: ✭ 128 (-22.42%)
Discordspeechbot
A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-78.79%)
Deepspeech
A PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+638.79%)
Speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Stars: ✭ 242 (+46.67%)
speech to text
how to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-78.79%)
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+138.18%)
Tacotron
Audio samples accompanying publications related to Tacotron, an end-to-end speech synthesis model.
Stars: ✭ 493 (+198.79%)
Mutual labels:  speech, tacotron
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+216.36%)
Zzz Retired openstt
RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (-11.52%)
Speechrecognizerbutton
UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.
Stars: ✭ 144 (-12.73%)
Nodejs Speech
Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.
Stars: ✭ 545 (+230.3%)
Mutual labels:  speech, speech-to-text
Speech Emotion Analyzer
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
Stars: ✭ 633 (+283.64%)
Mutual labels:  speech-recognition, speech
Speech recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+3535.76%)
Speech To Text Benchmark
speech to text benchmark framework
Stars: ✭ 481 (+191.52%)
Tensorflow Ctc Speech Recognition
Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).
Stars: ✭ 127 (-23.03%)
Vad
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Stars: ✭ 622 (+276.97%)
Mutual labels:  speech-recognition, speech
Stephanie Va
Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Stars: ✭ 772 (+367.88%)
Speech Demo
语音api示例
Stars: ✭ 454 (+175.15%)
Pykaldi
A Python wrapper for Kaldi
Stars: ✭ 756 (+358.18%)
Mutual labels:  speech-recognition, speech
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (+347.27%)
Kur
Descriptive Deep Learning
Stars: ✭ 811 (+391.52%)
Pytorch Asr
ASR with PyTorch
Stars: ✭ 124 (-24.85%)
Mutual labels:  speech-recognition, speech
Dc tts
A TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Stars: ✭ 1,017 (+516.36%)
Mutual labels:  speech, speech-to-text
Soloud
Free, easy, portable audio engine for games
Stars: ✭ 1,048 (+535.15%)
Mutual labels:  speech, speech-to-text
Voice Overlay Ios
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (+166.67%)
Angle
⦠ Angle: new speakable syntax for python 💡
Stars: ✭ 61 (-63.03%)
Adapt
Adapt Intent Parser
Stars: ✭ 690 (+318.18%)
Artyom.js
A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Stars: ✭ 1,011 (+512.73%)
Audio Pretrained Model
A collection of Audio and Speech pre-trained models.
Stars: ✭ 61 (-63.03%)
Dragonfire
the open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (+578.79%)
Nativescript Speech Recognition
💬 Speech to text, using the awesome engines readily available on the device.
Stars: ✭ 72 (-56.36%)
Wav2letter
Speech Recognition model based off of FAIR research paper built using Pytorch.
Stars: ✭ 78 (-52.73%)
B.e.n.j.i.
B.E.N.J.I.- The Impossible Missions Force's digital assistant
Stars: ✭ 83 (-49.7%)
Patter
speech-to-text in pytorch
Stars: ✭ 71 (-56.97%)
Deepspeech Websocket Server
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
Stars: ✭ 79 (-52.12%)
Go Astideepspeech
Golang bindings for Mozilla's DeepSpeech speech-to-text library
Stars: ✭ 137 (-16.97%)
Holobot
HoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.
Stars: ✭ 114 (-30.91%)
Mutual labels:  speech-recognition, speech
Mongolian Speech Recognition
Mongolian speech recognition with PyTorch
Stars: ✭ 97 (-41.21%)
Audiomate
Python library for handling audio datasets.
Stars: ✭ 99 (-40%)
Mutual labels:  speech-recognition, speech
Speech To Text Russian
Проект для распознавания речи на русском языке на основе pykaldi.
Stars: ✭ 151 (-8.48%)
Julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Stars: ✭ 1,258 (+662.42%)
Mutual labels:  speech-recognition, speech
Vosk Api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+722.42%)
Allosaurus
Allosaurus is a pretrained universal phone recognizer for more than 2000 languages
Stars: ✭ 135 (-18.18%)
Mutual labels:  speech-recognition, speech
Awesome Ai Services
An overview of the AI-as-a-service landscape
Stars: ✭ 133 (-19.39%)
1-60 of 518 similar projects