All Projects → Wav2letter.pytorch → Similar Projects or Alternatives

802 Open source projects that are alternatives of or similar to Wav2letter.pytorch

Mongolian Speech Recognition
Mongolian speech recognition with PyTorch
Stars: ✭ 97 (-6.73%)
Wav2letter
Speech Recognition model based off of FAIR research paper built using Pytorch.
Stars: ✭ 78 (-25%)
Openseq2seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+1225%)
Voice Overlay Ios
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (+323.08%)
Vosk Api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+1204.81%)
deepspeech
A PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (-56.73%)
htk
HTK Toolkit with Linux 64 bit and Docker support
Stars: ✭ 14 (-86.54%)
Cheetah
On-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (+268.27%)
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+277.88%)
Sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
Stars: ✭ 764 (+634.62%)
Keras Sincnet
Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)
Stars: ✭ 47 (-54.81%)
Audio Pretrained Model
A collection of Audio and Speech pre-trained models.
Stars: ✭ 61 (-41.35%)
vosk-asterisk
Speech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-50%)
speech to text
how to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-66.35%)
SpeechToText
Speech To Text in Android
Stars: ✭ 53 (-49.04%)
voce-browser
Voice Controlled Chromium Web Browser
Stars: ✭ 34 (-67.31%)
Phonetisaurus
Phonetisaurus G2P
Stars: ✭ 277 (+166.35%)
Tensorflow end2end speech recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Stars: ✭ 305 (+193.27%)
Asrt speechrecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (+4652.88%)
musicologist
Music advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-81.73%)
Adapt
Adapt Intent Parser
Stars: ✭ 690 (+563.46%)
Annyang
💬 Speech recognition for your site
Stars: ✭ 6,216 (+5876.92%)
Artyom.js
A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Stars: ✭ 1,011 (+872.12%)
Stephanie Va
Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Stars: ✭ 772 (+642.31%)
Patter
speech-to-text in pytorch
Stars: ✭ 71 (-31.73%)
Nativescript Speech Recognition
💬 Speech to text, using the awesome engines readily available on the device.
Stars: ✭ 72 (-30.77%)
Deepspeech
A PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+1072.12%)
speech-to-text-code-pattern
React app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-64.42%)
Deep-learning-And-Paper
【仅作为交流学习使用】机器智能--相关书目及经典论文包括AutoML、情感分类、语音识别、声纹识别、语音合成实验代码等
Stars: ✭ 62 (-40.38%)
revai-node-sdk
Node.js SDK for the Rev AI API
Stars: ✭ 21 (-79.81%)
speech-to-text
mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-41.35%)
speech-recognition
SDKs and docs for Skit's speech to text service
Stars: ✭ 20 (-80.77%)
leon
🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+8130.77%)
deepspeech.mxnet
A MXNet implementation of Baidu's DeepSpeech architecture
Stars: ✭ 82 (-21.15%)
demo vietasr
Vietnamese Speech Recognition
Stars: ✭ 22 (-78.85%)
kim-voice-assistant
Kim,你的私人语音助理。
Stars: ✭ 70 (-32.69%)
Deepspeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+17861.54%)
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+18.27%)
Rhino
On-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (+290.38%)
Tensorflowasr
⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (+284.62%)
Speech Demo
语音api示例
Stars: ✭ 454 (+336.54%)
deep avsr
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
Stars: ✭ 104 (+0%)
Speech recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+5668.27%)
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+411.54%)
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (+609.62%)
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+401.92%)
Discordspeechbot
A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-66.35%)
Kur
Descriptive Deep Learning
Stars: ✭ 811 (+679.81%)
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-45.19%)
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+371.15%)
B.e.n.j.i.
B.E.N.J.I.- The Impossible Missions Force's digital assistant
Stars: ✭ 83 (-20.19%)
Openasr
A pytorch based end2end speech recognition system.
Stars: ✭ 69 (-33.65%)
Spokestack Python
Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application.
Stars: ✭ 103 (-0.96%)
Dragonfire
the open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (+976.92%)
kaldi-long-audio-alignment
Long audio alignment using Kaldi
Stars: ✭ 21 (-79.81%)
spokestack-ios
Spokestack: give your iOS app a voice interface!
Stars: ✭ 27 (-74.04%)
Speech To Text Benchmark
speech to text benchmark framework
Stars: ✭ 481 (+362.5%)
Angle
⦠ Angle: new speakable syntax for python 💡
Stars: ✭ 61 (-41.35%)
Deepspeech Websocket Server
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
Stars: ✭ 79 (-24.04%)
Speech And Text
Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)
Stars: ✭ 102 (-1.92%)
1-60 of 802 similar projects