All Projects → Self Supervised Speech Recognition → Similar Projects or Alternatives

723 Open source projects that are alternatives of or similar to Self Supervised Speech Recognition

Vosk
VOSK Speech Recognition Toolkit
Stars: ✭ 182 (+71.7%)
Tensorflowasr
⚡️ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
Stars: ✭ 400 (+277.36%)
sova-asr
SOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+16.04%)
Vosk Api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Stars: ✭ 1,357 (+1180.19%)
Mongolian Speech Recognition
Mongolian speech recognition with PyTorch
Stars: ✭ 97 (-8.49%)
leon
🧠 Leon is your open-source personal assistant.
Stars: ✭ 8,560 (+7975.47%)
Openseq2seq
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
Stars: ✭ 1,378 (+1200%)
Deepspeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Stars: ✭ 18,680 (+17522.64%)
Stephanie Va
Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks imitating much of an virtual assistant's work.
Stars: ✭ 772 (+628.3%)
Alibi Detect
Algorithms for outlier and adversarial instance detection, concept drift and metrics.
Stars: ✭ 604 (+469.81%)
Wav2letter.pytorch
A fully convolution-network for speech-to-text, built on pytorch.
Stars: ✭ 104 (-1.89%)
catgan pytorch
Unsupervised and Semi-supervised Learning with Categorical Generative Adversarial Networks
Stars: ✭ 50 (-52.83%)
Speech And Text
Speech to text (PocketSphinx, Iflytex API, Baidu API) and text to speech (pyttsx3) | 语音转文字(PocketSphinx、百度 API、科大讯飞 API)和文字转语音(pyttsx3)
Stars: ✭ 102 (-3.77%)
htk
HTK Toolkit with Linux 64 bit and Docker support
Stars: ✭ 14 (-86.79%)
speech-recognition
SDKs and docs for Skit's speech to text service
Stars: ✭ 20 (-81.13%)
Phonetisaurus
Phonetisaurus G2P
Stars: ✭ 277 (+161.32%)
Tensorflow end2end speech recognition
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
Stars: ✭ 305 (+187.74%)
Wav2letter
Speech Recognition model based off of FAIR research paper built using Pytorch.
Stars: ✭ 78 (-26.42%)
kim-voice-assistant
Kim,你的私人语音助理。
Stars: ✭ 70 (-33.96%)
Silero Models
Silero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+392.45%)
Sonus
💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+401.89%)
Eesen
The official repository of the Eesen project
Stars: ✭ 738 (+596.23%)
Speech recognition
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Stars: ✭ 5,999 (+5559.43%)
Audio Pretrained Model
A collection of Audio and Speech pre-trained models.
Stars: ✭ 61 (-42.45%)
Syn Speech
Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework
Stars: ✭ 57 (-46.23%)
Patter
speech-to-text in pytorch
Stars: ✭ 71 (-33.02%)
voce-browser
Voice Controlled Chromium Web Browser
Stars: ✭ 34 (-67.92%)
deepspeech
A PyTorch implementation of DeepSpeech and DeepSpeech2.
Stars: ✭ 45 (-57.55%)
B.e.n.j.i.
B.E.N.J.I.- The Impossible Missions Force's digital assistant
Stars: ✭ 83 (-21.7%)
speech to text
how to use the Google Cloud Speech API to transcribe audio/video files.
Stars: ✭ 35 (-66.98%)
SpeechToText
Speech To Text in Android
Stars: ✭ 53 (-50%)
speech-to-text
mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
Stars: ✭ 61 (-42.45%)
musicologist
Music advice from a conversational interface powered by Algolia
Stars: ✭ 19 (-82.08%)
spear
SPEAR: Programmatically label and build training data quickly.
Stars: ✭ 81 (-23.58%)
Deepspeech Websocket Server
Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments
Stars: ✭ 79 (-25.47%)
L2c
Learning to Cluster. A deep clustering strategy.
Stars: ✭ 262 (+147.17%)
Deepspeech
A PaddlePaddle implementation of ASR.
Stars: ✭ 1,219 (+1050%)
demo vietasr
Vietnamese Speech Recognition
Stars: ✭ 22 (-79.25%)
Awesome Kaldi
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
Stars: ✭ 393 (+270.75%)
Cheetah
On-device streaming speech-to-text engine powered by deep learning
Stars: ✭ 383 (+261.32%)
Rhino
On-device speech-to-intent engine powered by deep learning
Stars: ✭ 406 (+283.02%)
vosk-asterisk
Speech Recognition in Asterisk with Vosk Server
Stars: ✭ 52 (-50.94%)
Java Speech Api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Stars: ✭ 490 (+362.26%)
Speech To Text Benchmark
speech to text benchmark framework
Stars: ✭ 481 (+353.77%)
Athena
an open-source implementation of sequence-to-sequence based speech processing engine
Stars: ✭ 542 (+411.32%)
Speech Demo
语音api示例
Stars: ✭ 454 (+328.3%)
Annyang
💬 Speech recognition for your site
Stars: ✭ 6,216 (+5764.15%)
Adapt
Adapt Intent Parser
Stars: ✭ 690 (+550.94%)
Kur
Descriptive Deep Learning
Stars: ✭ 811 (+665.09%)
Voice Overlay Ios
🗣 An overlay that gets your user’s voice permission and input as text in a customizable UI
Stars: ✭ 440 (+315.09%)
Nativescript Speech Recognition
💬 Speech to text, using the awesome engines readily available on the device.
Stars: ✭ 72 (-32.08%)
Artyom.js
A voice control - voice commands - speech recognition and speech synthesis javascript library. Create your own siri,google now or cortana with Google Chrome within your website.
Stars: ✭ 1,011 (+853.77%)
Angle
⦠ Angle: new speakable syntax for python 💡
Stars: ✭ 61 (-42.45%)
Susi
SuSi: Python package for unsupervised, supervised and semi-supervised self-organizing maps (SOM)
Stars: ✭ 42 (-60.38%)
Openasr
A pytorch based end2end speech recognition system.
Stars: ✭ 69 (-34.91%)
Deep-learning-And-Paper
【仅作为交流学习使用】机器智能--相关书目及经典论文包括AutoML、情感分类、语音识别、声纹识别、语音合成实验代码等
Stars: ✭ 62 (-41.51%)
speech-to-text-code-pattern
React app using the Watson Speech to Text service to transform voice audio into written text.
Stars: ✭ 37 (-65.09%)
Asrt speechrecognition
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Stars: ✭ 4,943 (+4563.21%)
Discordspeechbot
A speech-to-text bot for discord with music commands and more using NodeJS. Ideally for controlling your Discord server using voice commands, can also be useful for hearing-impaired people.
Stars: ✭ 35 (-66.98%)
Dragonfire
the open-source virtual assistant for Ubuntu based Linux distributions
Stars: ✭ 1,120 (+956.6%)
1-60 of 723 similar projects