Vosk ServerWebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
Stars: ✭ 277 (+385.96%)
Kaldi Gstreamer ServerReal-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
Stars: ✭ 935 (+1540.35%)
rustfstRust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Stars: ✭ 104 (+82.46%)
League Of Legends BotLeague of legends bot is a pixel bot for League Of Legends 10.19, written in C# .NET using image processing , and dependency injection (Pattern Scripting)
Stars: ✭ 275 (+382.46%)
DetectionMetricsTool to evaluate deep-learning detection and segmentation models, and to create datasets
Stars: ✭ 66 (+15.79%)
Meal V2MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks
Stars: ✭ 534 (+836.84%)
Alan Sdk CordovaAlan AI Cordova SDK adds a voice assistant or chatbot to your app.
Stars: ✭ 269 (+371.93%)
kaldi ag trainingDocker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-grammar.
Stars: ✭ 14 (-75.44%)
Iflytek awaken asruse iflytek's technology to realize awaken and order recognition
Stars: ✭ 53 (-7.02%)
InimesedAn Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.
Stars: ✭ 65 (+14.04%)
PocketsphinxPocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop
Stars: ✭ 2,934 (+5047.37%)
ml-with-audioHF's ML for Audio study group
Stars: ✭ 104 (+82.46%)
Lemniscate.pytorchUnsupervised Feature Learning via Non-parametric Instance Discrimination
Stars: ✭ 532 (+833.33%)
DeepSpeech-APIThe code enables users to use Mozilla's Deep Speech model over the Web Browser.
Stars: ✭ 31 (-45.61%)
HotVoiceAdds Speech Recognition support to AutoHotkey, via a C# DLL
Stars: ✭ 41 (-28.07%)
ConvNetA convolutional neural network for images recognition
Stars: ✭ 23 (-59.65%)
apiSpeechly public API definitions and generated code
Stars: ✭ 15 (-73.68%)
rnnt decoder cudaAn efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
Stars: ✭ 60 (+5.26%)
Sonus💬 /so.nus/ STT (speech to text) for Node with offline hotword detection
Stars: ✭ 532 (+833.33%)
object-size-detector-pythonMonitor mechanical bolts as they move down a conveyor belt. When a bolt of an irregular size is detected, this solution emits an alert.
Stars: ✭ 26 (-54.39%)
use-smartcropReact hook for smartcrop.js to content aware image cropping with points of interest and facial recognition.
Stars: ✭ 93 (+63.16%)
Android-TTS-STTOne line solution for Android Text to speech(TTS) & Speech to Text(STT) translation problem
Stars: ✭ 77 (+35.09%)
RapiddrawA simple artificial intelligence experiment to find out if mobile neural networks can recognize human-made doodles
Stars: ✭ 39 (-31.58%)
demo vietasrVietnamese Speech Recognition
Stars: ✭ 22 (-61.4%)
CtcdecoderConnectionist Temporal Classification (CTC) decoding algorithms: best path, prefix search, beam search and token passing. Implemented in Python.
Stars: ✭ 529 (+828.07%)
ocrSimple app to extract text from pictures using Tesseract
Stars: ✭ 98 (+71.93%)
web-speech-cognitive-servicesPolyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech service.
Stars: ✭ 35 (-38.6%)
Mini ImagenetGenerate mini-ImageNet with ImageNet for fewshot learning
Stars: ✭ 22 (-61.4%)
sova-asrSOVA ASR (Automatic Speech Recognition)
Stars: ✭ 123 (+115.79%)
CarLens-iOSCarLens - Recognize and Collect Cars
Stars: ✭ 124 (+117.54%)
Silero ModelsSilero Models: pre-trained STT models and benchmarks made embarrassingly simple
Stars: ✭ 522 (+815.79%)
SAN[ECCV 2020] Scale Adaptive Network: Learning to Learn Parameterized Classification Networks for Scalable Input Images
Stars: ✭ 41 (-28.07%)
StageMateStageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.
Stars: ✭ 60 (+5.26%)
Classify-Real-Time-DesktopInception model used to classify camera feed on real time. Coded during the Deep Learning Hackathon 2017 San Francisco
Stars: ✭ 44 (-22.81%)
Parrots Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine for Chinese.
Stars: ✭ 48 (-15.79%)
rps-cvA Rock-Paper-Scissors game using computer vision and machine learning on Raspberry Pi
Stars: ✭ 102 (+78.95%)
Tensorflow object tracking videoObject Tracking in Tensorflow ( Localization Detection Classification ) developed to partecipate to ImageNET VID competition
Stars: ✭ 491 (+761.4%)
ghostnet.pytorch73.6% GhostNet 1.0x pre-trained model on ImageNet
Stars: ✭ 90 (+57.89%)
Deep-LearningIt contains the coursework and the practice I have done while learning Deep Learning.🚀 👨💻💥 🚩🌈
Stars: ✭ 21 (-63.16%)
Wavenet SttAn end-to-end speech recognition system with Wavenet. Built using C++ and python.
Stars: ✭ 18 (-68.42%)
ocr recognitionuse java opencv tesseract ocr image words detects and recognition,use python generate jTessBoxEditor train box file
Stars: ✭ 52 (-8.77%)
concurrent-video-analytic-pipeline-optimization-sample-lCreate a concurrent video analysis pipeline featuring multistream face and human pose detection, vehicle attribute detection, and the ability to encode multiple videos to local storage in a single stream.
Stars: ✭ 39 (-31.58%)
Tensorflow-Keyword-SpottingKeyword spotting using various architecture like convolutional vggnet , 1D convolutional network and CTC.
Stars: ✭ 27 (-52.63%)
download audioset📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).
Stars: ✭ 53 (-7.02%)
ImagenetTrial on kaggle imagenet object localization by yolo v3 in google cloud
Stars: ✭ 56 (-1.75%)
Image recognitionPackages for image recognition - Robocup TU/e Robotics
Stars: ✭ 53 (-7.02%)
Formant AnalyzeriOS application for finding formants in spoken sounds
Stars: ✭ 43 (-24.56%)