calvinCALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
Stars: ✭ 105 (+156.1%)
iPerceiveApplying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering | Python3 | PyTorch | CNNs | Causality | Reasoning | LSTMs | Transformers | Multi-Head Self Attention | Published in IEEE Winter Conference on Applications of Computer Vision (WACV) 2021
Stars: ✭ 52 (+26.83%)
pytorch violetA PyTorch implementation of VIOLET
Stars: ✭ 119 (+190.24%)
Arc Robot VisionMIT-Princeton Vision Toolbox for Robotic Pick-and-Place at the Amazon Robotics Challenge 2017 - Robotic Grasping and One-shot Recognition of Novel Objects with Deep Learning.
Stars: ✭ 224 (+446.34%)
stereo.visionplanar fitting computation using stereo vision techniques
Stars: ✭ 19 (-53.66%)
pybvA lightweight I/O utility for the BrainVision data format, written in Python.
Stars: ✭ 18 (-56.1%)
Learnable-Image-ResizingTF 2 implementation Learning to Resize Images for Computer Vision Tasks (https://arxiv.org/abs/2103.09950v1).
Stars: ✭ 48 (+17.07%)
mediapipe plusThe purpose of this project is to apply mediapipe to more AI chips.
Stars: ✭ 38 (-7.32%)
DonkeycarOpen source hardware and software platform to build a small scale self driving car.
Stars: ✭ 2,192 (+5246.34%)
res-mlp-pytorchImplementation of ResMLP, an all MLP solution to image classification, in Pytorch
Stars: ✭ 178 (+334.15%)
OpenkaiOpenKAI: A modern framework for unmanned vehicle and robot control
Stars: ✭ 150 (+265.85%)
TokenLabelingPytorch implementation of "All Tokens Matter: Token Labeling for Training Better Vision Transformers"
Stars: ✭ 385 (+839.02%)
SAPC-APCAAPCA (Accessible Perceptual Contrast Algorithm) is a new method for predicting contrast for use in emerging web standards (WCAG 3) for determining readability contrast. APCA is derived form the SAPC (S-LUV Advanced Predictive Color) which is an accessibility-oriented color appearance model designed for self-illuminated displays.
Stars: ✭ 266 (+548.78%)
lang2segReferring Expression Object Segmentation with Caption-Aware Consistency, BMVC 2019
Stars: ✭ 30 (-26.83%)
TRAR-VQA[ICCV 2021] TRAR: Routing the Attention Spans in Transformers for Visual Question Answering -- Official Implementation
Stars: ✭ 49 (+19.51%)
Grocery-Product-DetectionThis repository builds a product detection model to recognize products from grocery shelf images.
Stars: ✭ 73 (+78.05%)
wikiHow paper listA paper list of research conducted based on wikiHow
Stars: ✭ 25 (-39.02%)
React Native Text DetectorText Detector from image for react native using firebase MLKit on android and Tesseract on iOS
Stars: ✭ 194 (+373.17%)
face age genderCan we predict the age and gender of someone given a picture of their face ?
Stars: ✭ 40 (-2.44%)
Apriltag rosA ROS wrapper of the AprilTag 3 visual fiducial detector
Stars: ✭ 160 (+290.24%)
Robotcar Dataset SdkSoftware Development Kit for the Oxford Robotcar Dataset
Stars: ✭ 151 (+268.29%)
CNN-GoogLeNet👁 Vision : Model 4: GoogLeNet : Image Classification
Stars: ✭ 17 (-58.54%)
RavensTrain robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.
Stars: ✭ 133 (+224.39%)
monodepthPython ROS depth estimation from RGB image based on code from the paper "High Quality Monocular Depth Estimation via Transfer Learning"
Stars: ✭ 41 (+0%)
DonkeyDriftOpen-source self-driving car based on DonkeyCar and programmable chassis
Stars: ✭ 15 (-63.41%)
sam-textvqaOfficial code for paper "Spatially Aware Multimodal Transformers for TextVQA" published at ECCV, 2020.
Stars: ✭ 51 (+24.39%)
CarLens-iOSCarLens - Recognize and Collect Cars
Stars: ✭ 124 (+202.44%)
CBPOfficial Tensorflow Implementation of the AAAI-2020 paper "Temporally Grounding Language Queries in Videos by Contextual Boundary-aware Prediction"
Stars: ✭ 52 (+26.83%)
frc-score-detectionA program to detect FRC match scores from their livestream.
Stars: ✭ 15 (-63.41%)
nested-transformerNested Hierarchical Transformer https://arxiv.org/pdf/2105.12723.pdf
Stars: ✭ 174 (+324.39%)
FaceDataA macOS app to parse face landmarks from a video for GANs training
Stars: ✭ 71 (+73.17%)
Opencv📷 Computer-Vision Demos
Stars: ✭ 244 (+495.12%)
X-VLMX-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)
Stars: ✭ 283 (+590.24%)
Cs231a NotesThe course notes for Stanford's CS231A course on computer vision
Stars: ✭ 230 (+460.98%)
flutter-visioniOS and Android app built with Flutter and Firebase. Includes Firebase ML Vision, Firestore, and Storage
Stars: ✭ 45 (+9.76%)
ReferFormer[CVPR2022] Official Implementation of ReferFormer
Stars: ✭ 230 (+460.98%)
Opticalflow visualizationPython optical flow visualization following Baker et al. (ICCV 2007) as used by the MPI-Sintel challenge
Stars: ✭ 183 (+346.34%)
mlp-mixer-pytorchAn All-MLP solution for Vision, from Google AI
Stars: ✭ 771 (+1780.49%)
photonvisionPhotonVision is the free, fast, and easy-to-use computer vision solution for the FIRST Robotics Competition.
Stars: ✭ 115 (+180.49%)
ArucogenOnline ArUco markers generator
Stars: ✭ 155 (+278.05%)
non-contact-sleep-apnea-detectionGihan Jayatilaka, Harshana Weligampola, Suren Sritharan, Pankayaraj Pathmanathan, Roshan Ragel and Isuru Nawinne, "Non-contact Infant Sleep Apnea Detection," 2019 14th Conference on Industrial and Information Systems (ICIIS), Kandy, Sri Lanka, 2019, pp. 260-265, doi: 10.1109/ICIIS47346.2019.9063269.
Stars: ✭ 15 (-63.41%)
NextlevelNextLevel was initally a weekend project that has now grown into a open community of camera platform enthusists. The software provides foundational components for managing media recording, camera interface customization, gestural interaction customization, and image streaming on iOS. The same capabilities can also be found in apps such as Snapchat, Instagram, and Vine.
Stars: ✭ 1,940 (+4631.71%)
CustomVisionMicrosoftToCoreMLDemoAppThis app recognises 3 hand signs - fist, high five and victory hand [ rock, paper, scissors basically :) ] with live feed camera. It uses a HandSigns.mlmodel which has been trained using Custom Vision from Microsoft.
Stars: ✭ 25 (-39.02%)
FlowizConverts Optical Flow files to images and optionally compiles them to a video. Flow viewer GUI is also available. Check out mockup right from Github Pages:
Stars: ✭ 144 (+251.22%)
Cocoaai🤖 The Cocoa Artificial Intelligence Lab
Stars: ✭ 134 (+226.83%)
handbookWe're a small high-trust livelihood pod doing tech consulting within Enspiral.
Stars: ✭ 35 (-14.63%)
VisionComputer Vision And Neural Network with Xamarin
Stars: ✭ 54 (+31.71%)
MTL-AQAWhat and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment [CVPR 2019]
Stars: ✭ 38 (-7.32%)
SentimentVisionDemo🌅 iOS11 demo application for visual sentiment prediction.
Stars: ✭ 34 (-17.07%)