Cs231a NotesThe course notes for Stanford's CS231A course on computer vision
Arc Robot VisionMIT-Princeton Vision Toolbox for Robotic Pick-and-Place at the Amazon Robotics Challenge 2017 - Robotic Grasping and One-shot Recognition of Novel Objects with Deep Learning.
Opticalflow visualizationPython optical flow visualization following Baker et al. (ICCV 2007) as used by the MPI-Sintel challenge
DonkeycarOpen source hardware and software platform to build a small scale self driving car.
Apriltag rosA ROS wrapper of the AprilTag 3 visual fiducial detector
OpenkaiOpenKAI: A modern framework for unmanned vehicle and robot control
NextlevelNextLevel was initally a weekend project that has now grown into a open community of camera platform enthusists. The software provides foundational components for managing media recording, camera interface customization, gestural interaction customization, and image streaming on iOS. The same capabilities can also be found in apps such as Snapchat, Instagram, and Vine.
FlowizConverts Optical Flow files to images and optionally compiles them to a video. Flow viewer GUI is also available. Check out mockup right from Github Pages:
RavensTrain robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.
Cocoaai🤖 The Cocoa Artificial Intelligence Lab
Knn MattingSource Code for KNN Matting, CVPR 2012 / TPAMI 2013. MATLAB code ready to run. Simple and robust implementation under 40 lines.
Nvidia Gpu Tensor Core Accelerator Pytorch OpencvA complete machine vision container that includes Jupyter notebooks with built-in code hinting, Anaconda, CUDA-X, TensorRT inference accelerator for Tensor cores, CuPy (GPU drop in replacement for Numpy), PyTorch, TF2, Tensorboard, and OpenCV for accelerated workloads on NVIDIA Tensor cores and GPUs.
Android OcrsampleAndroid OCR example application which uses Google Text Recognition API
Ios11 Qr Code ExampleExample showing how to use the QR-code detection API (VNDetectBarcodesRequest) in iOS 11.
CodeslamImplementation of CodeSLAM — Learning a Compact, Optimisable Representation for Dense Visual SLAM paper (https://arxiv.org/pdf/1804.00874.pdf)
EskfROS Error-State Kalman Filter based on PX4/ecl. Performs GPS/Magnetometer/Vision Pose/Optical Flow/RangeFinder fusion with IMU
EyevisAndroid based Vocal Vision for Visually Impaired. Object Detection, Voice Assistance, Optical Character Reader, Read Aloud, Face Recognition, Landmark Recognition, Image Labelling etc.
FacevisioniOS11 Vision framework example. Detection of face landmarks
CaffeCaffe: a fast open framework for deep learning.
ObjectclassifierAn iOS swift app that detects objects using machine learning (CoreML, Vision)
EvilOptical Character Recognition in Swift for iOS&macOS. 银行卡、身份证、门牌号光学识别
DeepdriveDeepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving
3dmatch Toolbox3DMatch - a 3D ConvNet-based local geometric descriptor for aligning 3D meshes and point clouds.
CudasiftA CUDA implementation of SIFT for NVidia GPUs (1.2 ms on a GTX 1060)
PaddlehubAwesome pre-trained models toolkit based on PaddlePaddle.(300+ models including Image, Text, Audio and Video with Easy Inference & Serving deployment)
Visual Pushing GraspingTrain robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.
IowncodeA curated collection of iOS, ML, AR resources sprinkled with some UI additions
Facecropper✂️ Crop faces, inside of your image, with iOS 11 Vision api.
RewritingRewriting a Deep Generative Model, ECCV 2020 (oral). Interactive tool to directly edit the rules of a GAN to synthesize scenes with objects added, removed, or altered. Change StyleGANv2 to make extravagant eyebrows, or horses wearing hats.
CaerHigh-performance Vision library in Python. Scale your research, not boilerplate.
MyvisionComputer vision based ML training data generation tool 🚀
Tsdf FusionFuse multiple depth frames into a TSDF voxel volume.
Nodejs VisionNode.js client for Google Cloud Vision: Derive insight from images.