Top 138 vision open source projects

Amazing Arkit
ARKit相关资源汇总 群:326705018
Cs231a Notes
The course notes for Stanford's CS231A course on computer vision
Arc Robot Vision
MIT-Princeton Vision Toolbox for Robotic Pick-and-Place at the Amazon Robotics Challenge 2017 - Robotic Grasping and One-shot Recognition of Novel Objects with Deep Learning.
React Native Text Detector
Text Detector from image for react native using firebase MLKit on android and Tesseract on iOS
Opticalflow visualization
Python optical flow visualization following Baker et al. (ICCV 2007) as used by the MPI-Sintel challenge
Donkeycar
Open source hardware and software platform to build a small scale self driving car.
Apriltag ros
A ROS wrapper of the AprilTag 3 visual fiducial detector
Arucogen
Online ArUco markers generator
Openkai
OpenKAI: A modern framework for unmanned vehicle and robot control
Nextlevel
NextLevel was initally a weekend project that has now grown into a open community of camera platform enthusists. The software provides foundational components for managing media recording, camera interface customization, gestural interaction customization, and image streaming on iOS. The same capabilities can also be found in apps such as Snapchat, Instagram, and Vine.
Robotcar Dataset Sdk
Software Development Kit for the Oxford Robotcar Dataset
Flowiz
Converts Optical Flow files to images and optionally compiles them to a video. Flow viewer GUI is also available. Check out mockup right from Github Pages:
Ravens
Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.
Knn Matting
Source Code for KNN Matting, CVPR 2012 / TPAMI 2013. MATLAB code ready to run. Simple and robust implementation under 40 lines.
Facelandmarksdetection
Finds facial features such as face contour, eyes, mouth and nose in an image.
Nvidia Gpu Tensor Core Accelerator Pytorch Opencv
A complete machine vision container that includes Jupyter notebooks with built-in code hinting, Anaconda, CUDA-X, TensorRT inference accelerator for Tensor cores, CuPy (GPU drop in replacement for Numpy), PyTorch, TF2, Tensorboard, and OpenCV for accelerated workloads on NVIDIA Tensor cores and GPUs.
Openwhisk Darkvisionapp
Discover dark data in videos with IBM Watson and IBM Cloud Functions
Arkit Multiplayer
ARKit multiplayer experience explanation & example
Android Ocrsample
Android OCR example application which uses Google Text Recognition API
Flowersvisiondemo
🌸 iOS11 demo application for flower classification.
Java Docs Samples
Java and Kotlin Code samples used on cloud.google.com
Sfacecompare
Simple lib for iOS to find and compare faces.
Ios11 Qr Code Example
Example showing how to use the QR-code detection API (VNDetectBarcodesRequest) in iOS 11.
Codeslam
Implementation of CodeSLAM — Learning a Compact, Optimisable Representation for Dense Visual SLAM paper (https://arxiv.org/pdf/1804.00874.pdf)
Eskf
ROS Error-State Kalman Filter based on PX4/ecl. Performs GPS/Magnetometer/Vision Pose/Optical Flow/RangeFinder fusion with IMU
Inceptionvisiondemo
🎥 iOS11 demo application for dominant objects detection.
Eyevis
Android based Vocal Vision for Visually Impaired. Object Detection, Voice Assistance, Optical Character Reader, Read Aloud, Face Recognition, Landmark Recognition, Image Labelling etc.
Facevision
iOS11 Vision framework example. Detection of face landmarks
Chineseidcardocr
[Deprecated] 🇨🇳中国二代身份证光学识别
Photoassessment
Photo Assessment using Core ML and Metal.
Caffe
Caffe: a fast open framework for deep learning.
Objectclassifier
An iOS swift app that detects objects using machine learning (CoreML, Vision)
Liooon Not A Liooon Classifier
A troll app to check if an object seen by your camera is a lion. Uses iOS CoreML, Vision APIs
Awesome Machine Learning
🎰 A curated list of machine learning resources, preferably CoreML
Evil
Optical Character Recognition in Swift for iOS&macOS. 银行卡、身份证、门牌号光学识别
3dmatch Toolbox
3DMatch - a 3D ConvNet-based local geometric descriptor for aligning 3D meshes and point clouds.
Cudasift
A CUDA implementation of SIFT for NVidia GPUs (1.2 ms on a GTX 1060)
Paddlehub
Awesome pre-trained models toolkit based on PaddlePaddle.(300+ models including Image, Text, Audio and Video with Easy Inference & Serving deployment)
Visual Pushing Grasping
Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.
Iowncode
A curated collection of iOS, ML, AR resources sprinkled with some UI additions
Facecropper
✂️ Crop faces, inside of your image, with iOS 11 Vision api.
Tsdf Fusion Python
Python code to fuse multiple RGB-D images into a TSDF voxel volume.
Rewriting
Rewriting a Deep Generative Model, ECCV 2020 (oral). Interactive tool to directly edit the rules of a GAN to synthesize scenes with objects added, removed, or altered. Change StyleGANv2 to make extravagant eyebrows, or horses wearing hats.
Pytorch Dense Correspondence
Code for "Dense Object Nets: Learning Dense Visual Object Descriptors By and For Robotic Manipulation"
Tsdf Fusion
Fuse multiple depth frames into a TSDF voxel volume.
Nodejs Vision
Node.js client for Google Cloud Vision: Derive insight from images.
1-60 of 138 vision projects