Top 138 vision open source projects

Neural Motifs
Code for Neural Motifs: Scene Graph Parsing with Global Context (CVPR 2018)
Aravis
A vision library for genicam based cameras
R2c
Recognition to Cognition Networks (code for the model in "From Recognition to Cognition: Visual Commonsense Reasoning", CVPR 2019)
Home Platform
HoME: a Household Multimodal Environment is a platform for artificial agents to learn from vision, audio, semantics, physics, and interaction with objects and other agents, all within a realistic context.
Multi sensor fusion
Multi-Sensor Fusion (GNSS, IMU, Camera) 多源多传感器融合定位 GPS/INS组合导航 PPP/INS紧组合
Ios 11 By Examples
👨🏻‍💻 Examples of new iOS 11 APIs
Grip
Program for rapidly developing computer vision applications
Imagedetect
✂️ Detect and crop faces, barcodes and texts in image with iOS 11 Vision api.
Apc Vision Toolbox
MIT-Princeton Vision Toolbox for the Amazon Picking Challenge 2016 - RGB-D ConvNet-based object segmentation and 6D object pose estimation.
Dest
🐼 One Millisecond Deformable Shape Tracking Library (DEST)
Dirt
DIRT: a fast differentiable renderer for TensorFlow
Facesvisiondemo
👀 iOS11 demo application for age and gender classification of facial images.
Visionfacedetection
An example of use a Vision framework for face landmarks detection in iOS 11
LogGabor
A python implementation for a LogGabor filtering and pyramid representation
Recogcis
Face detection & recognition AR app using the mlmodel to recognize company employees.
VisionLab
📺 A framework with common source code for demo projects that use Vision Framework
sim2real-docs
Synthesize image datasets of documents in natural scenes with Python+Blender3D
Spatial-Transformer-Networks-with-Keras
This repository provides a Colab Notebook that shows how to use Spatial Transformer Networks inside CNNs in Keras.
sp segmenter
Superpixel-based semantic segmentation, with object pose estimation and tracking. Provided as a ROS package.
PSCognitiveService
Powershell module to access Microsoft Azure Machine learning RESTful API's or Microsoft cognitive services
ImageCropper
✂️ Detect and crop faces, barcodes, texts or rectangle in image with iOS 11 Vision (iOS 10 Core Image) api.(图片裁剪:支持人脸、二维码/条形码、文本、方框)
e-verest
EVEREST: e-Versatile Research Stick for peoples
HRFormer
This is an official implementation of our NeurIPS 2021 paper "HRFormer: High-Resolution Transformer for Dense Prediction".
AutonomousPrecisionLanding
Precision landing on a visual target using OpenCV and dronekit-python
TextDetect
This app detects the text from the picture input using camera or photos gallery. The app uses MLVisionTextModel for on device detection. The Vision framework from MLKit of Google is used here.
vision-ml
A R-CNN machine learning model for handling Pop-up window in mobile Apps.
halonet-pytorch
Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones
Vision CoreML-App
This app predicts the age of a person from the picture input using camera or photos gallery. The app uses Core ML framework of iOS for the predictions. The Vision library of CoreML is used here. The trained model fed to the system is AgeNet.
iOS14-Resources
A curated collection of iOS 14 projects ranging from SwiftUI to ML, AR etc.
fuse-med-ml
A python framework accelerating ML based discovery in the medical field by encouraging code reuse. Batteries included :)
vision-api
Google Vision API made easy!
VidSitu
[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)
flutter-vision
iOS and Android app built with Flutter and Firebase. Includes Firebase ML Vision, Firestore, and Storage
non-contact-sleep-apnea-detection
Gihan Jayatilaka, Harshana Weligampola, Suren Sritharan, Pankayaraj Pathmanathan, Roshan Ragel and Isuru Nawinne, "Non-contact Infant Sleep Apnea Detection," 2019 14th Conference on Industrial and Information Systems (ICIIS), Kandy, Sri Lanka, 2019, pp. 260-265, doi: 10.1109/ICIIS47346.2019.9063269.
FaceData
A macOS app to parse face landmarks from a video for GANs training
mlp-mixer-pytorch
An All-MLP solution for Vision, from Google AI
SAPC-APCA
APCA (Accessible Perceptual Contrast Algorithm) is a new method for predicting contrast for use in emerging web standards (WCAG 3) for determining readability contrast. APCA is derived form the SAPC (S-LUV Advanced Predictive Color) which is an accessibility-oriented color appearance model designed for self-illuminated displays.
calvin
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
61-120 of 138 vision projects