All Categories → Machine Learning → computer-vision

Top 2166 computer-vision open source projects

Grl
Robotics tools in C++11. Implements soft real time arm drivers for Kuka LBR iiwa plus V-REP, ROS, Constrained Optimization based planning, Hand Eye Calibration and Inverse Kinematics integration.
Ios ml
List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.
Handtracking
Building a Real-time Hand-Detector using Neural Networks (SSD) on Tensorflow
Planematch
[ECCV'18 Oral] PlaneMatch: Patch Coplanarity Prediction for Robust RGB-D Reconstruction
Cnstream
CNStream is a streaming framework for building Cambricon machine learning pipelines http://forum.cambricon.com https://gitee.com/SolutionSDK/CNStream
Chilitags
Robust Fiducial Markers for Augmented Reality And Robotics
Graphcut
Graph cut image segmentation with custom GUI.
Pose Interpreter Networks
Real-Time Object Pose Estimation with Pose Interpreter Networks (IROS 2018)
Crowd counting from scratch
This is an overview and tutorial about crowd counting. In this repository, you can learn how to estimate number of pedestrians in crowd scenes through computer vision and deep learning.
Wb srgb
White balance camera-rendered sRGB images (CVPR 2019) [Matlab & Python]
Efficientdet.pytorch
Implementation EfficientDet: Scalable and Efficient Object Detection in PyTorch
Fast methods
N-Dimensional Fast Methods: Fast Marching, Fast Sweeping, Group Marching, Fast Iterative, etc.
Pytorch shake shake
A PyTorch implementation of shake-shake
Hass Deepstack Face
Home Assistant custom component for using Deepstack face recognition
Awesome Image Alignment And Stitching
A curated list of awesome resources for image alignment and stitching ...
Airbnb Amenity Detection
Repo for 42 days project to replicate/improve Airbnb's amenity (object) detection pipeline.
Comicolorization
This is the implementation of the "Comicolorization: Semi-automatic Manga Colorization"
Curved Lane Lines
detect curved lane lines using HSV filtering and sliding window search.
Mivisionx
MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.
Autoassign
Pytorch implementation of "AutoAssign: Differentiable Label Assignment for Dense Object Detection"
D2l En
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 300 universities from 55 countries including Stanford, MIT, Harvard, and Cambridge.
Antialiased Cnns
pip install antialiased-cnns to improve stability and accuracy
Awesome Machine Learning
📖 List of some awesome university courses for Machine Learning! Feel free to contribute!
Papers
读过的CV方向的一些论文,图像生成文字、弱监督分割等
Atlasnetv2
This repository contains the source codes for the paper AtlasNet V2 - Learning Elementary Structures.
Autoalbument
AutoML for image augmentation. AutoAlbument uses the Faster AutoAugment algorithm to find optimal augmentation policies. Documentation - https://albumentations.ai/docs/autoalbument/
Pytorch Fcn
PyTorch Implementation of Fully Convolutional Networks. (Training code to reproduce the original result is available.)
Objectron
Objectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes
Imghash
Perceptual image hashing for Node.js
Exprgan
Facial Expression Editing with Controllable Expression Intensity
Neural Light Transport
Code and Data Release for Neural Light Transport (NLT)
Baidu Dogs
Baidu competition for classifying dogs. More information is provided at http://js.baidu.com
Driving In The Matrix
Steps to reproduce training results for the paper Driving in the Matrix: Can Virtual Worlds Replace Human-Generated Annotations for Real World Tasks?
R2d2
[ICLR'19] Meta-learning with differentiable closed-form solvers
Isic2018
ISIC 2018: Skin Lesion Analysis Towards Melanoma Detection
Holocron
PyTorch implementations of recent Computer Vision tricks
Porousmediagan
Reconstruction of three-dimensional porous media using generative adversarial neural networks
Region Conv
Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade
Bdrar
Code for the ECCV 2018 paper "Bidirectional Feature Pyramid Network with Recurrent Attention Residual Modules for Shadow Detection"
Forensic
Copy-move image forgery detection library.
360sd Net
Pytorch implementation of ICRA 2020 paper "360° Stereo Depth Estimation with Learnable Cost Volume"
Neural Api
CAI NEURAL API - Pascal based neural network API optimized for AVX, AVX2 and AVX512 instruction sets plus OpenCL capable devices including AMD, Intel and NVIDIA.
Deeplpf
Code for CVPR 2020 paper "Deep Local Parametric Filters for Image Enhancement"
Shapegf
Learning Gradient Fields for Shape Generation
Hellovision
Vision framework example for my article. https://medium.com/compileswift/swift-world-whats-new-in-ios-11-vision-456ba4156bad
Super Resolution Videos
Applying SRGAN technique implemented in https://github.com/zsdonghao/SRGAN on videos to super resolve them.
Vision Transformer
Tensorflow implementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
Dped
Software and pre-trained models for automatic photo quality enhancement using Deep Convolutional Networks
Wave geometry
Manifold geometry with fast automatic derivatives and coordinate frame semantics checking
481-540 of 2166 computer-vision projects