GrlRobotics tools in C++11. Implements soft real time arm drivers for Kuka LBR iiwa plus V-REP, ROS, Constrained Optimization based planning, Hand Eye Calibration and Inverse Kinematics integration.
Ios mlList of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.
HandtrackingBuilding a Real-time Hand-Detector using Neural Networks (SSD) on Tensorflow
Planematch[ECCV'18 Oral] PlaneMatch: Patch Coplanarity Prediction for Robust RGB-D Reconstruction
CnstreamCNStream is a streaming framework for building Cambricon machine learning pipelines http://forum.cambricon.com https://gitee.com/SolutionSDK/CNStream
ChilitagsRobust Fiducial Markers for Augmented Reality And Robotics
GraphcutGraph cut image segmentation with custom GUI.
Crowd counting from scratchThis is an overview and tutorial about crowd counting. In this repository, you can learn how to estimate number of pedestrians in crowd scenes through computer vision and deep learning.
Wb srgbWhite balance camera-rendered sRGB images (CVPR 2019) [Matlab & Python]
Fast methodsN-Dimensional Fast Methods: Fast Marching, Fast Sweeping, Group Marching, Fast Iterative, etc.
SegmentationTensorflow implementation : U-net and FCN with global convolution
ComicolorizationThis is the implementation of the "Comicolorization: Semi-automatic Manga Colorization"
Curved Lane Linesdetect curved lane lines using HSV filtering and sliding window search.
MivisionxMIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.
AutoassignPytorch implementation of "AutoAssign: Differentiable Label Assignment for Dense Object Detection"
CaireContent aware image resize library
D2l EnInteractive deep learning book with multi-framework code, math, and discussions. Adopted at 300 universities from 55 countries including Stanford, MIT, Harvard, and Cambridge.
Universal Data ToolCollaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.
Papers读过的CV方向的一些论文,图像生成文字、弱监督分割等
Atlasnetv2This repository contains the source codes for the paper AtlasNet V2 - Learning Elementary Structures.
AutoalbumentAutoML for image augmentation. AutoAlbument uses the Faster AutoAugment algorithm to find optimal augmentation policies. Documentation - https://albumentations.ai/docs/autoalbument/
Pytorch FcnPyTorch Implementation of Fully Convolutional Networks. (Training code to reproduce the original result is available.)
ObjectronObjectron is a dataset of short, object-centric video clips. In addition, the videos also contain AR session metadata including camera poses, sparse point-clouds and planes. In each video, the camera moves around and above the object and captures it from different views. Each object is annotated with a 3D bounding box. The 3D bounding box describes the object’s position, orientation, and dimensions. The dataset contains about 15K annotated video clips and 4M annotated images in the following categories: bikes, books, bottles, cameras, cereal boxes, chairs, cups, laptops, and shoes
ImghashPerceptual image hashing for Node.js
ExprganFacial Expression Editing with Controllable Expression Intensity
Baidu DogsBaidu competition for classifying dogs. More information is provided at http://js.baidu.com
Driving In The MatrixSteps to reproduce training results for the paper Driving in the Matrix: Can Virtual Worlds Replace Human-Generated Annotations for Real World Tasks?
R2d2[ICLR'19] Meta-learning with differentiable closed-form solvers
Isic2018ISIC 2018: Skin Lesion Analysis Towards Melanoma Detection
HolocronPyTorch implementations of recent Computer Vision tricks
PorousmediaganReconstruction of three-dimensional porous media using generative adversarial neural networks
Region ConvNot All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade
BdrarCode for the ECCV 2018 paper "Bidirectional Feature Pyramid Network with Recurrent Attention Residual Modules for Shadow Detection"
ForensicCopy-move image forgery detection library.
360sd NetPytorch implementation of ICRA 2020 paper "360° Stereo Depth Estimation with Learnable Cost Volume"
Neural ApiCAI NEURAL API - Pascal based neural network API optimized for AVX, AVX2 and AVX512 instruction sets plus OpenCL capable devices including AMD, Intel and NVIDIA.
DeeplpfCode for CVPR 2020 paper "Deep Local Parametric Filters for Image Enhancement"
ShapegfLearning Gradient Fields for Shape Generation
HellovisionVision framework example for my article. https://medium.com/compileswift/swift-world-whats-new-in-ios-11-vision-456ba4156bad
Super Resolution VideosApplying SRGAN technique implemented in https://github.com/zsdonghao/SRGAN on videos to super resolve them.
Vision TransformerTensorflow implementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
DpedSoftware and pre-trained models for automatic photo quality enhancement using Deep Convolutional Networks
Wave geometryManifold geometry with fast automatic derivatives and coordinate frame semantics checking