rish-16 / Sight

Licence: apache-2.0
👁 Sightseer: TensorFlow library for state-of-the-art Computer Vision and Object Detection models

State-of-the-art Computer Vision and Object Detection for TensorFlow.

Made by Rishabh Anand • https://rish-16.github.io

sightseer provides state-of-the-art general-purpose architectures (YOLOv3, MaskRCNN, Fast/Faster RCNN, SSD...) for Computer Vision and Object Detection tasks with 30+ pretrained models written in TensorFlow 1.15.

Installation

sightseer is written in Python 3.5+ and TensorFlow 1.15.

Ideally, sightseer should be installed in a virtual environment. If you're unfamiliar with Python virtual environments, check out this tutorial on getting started.

Via PyPI

To use sightseer, you must first have TensorFlow installed. To do so, follow the instructions on the TensorFlow installation page.

When your virtual environment is set up with TensorFlow, you can install sightseer using pip:

pip install sightseer
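
Because sightseer targets TensorFlow 1.15, it can help to confirm the installed version before using the library. The following is a minimal sketch and not part of sightseer: the helper name `is_supported_tf` is hypothetical, and the check assumes only that a 1.15.x version string is what the library expects.

```python
def is_supported_tf(version, required=(1, 15)):
    """Return True if a version string such as "1.15.2" matches the required major.minor."""
    parts = version.split(".")
    try:
        major_minor = (int(parts[0]), int(parts[1]))
    except (IndexError, ValueError):
        # Malformed or unexpected version strings are treated as unsupported.
        return False
    return major_minor == required


# Usage, once TensorFlow is installed:
#   import tensorflow as tf
#   print(is_supported_tf(tf.__version__))
```

The helper takes a plain string, so it can be tried without TensorFlow present.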

Model Clients (as of now)

  1. YOLOv3Client (Darknet by Joseph Redmon)

By popular demand, Tiny YOLO will be out in the v1.2.0 release. For more information on model releases, check out the Roadmap.

Components of sightseer

The package has four major components that cover the object detection process, from preparing your raw data to getting predictions and displaying them.

| Component | Description |
| --- | --- |
| Sightseer | Obtains image data or video footage |
| Proc | Provides image/frame-wise annotation and inter-format conversion tools |
| Zoo | Stores the wrappers over all state-of-the-art models and configs |
| Serve | Provides deployment and model serving protocols and services |

If you are not using custom datasets, Sightseer and Zoo are the main submodules used for generic predictions from pretrained models. When custom data is involved, you can use Proc to annotate your datasets and convert them between XML, JSON, CSV, and TFRecord formats.
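
To make the idea of inter-format conversion concrete, here is a minimal standard-library sketch that turns a Pascal VOC-style XML annotation into CSV rows. It does not use Proc, whose API is not shown in this README; the function name `voc_xml_to_csv`, the sample annotation, and the column order are assumptions chosen for illustration.

```python
import csv
import io
import xml.etree.ElementTree as ET


def voc_xml_to_csv(xml_text):
    """Convert one Pascal VOC-style annotation (given as a string) to CSV text."""
    root = ET.fromstring(xml_text.strip())
    filename = root.findtext("filename", default="")
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["filename", "label", "xmin", "ymin", "xmax", "ymax"])
    for obj in root.findall("object"):
        box = obj.find("bndbox")
        if box is None:
            continue  # skip objects without a bounding box
        coords = [int(float(box.findtext(key))) for key in ("xmin", "ymin", "xmax", "ymax")]
        writer.writerow([filename, obj.findtext("name", default="")] + coords)
    return out.getvalue()


# Hypothetical sample annotation, for illustration only.
SAMPLE = """
<annotation>
  <filename>road.jpg</filename>
  <object>
    <name>car</name>
    <bndbox><xmin>48</xmin><ymin>240</ymin><xmax>195</xmax><ymax>371</ymax></bndbox>
  </object>
</annotation>
"""

print(voc_xml_to_csv(SAMPLE))
```

Each `object` element becomes one CSV row, so an image with several annotated objects yields several rows sharing the same filename.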

Serve is an experimental productionising submodule that helps deploy your models on cloud services like AWS and GCP. For more details on future tools and services, check out the Roadmap.

Features

Footage or raw images can be rendered using Sightseer before being ingested into models or further preprocessed.

1a. Loading images

from sightseer import Sightseer

ss = Sightseer()
image = ss.load_image("path/to/image") # returns a NumPy array representation of the image

1b. Loading videos

from sightseer import Sightseer

ss = Sightseer()
frames = ss.load_vidsource("path/to/video") # returns nested array of frames

Support for video, webcam footage, and screen recording will be out in the coming v1.2.0 release.

2. Using models from sightseer.zoo

Once installed, any model offered by sightseer can be accessed in fewer than 10 lines of code. For instance, the code to use the YOLOv3 (Darknet) model is as follows:

from sightseer import Sightseer
from sightseer.zoo import YOLOv3Client

yolo = YOLOv3Client()
yolo.load_model() # downloads weights

# loading image from local system
ss = Sightseer()
image = ss.load_image("./assets/road.jpg")

# getting labels, confidence scores, and bounding box data
preds, pred_img = yolo.predict(image, return_img=True)
ss.render_image(pred_img)

To run the model on frames from a video, you can use the framewise_predict method:

from sightseer import Sightseer
from sightseer.zoo import YOLOv3Client

yolo = YOLOv3Client()
yolo.load_model() # downloads weights

# loading video from local system
ss = Sightseer()
frames = ss.load_vidsource("./assets/video.mp4")

# For best results, run on a GPU.
# getting labels, confidence scores, and bounding box data
preds, pred_frames = yolo.framewise_predict(frames)
ss.render_footage(pred_frames) # plays the video and saves the footage
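
Conceptually, frame-wise prediction applies a single-image predictor to each frame in turn. The sketch below illustrates that idea with a stand-in predictor. It is not sightseer's implementation, and `framewise`, `predict_fn`, and `dummy_predict` are hypothetical names.

```python
def framewise(predict_fn, frames):
    """Apply predict_fn to each frame and collect predictions and annotated frames."""
    all_preds, out_frames = [], []
    for frame in frames:
        preds, annotated = predict_fn(frame)
        all_preds.append(preds)
        out_frames.append(annotated)
    return all_preds, out_frames


def dummy_predict(frame):
    """Stand-in for a real detector: reports the frame size and returns the frame unchanged."""
    return {"height": len(frame), "width": len(frame[0])}, frame


# Three tiny frames, each with 4 rows and 3 columns, as nested lists.
frames = [[[0, 0, 0]] * 4 for _ in range(3)]
preds, pred_frames = framewise(dummy_predict, frames)
print(len(preds), preds[0])
```

Because every frame goes through the model separately, run time grows roughly linearly with the number of frames, which is why a GPU is recommended for video.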

The module can even be repurposed into a Command-line Interface (CLI) app using the argparse library.
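
As a rough sketch of that idea, the script below wraps the image example above in an argparse CLI. The argument names (`image`, `--no-render`) and the `build_parser` and `main` helpers are hypothetical; only the sightseer calls already shown in this README are used, and they are imported lazily so that argument parsing can be tried without the library installed.

```python
import argparse


def build_parser():
    """Build the argument parser for a hypothetical single-image detection CLI."""
    parser = argparse.ArgumentParser(description="Run YOLOv3 on one image with sightseer")
    parser.add_argument("image", help="path to the input image")
    parser.add_argument("--no-render", action="store_true",
                        help="print predictions without displaying the image")
    return parser


def main(argv=None):
    args = build_parser().parse_args(argv)

    # Imported here so that --help works even if sightseer is not installed.
    from sightseer import Sightseer
    from sightseer.zoo import YOLOv3Client

    yolo = YOLOv3Client()
    yolo.load_model()  # downloads weights

    ss = Sightseer()
    image = ss.load_image(args.image)
    preds, pred_img = yolo.predict(image, return_img=True)
    print(preds)
    if not args.no_render:
        ss.render_image(pred_img)
```

Calling `main()` from an `if __name__ == "__main__":` guard at the bottom of the file would make it runnable from a shell, for example with `./assets/road.jpg` as the image argument.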

Contributing

Suggestions, improvements, and enhancements are always welcome! If you have any issues, please do raise one in the Issues section. If you have an improvement, do file an issue to discuss the suggestion before creating a PR.

All ideas, no matter how outrageous, welcome!

Before committing, please check the Roadmap to see whether the proposed features are already in development.

Note: Please commit all changes to the development experimentation branch instead of master.

Licence

Apache Licence 2.0
