All Projects → gagolucasm → Classify-Real-Time-Desktop

gagolucasm / Classify-Real-Time-Desktop

Licence: MIT license
Inception model used to classify camera feed on real time. Coded during the Deep Learning Hackathon 2017 San Francisco

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Classify-Real-Time-Desktop

Pneumonia Detection From Chest X Ray Images With Deep Learning
Detecting Pneumonia in Chest X-ray Images using Convolutional Neural Network and Pretrained Models
Stars: ✭ 64 (+45.45%)
Mutual labels:  image-recognition, inception
Cordova Plugin Tensorflow
On-device image recognition via TensorFlow/Inception. For Cordova/PhoneGap.
Stars: ✭ 51 (+15.91%)
Mutual labels:  image-recognition, inception
TensorFlow-Binary-Image-Classification-using-CNN-s
Binary Image Classification in TensorFlow
Stars: ✭ 26 (-40.91%)
Mutual labels:  image-recognition
DeepNetModel
记录每一个常用的深度模型结构的特点(图和代码)
Stars: ✭ 25 (-43.18%)
Mutual labels:  inception
SnapStitch
Built by Avik Jain, Anish Nag, Sahas Dendukuri, Hari Senthilkumar, and Rishi Upadhyay at Angelhack Silicon Valley 2017.
Stars: ✭ 20 (-54.55%)
Mutual labels:  inception
mxnet-retrain
Create mxnet finetuner (retrain) for mac/linux ,no need install docker and supports CPU, GPU(eGpu/cudnn).support the inception,resnet ,squeeznet,mobilenet...
Stars: ✭ 32 (-27.27%)
Mutual labels:  inception
ImageRetrieval
Content Based Image Retrieval Techniques (e.g. knn, svm using MatLab GUI)
Stars: ✭ 51 (+15.91%)
Mutual labels:  image-recognition
TF2DeepFloorplan
TF2 Deep FloorPlan Recognition using a Multi-task Network with Room-boundary-Guided Attention. Enable tensorboard, quantization, flask, tflite, docker, github actions and google colab.
Stars: ✭ 98 (+122.73%)
Mutual labels:  image-recognition
UnityProminentColor
Tool to gather main colors of an image using Unity.
Stars: ✭ 40 (-9.09%)
Mutual labels:  image-recognition
visualsearch
Visual Search is a little app to find and cluster similar images using Tagbox
Stars: ✭ 31 (-29.55%)
Mutual labels:  image-recognition
jpetstore-kubernetes
Modernize and Extend: JPetStore on IBM Cloud Kubernetes Service
Stars: ✭ 21 (-52.27%)
Mutual labels:  image-recognition
Image-Classification
Pre-trained VGG-Net Model for image classification using tensorflow
Stars: ✭ 29 (-34.09%)
Mutual labels:  image-recognition
tensorflow-image-recognition-chrome-extension
Chrome browser extension for using TensorFlow image recognition on web pages
Stars: ✭ 88 (+100%)
Mutual labels:  image-recognition
image space
Interactive Image similarity and Visual Search and Retrieval application
Stars: ✭ 91 (+106.82%)
Mutual labels:  image-recognition
object-flaw-detector-cpp
Detect various irregularities of a product as it moves along a conveyor belt.
Stars: ✭ 19 (-56.82%)
Mutual labels:  image-recognition
Paper-Notes
Paper notes in deep learning/machine learning and computer vision
Stars: ✭ 37 (-15.91%)
Mutual labels:  image-recognition
Learnable-Image-Resizing
TF 2 implementation Learning to Resize Images for Computer Vision Tasks (https://arxiv.org/abs/2103.09950v1).
Stars: ✭ 48 (+9.09%)
Mutual labels:  image-recognition
LIT
[AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"
Stars: ✭ 79 (+79.55%)
Mutual labels:  image-recognition
TensorFlow-Multiclass-Image-Classification-using-CNN-s
Balanced Multiclass Image Classification with TensorFlow on Python.
Stars: ✭ 57 (+29.55%)
Mutual labels:  image-recognition
rps-cv
A Rock-Paper-Scissors game using computer vision and machine learning on Raspberry Pi
Stars: ✭ 102 (+131.82%)
Mutual labels:  image-recognition

Classify real time desktop and speech

Overview

Team DeepThings (Mez Gebre and I) won the Best Product Category at the Deep Learning Hackathon in San Francisco. We developed in three days a real-time system capable of identifying objects and speaking what it sees, thinking about making a useful tool for the visually impaired, as it could make navigation easier. Proof of concept on a laptop, final model running on Android.

This is only the first prototype for Windows.

The goals / steps of this project are the following:

  • Get the Webcam feed without bottlenecks.
  • Recognize images using Inception v3.
  • Text to speech with Google TTS API.
  • Making a functional model.
  • Tuninning the parameters.
  • Output visual display of the results.

Dependencies

This module requires:

Usage

Just run: python classify_real_time_v2.py

The output should look like this:

alt text

More details

For more information, check my medium post here

Licence

This proyect is Copyright © 2016-2017 Lucas Gago. It is free software, and may be redistributed under the terms specified in the MIT Licence.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].