All Projects → DtCarrot → sg-food-ml

DtCarrot / sg-food-ml

Licence: other
This script is used to scrap images from the Internet to classify 5 common noodle "mee" dishes in Singapore. Wanton Mee, Bak Chor Mee, Lor Mee, Prawn Mee and Mee Siam.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to sg-food-ml

activemodel-datastore
Ruby on Rails with Active Model and Google Cloud Datastore. Extracted from Agrimatics Aero.
Stars: ✭ 47 (+161.11%)
Mutual labels:  google-cloud
kube-secrets-init
Kubernetes mutating webhook for `secrets-init` injection
Stars: ✭ 106 (+488.89%)
Mutual labels:  google-cloud
swarm-gcp-faas
Setup OpenFaaS on Google Cloud with Terraform, Docker Swarm and Weave
Stars: ✭ 15 (-16.67%)
Mutual labels:  google-cloud
Scrapping
Mastering the art of scrapping 🎓
Stars: ✭ 24 (+33.33%)
Mutual labels:  scraping
google-cloud-powershell
PowerShell cmdlets for the Google Cloud Platform
Stars: ✭ 120 (+566.67%)
Mutual labels:  google-cloud
monolog-google-cloud-json-formatter
A Monolog extension for formatting log entries for Google Cloud Logging
Stars: ✭ 15 (-16.67%)
Mutual labels:  google-cloud
copycat
A PHP Scraping Class
Stars: ✭ 70 (+288.89%)
Mutual labels:  scraping
google-drive-direct-remote-upload
Directly upload files from the web to Google Drive without saving them completely on the server before.
Stars: ✭ 62 (+244.44%)
Mutual labels:  google-cloud
scavenger
Scrape and take screenshots of dynamic and static webpages
Stars: ✭ 14 (-22.22%)
Mutual labels:  scraping
bigquery-to-datastore
Export a whole BigQuery table to Google Datastore with Apache Beam/Google Dataflow
Stars: ✭ 56 (+211.11%)
Mutual labels:  google-cloud
gtoken
Securely access AWS services from GKE cluster
Stars: ✭ 43 (+138.89%)
Mutual labels:  google-cloud
SGP.NET
C# SGP4 Satellite Prediction Library. Load satellites from TLEs, convert between coordinate systems and reference frames, observe satellites from ground stations, and more.
Stars: ✭ 15 (-16.67%)
Mutual labels:  prediction
bigquery-kafka-connect
☁️ nodejs kafka connect connector for Google BigQuery
Stars: ✭ 17 (-5.56%)
Mutual labels:  google-cloud
anime-scraper
[partially working] Scrape and add anime episode stream URLs to uGet (Linux) or IDM (Windows) ~ Python3
Stars: ✭ 21 (+16.67%)
Mutual labels:  scraping
angel.co-companies-list-scraping
No description or website provided.
Stars: ✭ 54 (+200%)
Mutual labels:  scraping
The-Purchase-and-Redemption-Forecast-Challenge-baseline
天池“资金流入流出预测——挑战baseline”的解决方案,线上效果143.5
Stars: ✭ 78 (+333.33%)
Mutual labels:  prediction
browser-pool
A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.
Stars: ✭ 71 (+294.44%)
Mutual labels:  scraping
google translate diff
Google Translate API wrapper translates only changes between revisions of big texts
Stars: ✭ 51 (+183.33%)
Mutual labels:  google-cloud
dataflow-contact-center-speech-analysis
Speech Analysis Framework, a collection of components and code from Google Cloud that you can use to transcribe audio files to create analytics.
Stars: ✭ 46 (+155.56%)
Mutual labels:  google-cloud
torchestrator
Spin up Tor containers and then proxy HTTP requests via these Tor instances
Stars: ✭ 32 (+77.78%)
Mutual labels:  scraping

sg-food-ml

This script is used to scrap images from the Internet to classify 5 common noodle "mee" dishes in Singapore. Wanton Mee, Bak Chor Mee, Lor Mee, Prawn Mee and Mee Siam.

After scraping, the script will automatically upload the images to the indicated Google Cloud Storage bucket and import them into the AutoML dataset. It will then automatically create a new model (train the dataset) which will take a while.

I have written an article to explain in detail how this script works.

Prediction

In order to make prediction programatically, I have included the predict.py script. To execute the script, run the following command

python predict.py <path of image we will like to receive>

Limitations

  1. At this point of time, the scraper can only scrap 100 images per keyword. In order to scrap more than 100 images, you will need to install Selenium driver along with chromedriver. We are using [google-images-download] (https://help.github.com/articles/basic-writing-and-formatting-syntax/#links) under the hoods, so you can refer to their Github page on how to do so.

  2. Currently, I haven't automated the logic to automatically create a new bucket on Google Cloud Storage, you will need to manually create your own bucket.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].