Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.

Stars: ✭ 273 (+378.95%)

Mutual labels: jupyter-notebook, dataset

Caffenet Benchmark

Evaluation of the CNN design choices performance on ImageNet-2012.

Stars: ✭ 700 (+1128.07%)

Mutual labels: jupyter-notebook, dataset

Hate Speech And Offensive Language

Repository for the paper "Automated Hate Speech Detection and the Problem of Offensive Language", ICWSM 2017

Stars: ✭ 543 (+852.63%)

Mutual labels: jupyter-notebook, dataset

Covid19 twitter

Covid-19 Twitter dataset for non-commercial research use and pre-processing scripts - under active development

Stars: ✭ 304 (+433.33%)

Mutual labels: jupyter-notebook, dataset

Deep learning projects

Stars: ✭ 28 (-50.88%)

Mutual labels: jupyter-notebook, dataset

Datascience course

Curso de Data Science em Português

Stars: ✭ 294 (+415.79%)

Mutual labels: jupyter-notebook, dataset

Whylogs

Profile and monitor your ML data pipeline end-to-end

Stars: ✭ 328 (+475.44%)

Mutual labels: jupyter-notebook, dataset

Tehran Stocks

A python package to access tsetmc data

Stars: ✭ 282 (+394.74%)

Mutual labels: jupyter-notebook, dataset

Vpgnet

VPGNet: Vanishing Point Guided Network for Lane and Road Marking Detection and Recognition (ICCV 2017)

Stars: ✭ 382 (+570.18%)

Mutual labels: jupyter-notebook, dataset

Taco

🌮 Trash Annotations in Context Dataset Toolkit

Stars: ✭ 243 (+326.32%)

Mutual labels: jupyter-notebook, dataset

Dataset Api

The ApolloScape Open Dataset for Autonomous Driving and its Application.

Stars: ✭ 260 (+356.14%)

Mutual labels: jupyter-notebook, dataset

Tensor House

A collection of reference machine learning and optimization models for enterprise operations: marketing, pricing, supply chain

Stars: ✭ 449 (+687.72%)

Mutual labels: jupyter-notebook, models

Tedsds

Apache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark

Stars: ✭ 14 (-75.44%)

Mutual labels: jupyter-notebook, dataset

View All Similar Projects ➔

CinemaNet

CinemaNet is a set of data and trained models to help run inference to classify images / frames of a video with an eye for photographic, cinemgraphic, composition and color labelling.

CinemaNet aims to give out of the box useful classification of images / frames of video to cinematographers, editors, archivists, and anyone interested in extracting classification in a cinema / video context.

The Labels

The CinemaNet project aims to make a quasi knowledge graph of visual concepts useful to cinematographers, photographers, artists, designs, illustrators - and as such has labels ranging from composition theory to shot locations. The first round of label concents and categories is meant to provide an immediately helpful set of concepts and provide a baseline for the future. Note our label naming scheme uses a reverse DNS system - where top level naming helps to provide context for interpreting categories, concepts and sub-concepts.

The most up to date list of active labels can be viewed here:

See Labels.md for information on the taxonomy, list of categories and their concepts (might be slightly out of date).

The Data Set

Note, the raw data set imahges are only useful if you plan on training your own models or are interested in helping optimize, classify and iterate on the quality of the models. Generally speaking its probably not needed!

1: Check out the repository

Ensure that you have git checked out this repository or have done a download of this repository via the green clone or download button on the project page.

2: Install the dependencies for our data set download script:

Ensure you have PIP installed. Install Google Image Downloader and install Google Chrome Driver with a matching version to your currently installed Google Chrome browser (for me, it was 74.x). Google Chrome Driver is required to download more than 100 images per google image query.

2.1: Install PIP if necessary.

sudo easy_install pip in your Terminal.app command line.

2.2: Install Google Image Downloader via:

pip install google_images_download in your terminal.app

2.3: Install Chrome Driver

Check that you have a version of Google Chrome installed in the defaul /Applications/Google Chrome.app location. Launch Chrome and check the version number by going to 'About Chrome' in the Chrome Menu.

Download a matching version of Google Chrome Driver and place it into the same directory as these scripts.

3.0: Downloading the Data Set

You can then run python synopsis_categories_and_concepts_image_downloader.py to get the unfiltered raw data set - which will contain some noisy / misclassified images in the training set due to how Google Images returns results.

This download should be roughly 7.5 GB and contain roughly 63 thousand images sorted into a folder structure for the label category and concepts. The data set then requires manually pruneing from irrelevant or off topic images from the folder structure.

Training your own models

You can follow along with the Running Training Notes to see the steps we are taking if you want to train yourself.

Sign up for Googles AutoML Vision cloud service if you want to train your own model. At the time of this writing you will get approximately $300 in free credits.

See Running Training Notes for more info on training a model.

Running the Auto Labeller / Model Cleaner and other utilities

If you want to use the other utilities (auto labeler / HTML prediction preview script, model metadata and label name clean up script, and label print out script) you need to install Apples coremltools python package.

You can install via:

pip install -U coremltools

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 57

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (11) 🔗