All Projects → Featuretools → Automated Manual Comparison

Featuretools / Automated Manual Comparison

Licence: bsd-3-clause
Automated vs Manual Feature Engineering Comparison. Implemented using Featuretools.

Projects that are alternatives of or similar to Automated Manual Comparison

Stock market
Stars: ✭ 286 (-1.72%)
Mutual labels:  jupyter-notebook
Dianping textmining
大众点评评论文本挖掘,包括点评数据爬取、数据清洗入库、数据分析、评论情感分析等的完整挖掘项目
Stars: ✭ 289 (-0.69%)
Mutual labels:  jupyter-notebook
Mathtoolsforneuroscience
Materials for Mathematical Tools for Neuroscience course at Harvard (Neurobio 212)
Stars: ✭ 287 (-1.37%)
Mutual labels:  jupyter-notebook
Thinking In Tensors Writing In Pytorch
Thinking in tensors, writing in PyTorch (a hands-on deep learning intro)
Stars: ✭ 287 (-1.37%)
Mutual labels:  jupyter-notebook
Bokeh Cn
Bokeh中文文档(翻译中)
Stars: ✭ 288 (-1.03%)
Mutual labels:  jupyter-notebook
Image Captioning
Image Captioning using InceptionV3 and beam search
Stars: ✭ 290 (-0.34%)
Mutual labels:  jupyter-notebook
Google Drive Online Decompression
使用Google Colab对Google Drive里面的压缩包进行操作,支持7z和rar以及zip等格式,引擎采用unrar和unzip以及7z
Stars: ✭ 288 (-1.03%)
Mutual labels:  jupyter-notebook
Coms4995 S19
COMS W4995 Applied Machine Learning - Spring 19
Stars: ✭ 292 (+0.34%)
Mutual labels:  jupyter-notebook
Tensorflow Glove
An implementation of GloVe in TensorFlow
Stars: ✭ 288 (-1.03%)
Mutual labels:  jupyter-notebook
Pyssim
A Python module for computing the Structural Similarity Image Metric (SSIM)
Stars: ✭ 290 (-0.34%)
Mutual labels:  jupyter-notebook
Building A Simple Chatbot In Python Using Nltk
Building a Simple Chatbot from Scratch in Python (using NLTK)
Stars: ✭ 286 (-1.72%)
Mutual labels:  jupyter-notebook
Code
Compilation of R and Python programming codes on the Data Professor YouTube channel.
Stars: ✭ 287 (-1.37%)
Mutual labels:  jupyter-notebook
Sscnet
Semantic Scene Completion from a Single Depth Image
Stars: ✭ 290 (-0.34%)
Mutual labels:  jupyter-notebook
Python
This repository helps you understand python from the scratch.
Stars: ✭ 285 (-2.06%)
Mutual labels:  jupyter-notebook
Covid Model
Stars: ✭ 292 (+0.34%)
Mutual labels:  jupyter-notebook
Fire Detect Yolov4
fire-smoke-detection-dataset and fire-detection-yolov4-v5,火灾检测,烟雾检测
Stars: ✭ 280 (-3.78%)
Mutual labels:  jupyter-notebook
Simple Faster Rcnn Pytorch
A simplified implemention of Faster R-CNN that replicate performance from origin paper
Stars: ✭ 3,422 (+1075.95%)
Mutual labels:  jupyter-notebook
Fpn tensorflow
This is a tensorflow re-implementation of Feature Pyramid Networks for Object Detection.
Stars: ✭ 291 (+0%)
Mutual labels:  jupyter-notebook
G Darknet
darknet with GIoU
Stars: ✭ 292 (+0.34%)
Mutual labels:  jupyter-notebook
Python for data science
A rapid on-ramp primer for programmers who want to learn Python for doing data science research and development.
Stars: ✭ 290 (-0.34%)
Mutual labels:  jupyter-notebook

Manual vs Automated Feature Engineering Comparison

The traditional process of manual feature engineering requires building one feature at a time by hand informed by domain knowledge. This is tedious, time-consuming, error prone, and perhaps most importantly, specific to each dataset, which means the code will have to be re-written for each problem.

Automated feature engineering with Featuretools allows one to create thousands of features automatically from a set of related tables using a framework that can be easily applied to any problem.

Featuretools

Highlights

Featuretools offers us the following benefits:

  1. Up to 10x reduction in development time
  2. Better predictive performance
  3. Interpretable features with real-world significance
  4. Fits into existing machine learning pipelines
  5. Ensures data is valid in time-series problems

Automated feature engineering will change the way you do machine learning by allowing you to develop better predictive models in a fraction of the time as the traditional approach.

Article

For the highlights of the project, check out "Why Automated Feature Engineering Will Change the Way You Do Machine Learning" on Towards Data Science (Link)

Results

Each of the 3 projects in this repository demonstrates different benefits of using automated feature enginering.

  1. Loan Repayment Prediction: Build Better Models Faster

Given a dataset of 58 millions rows spread across 7 tables and the task of predicting whether or not a client will default on a loan, Featuretools delivered a better predictive model in a fraction of the time as manual feature engineering. The features built by Featuretools are also human-intrepretable and can give us insight into the problem:

  1. Retail Spending Prediction: Ensure Models Use Valid Data

When we have time-series data, we traditionally have to be extremely careful about making sure our model only trains on valid data. Often, a model will work in development only to completely fail in deployment because the training data was not properly filtered based on the time. Featuretools can take care of time filters automatically, allowing us to focus on other aspects of the machine learning pipeline and delivering better overall predictive models:

  1. Engine Life Prediction: Automatically Create Meaningful Features

In this problem of predicting how long an engine will run until it fails, we observe that Featuretools creates meaningful features which can inform our thinking about real-world problems as seen in the most important features:

Scaling with Dask

For examples of how Featuretools can scale - either on a single machine or a cluster - see the Feature Matrix with Dask EntitySet and Featuretools on Dask notebooks.

Feature Labs

Featuretools

Featuretools is an open source project created by Feature Labs. To see the other open source projects we're working on visit Feature Labs Open Source. If building impactful data science pipelines is important to you or your business, please get in touch.

Contact

Any questions can be directed to [email protected]

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].