All Projects → georgian-io → foreshadow

georgian-io / foreshadow

Licence: Apache-2.0 license
An automatic machine learning system

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to foreshadow

Hungabunga
HungaBunga: Brute-Force all sklearn models with all parameters using .fit .predict!
Stars: ✭ 614 (+2017.24%)
Mutual labels:  sklearn, automl
human-in-the-loop-machine-learning-tool-tornado
Tornado is a human-in-the-loop machine learning framework that helps you exploit your unlabelled data to train models through a simple and easy to use web interface.
Stars: ✭ 37 (+27.59%)
Mutual labels:  sklearn, automl
Auto ts
Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost Models on Time Series data sets with a Single Line of Code. Now updated with Dask to handle millions of rows.
Stars: ✭ 195 (+572.41%)
Mutual labels:  sklearn, automl
codeflare
Simplifying the definition and execution, scaling and deployment of pipelines on the cloud.
Stars: ✭ 163 (+462.07%)
Mutual labels:  sklearn, automl
Automl alex
State-of-the art Automated Machine Learning python library for Tabular Data
Stars: ✭ 132 (+355.17%)
Mutual labels:  sklearn, automl
Mlmodels
mlmodels : Machine Learning and Deep Learning Model ZOO for Pytorch, Tensorflow, Keras, Gluon models...
Stars: ✭ 145 (+400%)
Mutual labels:  sklearn, automl
Igel
a delightful machine learning tool that allows you to train, test, and use models without writing code
Stars: ✭ 2,956 (+10093.1%)
Mutual labels:  sklearn, automl
oboe
An AutoML pipeline selection system to quickly select a promising pipeline for a new dataset.
Stars: ✭ 76 (+162.07%)
Mutual labels:  automl
tymon
An AI Assistant More Than a Toolkit
Stars: ✭ 46 (+58.62%)
Mutual labels:  sklearn
trt pose hand
Real-time hand pose estimation and gesture classification using TensorRT
Stars: ✭ 137 (+372.41%)
Mutual labels:  sklearn
merkalysis
A marketing tool that helps you to market your products using organic marketing. This tool can potentially save you 1000s of dollars every year. The tool predicts the reach of your posts on social media and also suggests you hashtags for captions in such a way that it increases your reach.
Stars: ✭ 28 (-3.45%)
Mutual labels:  sklearn
S2E
Q. Yao, H. Yang, B. Han, G. Niu, J. Kwok. Searching to Exploit Memorization Effect in Learning from Noisy Labels. ICML 2020
Stars: ✭ 18 (-37.93%)
Mutual labels:  automl
open-box
Generalized and Efficient Blackbox Optimization System.
Stars: ✭ 64 (+120.69%)
Mutual labels:  automatic-machine-learning
NiaAML
Python automated machine learning framework.
Stars: ✭ 25 (-13.79%)
Mutual labels:  automl
flask-angular-data-science
Repository for a data science starter app using Flask, Angular and Docker. https://medium.com/@dvelsner/deploying-a-simple-machine-learning-model-in-a-modern-web-application-flask-angular-docker-a657db075280
Stars: ✭ 84 (+189.66%)
Mutual labels:  sklearn
Word2VecAndTsne
Scripts demo-ing how to train a Word2Vec model and reduce its vector space
Stars: ✭ 45 (+55.17%)
Mutual labels:  sklearn
sdsj-automl
Sberbank Data Science Jorney Auto-ML competition
Stars: ✭ 28 (-3.45%)
Mutual labels:  automl
ml course
"Learning Machine Learning" Course, Bogotá, Colombia 2019 #LML2019
Stars: ✭ 22 (-24.14%)
Mutual labels:  sklearn
techloop-ml-plus
Archives and Tasks for ML+ sessions
Stars: ✭ 23 (-20.69%)
Mutual labels:  sklearn
data sciences campaign
【数据科学家系列课程】
Stars: ✭ 91 (+213.79%)
Mutual labels:  automatic-machine-learning

Foreshadow: Simple Machine Learning Scaffolding

BuildStatus Documentation Status Coverage Code Style License

Foreshadow is an automatic pipeline generation tool that makes creating, iterating, and evaluating machine learning pipelines a fast and intuitive experience allowing data scientists to spend more time on data science and less time on code.

Key Features

  • Scikit-Learn compatible
  • Automatic column intent inference
    • Numerical
    • Categorical
    • Text
    • Droppable (All values in a column are either the same or different)
  • Allow user override on column intent and transformation functions
  • Automatic feature preprocessing depending on the column intent type
    • Numerical: imputation followed by scaling
    • Categorical: a variety of categorical encoding
    • Text: TFIDF followed by SVD
  • Automatic model selection
  • Rapid pipeline development / iteration

Features in the road map

  • Automatic feature engineering
  • Automatic parameter optimization

Foreshadow supports python 3.6+

Installing Foreshadow

$ pip install foreshadow

Read the documentation to set up the project from source.

Getting Started

To get started with foreshadow, install the package using pip install. This will also install the dependencies. Now create a simple python script that uses all the defaults with Foreshadow.

First import foreshadow

from foreshadow.foreshadow import Foreshadow
from foreshadow.estimators import AutoEstimator
from foreshadow.utils import ProblemType

Also import sklearn, pandas, and numpy for the demo

import pandas as pd

from sklearn.datasets import boston_housing
from sklearn.model_selection import train_test_split

Now load in the boston housing dataset from sklearn into pandas dataframes. This is a common dataset for testing machine learning models and comes built in to scikit-learn.

boston = load_boston()
bostonX_df = pd.DataFrame(boston.data, columns=boston.feature_names)
bostony_df = pd.DataFrame(boston.target, columns=['target'])

Next, exactly as if working with an sklearn estimator, perform a train test split on the data and pass the train data into the fit function of a new Foreshadow object

X_train, X_test, y_train, y_test = train_test_split(bostonX_df,
   bostony_df, test_size=0.2)

problem_type = ProblemType.REGRESSION

estimator = AutoEstimator(
    problem_type=problem_type,
    auto="tpot",
    estimator_kwargs={"max_time_mins": 1},
)
shadow = Foreshadow(estimator=estimator, problem_type=problem_type)
shadow.fit(X_train, y_train)

Now fs is a fit Foreshadow object for which all feature engineering has been performed and the estimator has been trained and optimized. It is now possible to utilize this exactly as a fit sklearn estimator to make predictions.

shadow.score(X_test, y_test)

Great, you now have a working Foreshaow installation! Keep reading to learn how to export, modify and construct pipelines of your own.

Tutorial

We also have a jupyter notebook tutorial to go through more details under the examples folder.

Documentation

Read the docs!

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].