All Projects → alteryx → Open_source_demos

alteryx / Open_source_demos

Licence: bsd-3-clause
A collection of demos showcasing automated feature engineering and machine learning in diverse use cases

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Open source demos

Learnpythonforresearch
This repository provides everything you need to get started with Python for (social science) research.
Stars: ✭ 163 (-58.31%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Thesemicolon
This repository contains Ipython notebooks and datasets for the data analytics youtube tutorials on The Semicolon.
Stars: ✭ 345 (-11.76%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Imodels
Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).
Stars: ✭ 194 (-50.38%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Pandas Videos
Jupyter notebook and datasets from the pandas Q&A video series
Stars: ✭ 1,716 (+338.87%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Functional intro to python
[tutorial]A functional, Data Science focused introduction to Python
Stars: ✭ 228 (-41.69%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Datasist
A Python library for easy data analysis, visualization, exploration and modeling
Stars: ✭ 123 (-68.54%)
Mutual labels:  jupyter-notebook, data-science, feature-engineering
Deltapy
DeltaPy - Tabular Data Augmentation (by @firmai)
Stars: ✭ 344 (-12.02%)
Mutual labels:  jupyter-notebook, data-science, feature-engineering
Kaggle Competitions
There are plenty of courses and tutorials that can help you learn machine learning from scratch but here in GitHub, I want to solve some Kaggle competitions as a comprehensive workflow with python packages. After reading, you can use this workflow to solve other real problems and use it as a template.
Stars: ✭ 86 (-78.01%)
Mutual labels:  jupyter-notebook, data-science, feature-engineering
Amazing Feature Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (-44.25%)
Mutual labels:  jupyter-notebook, data-science, feature-engineering
Tutorials
AI-related tutorials. Access any of them for free → https://towardsai.net/editorial
Stars: ✭ 204 (-47.83%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Python Data Science Handbook
A Chinese translation of Jake Vanderplas' "Python Data Science Handbook". 《Python数据科学手册》在线Jupyter notebook中文翻译
Stars: ✭ 102 (-73.91%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Cryptocurrency Analysis Python
Open-Source Tutorial For Analyzing and Visualizing Cryptocurrency Data
Stars: ✭ 278 (-28.9%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Recommenders
Best Practices on Recommendation Systems
Stars: ✭ 11,818 (+2922.51%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Scipy con 2019
Tutorial Sessions for SciPy Con 2019
Stars: ✭ 142 (-63.68%)
Mutual labels:  jupyter-notebook, data-science, tutorial
H2o Tutorials
Tutorials and training material for the H2O Machine Learning Platform
Stars: ✭ 1,305 (+233.76%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Trump Lies
Tutorial: Web scraping in Python with Beautiful Soup
Stars: ✭ 201 (-48.59%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Allstate capstone
Allstate Kaggle Competition ML Capstone Project
Stars: ✭ 72 (-81.59%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Learning python
Source material for Python Like You Mean it
Stars: ✭ 78 (-80.05%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Sc17
SuperComputing 2017 Deep Learning Tutorial
Stars: ✭ 211 (-46.04%)
Mutual labels:  jupyter-notebook, data-science, tutorial
Deep Learning Machine Learning Stock
Stock for Deep Learning and Machine Learning
Stars: ✭ 240 (-38.62%)
Mutual labels:  jupyter-notebook, data-science, feature-engineering

Alteryx Open Source Demos

This repository consists of a series of demos that leverage EvalML, Featuretools, Woodwork, and Compose. The demos rely on a different subset of these libraries and to various levels of complexity.

Building an accurate machine learning model requires several important steps. One of the most complex and time consuming is extracting information through features. Finding the right features is a crucial component of both interpreting the dataset as a whole as well as building a model with great predictive power. Another core component of any machine learning process is selecting the right estimator to use for the problem at hand. By combining the best features with the most accurate estimator and its corresponding hyperparameters, we can build a machine learning model that can generalize well to unknown data. Just as the process of feature engineering is made simple by Featuretools, we have made automated machine learning easy to implement using EvalML.

Running these tutorials

  1. Clone the repository.

    git clone https://github.com/alteryx/open-source-demos

  2. Install the requirements. It's recommended to create a new environment to work in to install these libraries separately.

    pip install -r requirements.txt

    In order to properly execute the demos, please install Graphviz according to the Featuretools documentation.

  3. Download the data.

    You can download the data for each demo by following the instructions in each tutorial. The dataset will usually be kept in a folder named data within the project structure.

  4. The tutorials can be run in Jupyter Notebook.

    jupyter notebook

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].