All Projects → whiteboxml → Teaching Ml In Production

whiteboxml / Teaching Ml In Production

Projects that are alternatives of or similar to Teaching Ml In Production

Do Zero Ao Ml
Stars: ✭ 49 (-2%)
Mutual labels:  jupyter-notebook
Pure Numpy Feedfowardnn
Simple feedforward neural network class written in pure python+numpy
Stars: ✭ 49 (-2%)
Mutual labels:  jupyter-notebook
Live Video Analytics
A collection of reference applications using live video analytics capabilities in Azure Media Services
Stars: ✭ 50 (+0%)
Mutual labels:  jupyter-notebook
Spark Tutorials
Code and Notebooks for Spark Tutorials for Learning Journal @ Youtube
Stars: ✭ 49 (-2%)
Mutual labels:  jupyter-notebook
Randomized Svd
demos for PyBay talk: Using Randomness to make code faster
Stars: ✭ 49 (-2%)
Mutual labels:  jupyter-notebook
Tensorflow From Zero To One
TensorFlow 最佳学习资源大全(含课程、书籍、博客、公开课等内容)
Stars: ✭ 1,052 (+2004%)
Mutual labels:  jupyter-notebook
Accurate Binary Convolution Network
Binary Convolution Network for faster real-time processing in ASICs
Stars: ✭ 49 (-2%)
Mutual labels:  jupyter-notebook
Probandstats Pydatanyc2019
Introduction to Probability and Statistics
Stars: ✭ 50 (+0%)
Mutual labels:  jupyter-notebook
Teal deer
Teal deer (from TL;DR) helps you get the gist of all the stuff you need to read, so you don't have to read it all at once.
Stars: ✭ 49 (-2%)
Mutual labels:  jupyter-notebook
My Projects
It's my projects
Stars: ✭ 50 (+0%)
Mutual labels:  jupyter-notebook
Universodiscreto
Códigos explicados nos vídeos do canal Universo Discreto (YouTube)
Stars: ✭ 49 (-2%)
Mutual labels:  jupyter-notebook
Salmonte
SalmonTE is an ultra-Fast and Scalable Quantification Pipeline of Transpose Element (TE) Abundances
Stars: ✭ 49 (-2%)
Mutual labels:  jupyter-notebook
Feature Engineering Book
Code repo for the book "Feature Engineering for Machine Learning," by Alice Zheng and Amanda Casari, O'Reilly 2018
Stars: ✭ 1,052 (+2004%)
Mutual labels:  jupyter-notebook
Lipreading
Stars: ✭ 49 (-2%)
Mutual labels:  jupyter-notebook
Wsdm Adhoc Document Retrieval
This is our solution for WSDM - DiggSci 2020. We implemented a simple yet robust search pipeline which ranked 2nd in the validation set and 4th in the test set. We won the gold prize at innovation track and bronze prize at dataset track.
Stars: ✭ 50 (+0%)
Mutual labels:  jupyter-notebook
Eeg Classification Using Recurrent Neural Network
Used LSTM Network to classify eeg signals based on stimuli the subject recieved (visual or audio)
Stars: ✭ 49 (-2%)
Mutual labels:  jupyter-notebook
Mlapp Solutions
Solutions in Python for Kevin Murphy's Machine Learning: a Probabilistic Perspective
Stars: ✭ 49 (-2%)
Mutual labels:  jupyter-notebook
Presentations
Talks & Workshops by the CODAIT team
Stars: ✭ 50 (+0%)
Mutual labels:  jupyter-notebook
Vapoursynthcolab
AI Video Processing/Upscaling With VapourSynth in Google Colab
Stars: ✭ 47 (-6%)
Mutual labels:  jupyter-notebook
Winter 2016 Cs231n
Assignments: CNN for Visual Recognition.
Stars: ✭ 49 (-2%)
Mutual labels:  jupyter-notebook

ML-IN-PRODUCTION-MADRID

This repository contains all materials from the workshop about putting Machine Learning models to production we teached in September 2019 at IronHack.

Approach

This is a practical workshop with the goals of learning the following concepts:

  • How to setup MLFLow, a tool for ML experiment tracking and model deploying, from zero to hero.
  • How to track ML experiments with MLFLow
  • How to put models to production with MLFLow.
  • How to deploy models to production in AWS Sagemaker with just a couple lines of code.
  • How to setup Apache Airflow, a powerful tool to design, schedule and monitor workflows.
  • How to create workflows that take advantage of deployed models.

In order to follow tutorials in a standard setup, there is a Linux VM included in this repository with repository itself and conda preinstalled. Please download VirtualBox and import vm/ubuntu.ova. As this is a large file, you can download it from here:

VM login credentials are:

  • username: ubuntu
  • password: ubuntu

In case you want to follow examples in this repo using your very own setup, we highly recommend using an Ubuntu 18.04 machine with conda installed.

Calendar

  • Friday 27/09/2019 from 17 to 20h

    • Introduction to Machine Learning in Production
    • Introduction to MLFlow, MLFLow full setup
    • Introduction to Dataset and Business Case (Renfe AVE ticket price forecasting)
    • MLFLow training API
  • Saturday from 10 to 20h

    • MLFLow deployment API
    • Python Virtual Environments distribution
    • AWS model deployment with SageMaker
    • Introduction to Apache Airflow
    • Airflow orchestration

Business Case

All examples will use our dataset about high speed train tickets in Spain. You can download the dataset from Kaggle or using this link. The following use cases are covered here:

Unsupervised learning - high speed train tickets clustering using the following algorithms:

  • Dimensionality reduction with UMAP
  • HDBSCAN clustering
  • Model to production using MLFlow so that the REST API returns a cluster ID for new tickets

Supervised learning - high speed train tickets forecasting using the following algorithms:

  • XGBoost implementation of AWS Sagemaker (both cloud training and model deployment)
  • scikit-learn Random Forest (local training and cloud deployment in AWS Sagemaker)

Model deployment:

  • Putting models to production in virtually any linux machine or server
  • Putting model to production in cloud with AWS SageMaker

Scheduling:

  • Orchestration of (batch) clustering and price forecasting for new data using Apache Airflow
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].