Materials for "Docker for Data Science" tutorial presented at PyCon 2018 in Cleveland, OH

Docker for Data Science

YouTube / Slides


Description

Jupyter notebooks simplify the process of developing and sharing Data Science projects across groups and organizations. However, when we want to deploy our work into production, we need to extract the model from the notebook and package it with the required artifacts (data, dependencies, configurations, etc.) to ensure it works in other environments. Containerization technologies such as Docker can be used to streamline this workflow.

This hands-on tutorial presents Docker in the context of Reproducible Data Science - from idea to application deployment. You will get a thorough introduction to the world of containers; learn how to incorporate Docker into various Data Science projects; and walk through the process of building a Machine Learning model in Jupyter and deploying it as a containerized Flask REST API.

Audience

This session is geared towards Data Scientists who are interested in learning about Docker and want to understand how to incorporate it in their projects. No prior knowledge of Docker is assumed. Proficiency with Git and the Command Line is not a prerequisite, but will make it easier to follow along.

Upon completion of this tutorial, students will be able to:

  • Navigate the Docker ecosystem with ease
  • Leverage containers as part of their data science workflow
  • Productionize & deploy a Machine Learning model wrapped in an API

Learn how to become a Full-Stack Data Scientist!

Installation Instructions

Step 1: Install Docker and Docker-Compose

Mac

  1. Download Docker for Mac, which includes both Docker and Docker-Compose.

  2. Run the installer.

Linux

  1. Update your package manager.

  2. Use your package manager to install Docker.

  3. Use your package manager to install Docker-Compose.

You might need to add your user account to the docker group.
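
As a minimal sketch of the group check mentioned above: the helper names has_group and docker_group_hint are hypothetical and not part of the tutorial materials. The script only reports the current state; the usermod command appears in the printed hint and is never run by the script itself.

```shell
# Sketch: report whether the current user belongs to the "docker" group.
# has_group and docker_group_hint are illustrative names, not tutorial files.

# has_group GROUP [NAMES...]: succeeds if GROUP appears among NAMES.
has_group() {
  wanted="$1"
  shift
  for g in "$@"; do
    [ "$g" = "$wanted" ] && return 0
  done
  return 1
}

docker_group_hint() {
  # id -nG prints the names of every group the current user belongs to;
  # it is left unquoted on purpose so each name becomes its own argument.
  if has_group docker $(id -nG); then
    echo "user is in the docker group"
  else
    echo "not in the docker group; try: sudo usermod -aG docker \"\$USER\""
    echo "then log out and back in so the new group takes effect"
  fi
}

docker_group_hint
```

Without the group membership, Docker commands on Linux typically have to be run with sudo.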

Windows

Note: Windows 10 users can use the Windows Subsystem for Linux to install Docker and Docker-Compose, following the instructions in a post we found on Medium.

Please also make sure to install Docker-Compose when you are installing Docker. Then proceed to Step 2.
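
On any platform, one way to confirm that both tools are installed before moving on is to check that they are on the PATH. This is a sketch only; check_tool is a hypothetical helper name, not something shipped with the tutorial.

```shell
# Sketch: confirm that docker and docker-compose can be found on the PATH.
# check_tool is an illustrative helper, not part of the tutorial repositories.

# check_tool NAME: print whether the command NAME is available.
check_tool() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "$1: found"
  else
    echo "$1: missing"
  fi
}

check_tool docker
check_tool docker-compose
```

If both are reported as found, running docker --version and docker-compose --version should print the installed versions.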

Otherwise, use the VM image we have created. USB sticks with the image will be available at the tutorial.

  1. Download VirtualBox for Windows Hosts.

  2. Download VirtualBox image containing all required files and containers. We also have USB sticks containing these images to reduce strain on the conference WiFi.

  3. Open VirtualBox Manager.

  4. File > Import Appliance > point to the file you just downloaded. Import it.

  5. Double-click VM to start an instance.

  6. Login: osboxes | Password: osboxes.org | Root password: osboxes.org

The VM image you download contains the Docker images as well as the repositories, which were cloned to ~/docker-for-data-science.

  1. Update the cloned repos by going into each folder and running git pull. Skip Steps 2 and 3.
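
The update step can be sketched as a small loop, assuming the repositories sit directly under ~/docker-for-data-science as described above. The function name update_repos is hypothetical.

```shell
# Sketch: run "git pull" in every Git working copy under a base folder.
# update_repos is an illustrative name; the default path is the one the
# tutorial uses inside the VM.

# update_repos [BASE_DIR]
update_repos() {
  base="${1:-$HOME/docker-for-data-science}"
  for repo in "$base"/*/; do
    # Only touch directories that are Git working copies.
    if [ -d "$repo/.git" ]; then
      echo "updating $repo"
      # git -C DIR runs the command as if started inside DIR.
      git -C "$repo" pull || return 1
    fi
  done
}
```

Calling update_repos with no argument uses the default folder; the function stops at the first pull that fails.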

Step 2: Clone Git Repositories

  1. Create a folder for this tutorial. We recommend ~/docker-for-data-science, as this is the folder we use in all of our examples.

  2. cd into the folder.

  3. Download both repositories:

git clone https://github.com/docker-for-data-science/docker-for-data-science-tutorial.git
git clone https://github.com/docker-for-data-science/talkvoter.git
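
For convenience, the three parts of Step 2 can be combined into one function. This is a sketch rather than part of the tutorial materials: clone_tutorial_repos is a hypothetical name, and skipping a repository that already exists is an added assumption so the function can be re-run safely.

```shell
# Sketch: create the tutorial folder, enter it, and clone both repositories.
# clone_tutorial_repos is an illustrative name; the URLs are the ones above.

# clone_tutorial_repos [TARGET_DIR]
clone_tutorial_repos() {
  dir="${1:-$HOME/docker-for-data-science}"
  base_url="https://github.com/docker-for-data-science"
  # Note: this leaves the calling shell inside TARGET_DIR, as in Step 2.
  mkdir -p "$dir" && cd "$dir" || return 1
  for repo in docker-for-data-science-tutorial talkvoter; do
    if [ -d "$repo" ]; then
      # Assumption: an existing folder means the repo was already cloned.
      echo "skipping $repo (already present)"
    else
      git clone "$base_url/$repo.git" || return 1
    fi
  done
}
```

Calling clone_tutorial_repos with no argument uses the recommended folder; passing a path clones into that folder instead.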

Step 3: Download Docker Images

Please pre-download Docker images to reduce the strain on the conference WiFi.

  1. cd ~/docker-for-data-science/docker-for-data-science-tutorial/installation_files

  2. Run the shell script: ./download_docker_images.sh

  3. Build images for Talk Recommendation application:

cd ~/docker-for-data-science/talkvoter
docker-compose build
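
If either cd command in Step 3 fails, the folders from Step 2 are probably not where the examples expect them. A minimal sketch of a layout check follows; check_layout is a hypothetical helper, and the two paths are the ones used in the commands above.

```shell
# Sketch: verify that the folders Step 3 relies on exist.
# check_layout is an illustrative name, not a script from the repositories.

# check_layout [BASE_DIR]: returns 0 if both folders exist, 1 otherwise.
check_layout() {
  base="${1:-$HOME/docker-for-data-science}"
  missing=0
  for d in docker-for-data-science-tutorial/installation_files talkvoter; do
    if [ -d "$base/$d" ]; then
      echo "ok: $d"
    else
      echo "missing: $d"
      missing=1
    fi
  done
  return "$missing"
}
```

Running check_layout before Step 3 prints one line per folder and returns a non-zero status if either one is absent.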