Tutorial on scikit-learn and IPython for parallel machine learning

Parallel Machine Learning with scikit-learn and IPython

Video Tutorial

Video recording of this tutorial given at PyCon in 2013. The tutorial material has since been rearranged in part and extended. Look at the titles of the notebooks to follow along with the presentation.

Browse the static notebooks on nbviewer.ipython.org.

Scope of this tutorial:

  • Learn common machine learning concepts and how they match the scikit-learn Estimator API.

  • Learn about scalable feature extraction for text classification and clustering.

  • Learn how to perform cross-validation and hyperparameter grid search in parallel with IPython.

  • Learn to analyze the kinds of common errors predictive models are subject to and how to refine your modeling to take this analysis into account.

  • Learn to optimize memory allocation on your computing nodes with numpy memory mapping features.

  • Learn how to run a cheap IPython cluster for interactive predictive modeling on the Amazon EC2 spot instances using StarCluster.
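To make the Estimator API and parallel grid search items above more concrete, here is a minimal sketch using only the Python standard library. It is not code from the tutorial notebooks and it does not use scikit-learn or IPython: `ToyKNN`, `k_fold_indices`, `evaluate` and `grid_search` are hypothetical names introduced for illustration, and a thread pool stands in for a cluster of IPython engines. The point is only the shape of the computation: an estimator exposes `fit`, `predict` and `score`, and every (hyperparameter value, cross-validation fold) pair is an independent task that can be dispatched to a worker.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from statistics import mean


class ToyKNN:
    """Hypothetical 1-D k-nearest-neighbors classifier (fit / predict / score)."""

    def __init__(self, n_neighbors=1):
        self.n_neighbors = n_neighbors

    def fit(self, X, y):
        # "Training" simply memorizes the samples.
        self.X_, self.y_ = list(X), list(y)
        return self

    def predict(self, X):
        predictions = []
        for x in X:
            nearest = sorted(zip(self.X_, self.y_), key=lambda pair: abs(pair[0] - x))
            votes = Counter(label for _, label in nearest[: self.n_neighbors])
            predictions.append(votes.most_common(1)[0][0])
        return predictions

    def score(self, X, y):
        # Accuracy: fraction of correctly predicted labels.
        return mean(float(p == t) for p, t in zip(self.predict(X), y))


def k_fold_indices(n_samples, n_folds):
    """Yield (train_indices, test_indices) pairs for interleaved folds."""
    for fold in range(n_folds):
        test = [i for i in range(n_samples) if i % n_folds == fold]
        train = [i for i in range(n_samples) if i % n_folds != fold]
        yield train, test


def evaluate(task):
    """Fit on one training fold and score on the matching test fold."""
    n_neighbors, train, test, X, y = task
    model = ToyKNN(n_neighbors=n_neighbors)
    model.fit([X[i] for i in train], [y[i] for i in train])
    return n_neighbors, model.score([X[i] for i in test], [y[i] for i in test])


def grid_search(X, y, grid, n_folds=3, max_workers=4):
    """Return {n_neighbors: mean CV accuracy}; tasks are submitted to a pool."""
    tasks = [
        (k, train, test, X, y)
        for k in grid
        for train, test in k_fold_indices(len(X), n_folds)
    ]
    # Assumption: a thread pool stands in for remote workers here.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        results = list(pool.map(evaluate, tasks))
    return {k: mean(s for kk, s in results if kk == k) for k in grid}


# Two well-separated 1-D clusters: class 0 near 0, class 1 near 10.
X = [0.0, 0.5, 1.0, 1.5, 2.0, 2.5, 9.0, 9.5, 10.0, 10.5, 11.0, 11.5]
y = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1]
scores = grid_search(X, y, grid=[1, 3, 5])
best_k = max(scores, key=scores.get)
print(scores, best_k)
```

Threads give little speed-up for CPU-bound pure Python code, so this only illustrates how the work is split into tasks. With real estimators the same task list would be sent to separate processes or machines.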

Target audience

This tutorial targets developers with some experience with scikit-learn and machine learning concepts in general.

It is recommended to first go through one of the tutorials hosted at scikit-learn.org if you are new to scikit-learn.

You might also want to have a look at the SciPy Lecture Notes first if you are new to the NumPy / SciPy / matplotlib ecosystem.

Setup

Install NumPy, SciPy, matplotlib, IPython, psutil, and scikit-learn in their latest stable versions (e.g. IPython 2.2.0 and scikit-learn 0.15.2 at the time of writing).

You can find up-to-date installation instructions on scikit-learn.org and ipython.org.

To check your installation, launch the ipython interactive shell in a console and type the following import statements to check each library:

>>> import numpy
>>> import scipy
>>> import matplotlib
>>> import psutil
>>> import sklearn

If you don't get any message, everything is fine. If you get an error message, please ask for help on the mailing list of the matching project and don't forget to mention the version of the library you are trying to install along with the type of platform and version (e.g. Windows 8.1, Ubuntu 14.04, OSX 10.9...).
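The manual check above can also be scripted. The following is a small sketch, not part of the tutorial repository: `check_modules` and `REQUIRED` are hypothetical names. It tries to import each library, reports the version when the module exposes a `__version__` attribute, and prints the platform details that are useful to include when asking for help.

```python
import importlib
import platform
import sys

# Import names of the libraries listed in the Setup section.
REQUIRED = ["numpy", "scipy", "matplotlib", "IPython", "psutil", "sklearn"]


def check_modules(names):
    """Return {module name: version string or a "MISSING (...)" message}."""
    report = {}
    for name in names:
        try:
            module = importlib.import_module(name)
        except Exception as exc:
            # A broken install may raise something other than ImportError.
            report[name] = "MISSING ({})".format(exc)
        else:
            report[name] = getattr(module, "__version__", "unknown version")
    return report


print("Platform: {}".format(platform.platform()))
print("Python:   {}".format(sys.version.split()[0]))
for name, status in check_modules(REQUIRED).items():
    print("{:<12} {}".format(name, status))
```

Any line reported as MISSING points at the library to reinstall or to mention on the matching mailing list.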

You can exit the ipython shell by typing exit.

Fetching the data

It is recommended to fetch the datasets before diving into the tutorial material itself. To do so, run the fetch_data.py script in this folder:

$ python fetch_data.py

Using the IPython notebook to follow the tutorial

The tutorial material and exercises are hosted in a set of IPython executable notebook files.

To run them interactively do:

$ cd notebooks
$ ipython notebook

This should automatically open a new browser window listing all the notebooks of the folder.

You can then execute the cells in order by pressing "Shift-Enter". The output is displayed directly under each cell and the cursor moves on to the next cell. Go to the "Help" menu for links to the notebook tutorial.

Credits

Some of this material is adapted from the scipy 2013 tutorial:

http://github.com/jakevdp/sklearn_scipy2013

Original authors:
