All Projects → jseabold → Pycon Ds 2018

jseabold / Pycon Ds 2018

Introduction to Python for Data Science for PyCon 2018

Projects that are alternatives of or similar to Pycon Ds 2018

Optimization Python
General optimization (LP, MIP, QP, continuous and discrete optimization etc.) using Python
Stars: ✭ 133 (-1.48%)
Mutual labels:  jupyter-notebook
Notebooks
Speedml Python Notebooks
Stars: ✭ 134 (-0.74%)
Mutual labels:  jupyter-notebook
Ssss
"Deep Learning and Quantum Programming" Spring School @ Song Shan Lake
Stars: ✭ 135 (+0%)
Mutual labels:  jupyter-notebook
Tianchiorgame
天池比赛,kaggle等等(Keras/PyTorch实战)
Stars: ✭ 134 (-0.74%)
Mutual labels:  jupyter-notebook
Deep Learning With Python
Example projects I completed to understand Deep Learning techniques with Tensorflow. Please note that I do no longer maintain this repository.
Stars: ✭ 134 (-0.74%)
Mutual labels:  jupyter-notebook
Curso Python Udemy
Curso Maestro Python 3 en Udemy (20 horas)
Stars: ✭ 134 (-0.74%)
Mutual labels:  jupyter-notebook
Machine Learning Concepts
Machine Learning Concepts with Concepts
Stars: ✭ 134 (-0.74%)
Mutual labels:  jupyter-notebook
Part bilinear reid
Code for ECCV2018 paper: Part-Aligned Bilinear Representations for Person Re-Identification
Stars: ✭ 135 (+0%)
Mutual labels:  jupyter-notebook
Deeplearning
深度学习相关教程
Stars: ✭ 134 (-0.74%)
Mutual labels:  jupyter-notebook
Handwriting Synthesis
Implementation of "Generating Sequences With Recurrent Neural Networks" https://arxiv.org/abs/1308.0850
Stars: ✭ 135 (+0%)
Mutual labels:  jupyter-notebook
Python Machine Learning
机器学习算法项目
Stars: ✭ 134 (-0.74%)
Mutual labels:  jupyter-notebook
Ml
Machine Learning Projects and Learning Content
Stars: ✭ 134 (-0.74%)
Mutual labels:  jupyter-notebook
Mxnet Gtc Tutorial
MXNet Tutorial for NVidia GTC 2016.
Stars: ✭ 134 (-0.74%)
Mutual labels:  jupyter-notebook
Python tips
Some Python tips for beginner to intermediate users. Also used as a personal cheat sheet. Featured here https://bit.ly/2ZaV4Pl
Stars: ✭ 134 (-0.74%)
Mutual labels:  jupyter-notebook
Sourcecodeofmongoredis
《左手MongoDB,右手Redis——从入门到商业实战》书籍配套源代码。
Stars: ✭ 135 (+0%)
Mutual labels:  jupyter-notebook
Code With Ai
Interface for people to use my model which predicts which techniques one should use to solve a competitive programming problem to get an AC
Stars: ✭ 134 (-0.74%)
Mutual labels:  jupyter-notebook
Tf Tutorial
Short tutorial for TensorFlow, designed to be presented in-person
Stars: ✭ 134 (-0.74%)
Mutual labels:  jupyter-notebook
Ml learning in action
ML study process record
Stars: ✭ 135 (+0%)
Mutual labels:  jupyter-notebook
Beyond Jupyter
🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
Stars: ✭ 135 (+0%)
Mutual labels:  jupyter-notebook
Machine Learning Projects
This repository consists of all my Machine Learning Projects.
Stars: ✭ 135 (+0%)
Mutual labels:  jupyter-notebook

Introduction to Python for Data Science

This is the Pycon 2018 tutorial Introduction to Python for Data Science.

Outcomes

Attendees will come away with knowledge about how to effectively use Python and packages in the PyData stack like pandas, matplotlib, and scikit-learn to do data science. We'll learn how to get and read data, clean and munge data, conduct exploratory data analysis, and use scikit-learn to do machine learning with a real-world data set. We will focus on best practices in using Python along the way.

Pre-Requisites

Some very basic knowledge of Python will be helpful. If you have never used Python before, but have programmed in some other language, that's fine. If you don't have a lot of experience with programming, you may want to look at at least the first five sections of the python tutorial. Other than the basics of Python, no familiarity with Python tools will be expected.

Some knowledge of statistics and data science will be helpful.

Requirements

Download the Tutorial

If you would like to code along during the workshop, follow these steps.

If you are familiar with git, clone this repository. To do so at the command line, type

git clone [email protected]:jseabold/pycon-ds-2018

If you are not familiar with git, you can download the code directly from GitHub.

download zip button

The day before the conference, you will want to check for any code changes. From the directory that contains the code, run

git pull

Or, download the zip file again.

Install the Packages

Before you arrive, install all of the pre-requisite packages. You may not be able to rely on conference Wi-Fi to do this.

If you don't have conda or anaconda installed, you may want to download the Python Anaconda Distribution. Go to anaconda.org and click on Download Anaconda.

download anaconda

Select the Python 3.X version for your OS (it should be automatically chosen). Download and install anaconda.

python 3

Now that you have conda or anaconda, type the following from the directory with this code at the terminal or command prompt.

conda env create -f environment.yaml

Activate the Environment with the Packages

Now you need to switch to the environment we just created with the packages installed. If you are on macOS or linux, run in the terminal

source activate pycon-ds-2018

If you are on windows, run at the command prompt

activate pycon-ds-2018

Make sure you can run the Jupyter Notebooks

jupyter notebook

You can make sure that all of the packages were installed correctly, by running the following at a new or different terminal or command prompt (i.e., not the one you just typed jupyter notebook in) within the same directory with the code.

python imports-test.py

Data

Go ahead and unzip the data in the data/ directory. Depending on how you unzip the archive this may create a subdirectory, if it does then you need to move all of the files into the parent directory. All of the data files should be in the data/ directory not data/data.

Outline

Introduction

What's data science? Why Python? What does it mean to write Pythonic code?

Geting and Reading Data with Python

Working with CSVs, json, web data using pure Python and pandas.

Data Wrangling with Pandas

How can I use pandas to clean my data?

Exploratory Analysis with Pandas

How can I use pandas to learn more about data?

Exploratory Plotting

How can I use Python plotting libraries like matplotlib and seaborn to understand the structure in my data?

Scikit-Learn

How can I put all of this together and build a machine-learning pipeline on some real world data?

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].