All Projects → alanderex → Pydata Pandas Workshop

alanderex / Pydata Pandas Workshop

Material for my PyData Jupyter & Pandas Workshops, I'm also available for personal in-house trainings on request

Projects that are alternatives of or similar to Pydata Pandas Workshop

Pandas Videos
Jupyter notebook and datasets from the pandas Q&A video series
Stars: ✭ 1,716 (+2540%)
Mutual labels:  jupyter-notebook, data-analysis, pandas
Programming With Data
🐍 Learn Python and Pandas from the ground up
Stars: ✭ 156 (+140%)
Mutual labels:  jupyter-notebook, pandas, workshop
Pbpython
Code, Notebooks and Examples from Practical Business Python
Stars: ✭ 1,724 (+2552.31%)
Mutual labels:  jupyter-notebook, data-analysis, pandas
100 Pandas Puzzles
100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
Stars: ✭ 1,382 (+2026.15%)
Mutual labels:  jupyter-notebook, data-analysis, pandas
Zat
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (+366.15%)
Mutual labels:  jupyter-notebook, data-analysis, pandas
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+2232.31%)
Mutual labels:  jupyter-notebook, data-analysis, pandas
Data Analysis
主要是爬虫与数据分析项目总结,外加建模与机器学习,模型的评估。
Stars: ✭ 142 (+118.46%)
Mutual labels:  jupyter-notebook, data-analysis, pandas
Seaborn Tutorial
This repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.
Stars: ✭ 114 (+75.38%)
Mutual labels:  jupyter-notebook, data-analysis, pandas
Data Science Hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (+320%)
Mutual labels:  jupyter-notebook, data-analysis, pandas
Edaviz
edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab
Stars: ✭ 220 (+238.46%)
Mutual labels:  jupyter-notebook, data-analysis, pandas
Bcs workshop apr 20
Workshop on basic machine learning, computational modeling, psychophysics, basic data analysis and experiment design
Stars: ✭ 134 (+106.15%)
Mutual labels:  jupyter-notebook, data-analysis, workshop
Pandas exercises
Practice your pandas skills!
Stars: ✭ 7,140 (+10884.62%)
Mutual labels:  jupyter-notebook, data-analysis, pandas
Dtale
Visualizer for pandas data structures
Stars: ✭ 2,864 (+4306.15%)
Mutual labels:  jupyter-notebook, data-analysis, pandas
Pydata Notebook
利用Python进行数据分析 第二版 (2017) 中文翻译笔记
Stars: ✭ 4,300 (+6515.38%)
Mutual labels:  jupyter-notebook, data-analysis, pandas
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+12713.85%)
Mutual labels:  jupyter-notebook, data-analysis, pandas
Data Forge Ts
The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 967 (+1387.69%)
Mutual labels:  data-analysis, pandas
Machine Learning Alpine
Alpine Container for Machine Learning
Stars: ✭ 30 (-53.85%)
Mutual labels:  jupyter-notebook, pandas
Mldotnet Real Time Data Streaming Workshop
A Machine Learning and Real-Time Data Analytics Workshop
Stars: ✭ 34 (-47.69%)
Mutual labels:  jupyter-notebook, workshop
Mlcourse.ai
Open Machine Learning Course
Stars: ✭ 7,963 (+12150.77%)
Mutual labels:  data-analysis, pandas
Python for ml
brief introduction to Python for machine learning
Stars: ✭ 29 (-55.38%)
Mutual labels:  jupyter-notebook, pandas

Analytics with Pandas and Jupyterlab

Follow-Along tutorial to get you started.

Poster

Pandas is the Swiss-Multipurpose Knife for Data Analysis in Python. With Pandas dealing with data-analysis is easy and simple but there are some things you need to get your head around first as Data-Frames and Data-Series.

The tutorial provides a compact introduction to Pandas for beginners for I/O, data visualisation, statistical data analysis and aggregation within Jupiter notebooks.

Binder

Run Jupyterlab in the cloud, requires internet access.

Binder

Installation

Local Installation

Copy this repository to your computer

# get this repository
git clone https://github.com/alanderex/pydata-pandas-workshop.git
cd pydata-pandas-workshop

Make sure to update to the latest vesion just when the training starts:

git pull

Having Anaconda installed simply create your ENV with

# install working environment with conda
conda env create -n pydata-pandas-workshop -f environment.yml

# environment should be activated now
# if not type: source activate pydata-pandas-workshop

In case the installation via file fails, simply:

conda env create -n pydata-pandas-workshop python=3.6
source activate pydata-pandas-workshop
conda install pandas jupyterlab xlrd xlsxwriter dask seaborn -y

Alternatively you can also create a python virtual enviroment and

pip install -r requirements.txt

If you don't want to use anaconda, you can use python3 and

pip install pandas jupyter barnum numpy matplotlib xlsxwriter seaborn bokeh jupyterlab parquet dask

(at your own risk)

Start Juypterlab

jupyter lab
# paste the url displayed in your browser, if it doesn't open anyway:
# http://localhost:8888/lab

A Practical Start: Reading and Writing Data Across Multiple Formats

  • CSV

  • Excel

  • JSON

  • Clipboard

  • data

    • .info
    • .describe

DataSeries & DataFrames / NumPy

  • Ode to NumPy
  • Data-Series
  • Data-Frames

Data selection & Indexing

  • Data-Series:
    • Slicing
    • Access by label
    • Index
  • Data-Frames:
    • Slicing
    • Access by label
    • Peek into joining data
  • Returns a copy / inplace
  • Boolean indexing

Operations

  • add/substract
  • multiply

Data Visualisation

  • plot your data directly into your notebook

Peek Into Statistical Data Analysis & Aggregation

  • Merging
  • Multi-Index
  • DateTime Index
  • Resampling
  • Pivoting

Scaling and Optimizing

  • Faster file I/O with Parquet
  • Scaling and Distributing with Dask
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].