stefmolin / python-data-viz-workshop

Licence: MIT license
A workshop on data visualization in Python with notebooks and exercises for following along.

Programming Languages

Jupyter Notebook
HTML

Projects that are alternatives of or similar to python-data-viz-workshop

The-Data-Visualization-Workshop
A New, Interactive Approach to Learning Data Visualization
Stars: ✭ 59 (-56.62%)
Mutual labels:  bokeh, matplotlib
Holoviews
With Holoviews, your data visualizes itself.
Stars: ✭ 2,065 (+1418.38%)
Mutual labels:  holoviews, holoviz
traceml
Engine for ML/Data tracking, visualization, dashboards, and model UI for Polyaxon.
Stars: ✭ 445 (+227.21%)
Mutual labels:  bokeh, matplotlib
Covid19 Dashboard
A site that displays up to date COVID-19 stats, powered by fastpages.
Stars: ✭ 1,212 (+791.18%)
Mutual labels:  data-visualisation, matplotlib
Bokeh
Interactive Data Visualization in the browser, from Python
Stars: ✭ 15,822 (+11533.82%)
Mutual labels:  data-visualisation, bokeh
QM-Simulator-1D
Interactive simulation of a quantum particle in 1D.
Stars: ✭ 69 (-49.26%)
Mutual labels:  matplotlib, matplotlib-animation
11 Python Matplotlib Module
Matplotlib is an amazing visualization library in Python for 2D plots of arrays. Matplotlib is a multi-platform data visualization library built on NumPy arrays and designed to work with the broader SciPy stack. It was introduced by John Hunter in the year 2002. One of the greatest benefits of visualization is that it allows us visual access to …
Stars: ✭ 206 (+51.47%)
Mutual labels:  matplotlib, matplotlib-tutorial
ai for robotics
Visualizations of algorithms covered in Sebastian Thrun's excellent Artificial Intelligence for Robotics course on Udacity.
Stars: ✭ 125 (-8.09%)
Mutual labels:  matplotlib, matplotlib-animation
mplcolors
A command-line information tool written in Python 3.x to display matplotlib colors.
Stars: ✭ 34 (-75%)
Mutual labels:  matplotlib
Python-Data-Visualization
D-Lab's 3 hour introduction to data visualization with Python. Learn how to create histograms, bar plots, box plots, scatter plots, compound figures, and more, using matplotlib and seaborn.
Stars: ✭ 42 (-69.12%)
Mutual labels:  matplotlib
hover-binder
Binder repository for the hover package
Stars: ✭ 43 (-68.38%)
Mutual labels:  bokeh
aws-amplify-sns-workshop-in-vscode
VSCode remote-containers environment for "Amplify SNS Workshop"
Stars: ✭ 27 (-80.15%)
Mutual labels:  workshop-materials
mltb
Machine Learning Tool Box
Stars: ✭ 25 (-81.62%)
Mutual labels:  matplotlib
stock-market-prediction-via-google-trends
Attempt to predict future stock prices based on Google Trends data.
Stars: ✭ 45 (-66.91%)
Mutual labels:  data-visualisation
PandasVersusExcel
Introduction to Python data analysis; getting started as a data analyst
Stars: ✭ 120 (-11.76%)
Mutual labels:  matplotlib
dufte
📈 Minimalistic Matplotlib style
Stars: ✭ 196 (+44.12%)
Mutual labels:  matplotlib
armcompanion
Companion materials for the rstudio::conf 2019 Advanced R Markdown workshop
Stars: ✭ 15 (-88.97%)
Mutual labels:  workshop-materials
dataquest-guided-projects-solutions
My dataquest project solutions
Stars: ✭ 35 (-74.26%)
Mutual labels:  matplotlib
rover
Interactive Terraform visualization. State and configuration explorer.
Stars: ✭ 2,039 (+1399.26%)
Mutual labels:  interactive-visualizations
corona-kakao-bot
🤖 KakaoTalk COVID-19 notifier
Stars: ✭ 17 (-87.5%)
Mutual labels:  matplotlib

Beyond the Basics: Data Visualization in Python

The human brain excels at finding patterns in visual representations, which is why data visualizations are essential to any analysis. Done right, they bridge the gap between those analyzing the data and those consuming the analysis. However, learning to create impactful, aesthetically pleasing visualizations can be challenging. This session will equip you with the skills to make customized visualizations for your data using Python.

While there are many plotting libraries to choose from, the prolific Matplotlib library is always a great place to start. Since various Python data science libraries use Matplotlib under the hood, familiarity with Matplotlib itself gives you the flexibility to fine-tune the resulting visualizations (e.g., by adding annotations or animating them). This session will also introduce interactive visualizations using HoloViz, which provides a higher-level plotting API capable of using Matplotlib and Bokeh (a Python library for generating interactive, JavaScript-powered visualizations) under the hood.

Workshop Outline

This is a workshop on data visualization in Python first delivered at ODSC West 2021 and subsequently at ODSC East 2022, PyCon Italia 2022, ODSC Europe 2022, and EuroPython 2022. It's divided into the following sections:

Section 1: Getting Started With Matplotlib

We will begin by familiarizing ourselves with Matplotlib. Moving beyond the default options, we will explore how to customize various aspects of our visualizations. By the end of this section, you will be able to generate plots using the Matplotlib API directly, as well as customize the plots that libraries like pandas and Seaborn create for you.
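As a rough illustration of the two skills named above, the following sketch draws one plot with the Matplotlib API directly and then customizes a plot that pandas creates. The data, labels, and the choice of the `Agg` backend are assumptions made for this example and are not taken from the workshop notebooks.

```python
import matplotlib
matplotlib.use("Agg")  # assumption: headless backend so this runs without a display
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical data, for illustration only
df = pd.DataFrame({"day": [1, 2, 3, 4, 5], "visitors": [12, 18, 9, 24, 20]})

fig, (ax_direct, ax_pandas) = plt.subplots(1, 2, figsize=(10, 4))

# 1. Plot using the Matplotlib API directly
ax_direct.plot(df["day"], df["visitors"], marker="o", color="tab:blue")
ax_direct.set(title="Matplotlib API", xlabel="Day", ylabel="Visitors")

# 2. Let pandas create the plot, then customize the Axes object it returns
ax_pandas = df.plot(x="day", y="visitors", kind="bar", ax=ax_pandas, legend=False)
ax_pandas.set_title("pandas plot, customized")
ax_pandas.set_ylabel("Visitors")
ax_pandas.set_ylim(0, 30)

# Bars are drawn at integer positions 0..n-1, so the row index locates the bar
peak = df["visitors"].idxmax()
ax_pandas.annotate(
    "peak",
    xy=(peak, df["visitors"].max()),
    xytext=(peak - 1.5, df["visitors"].max() + 1),
    arrowprops={"arrowstyle": "->"},
)

fig.tight_layout()
```

Because pandas returns a regular Matplotlib `Axes`, the same methods (`set_title()`, `annotate()`, and so on) apply to both plots.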

Section 2: Moving Beyond Static Visualizations

Static visualizations are limited in how much information they can show. To move beyond these limitations, we can create animated and/or interactive visualizations. Animations make it possible for our visualizations to tell a story through movement of the plot components (e.g., bars, points, lines). Interactivity makes it possible to explore the data visually by hiding and displaying information based on user interest. In this section, we will focus on creating animated visualizations using Matplotlib before moving on to create interactive visualizations in the next section.
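A minimal sketch of an animated Matplotlib plot is shown below, assuming `FuncAnimation` from `matplotlib.animation` as the mechanism. The sine-wave data and the `update()` helper are hypothetical and chosen only to keep the example short; the workshop's own animations may be built differently.

```python
import matplotlib
matplotlib.use("Agg")  # assumption: headless backend so this runs without a display
import matplotlib.pyplot as plt
import numpy as np
from matplotlib.animation import FuncAnimation

x = np.linspace(0, 2 * np.pi, 200)

fig, ax = plt.subplots()
(line,) = ax.plot([], [])  # start with an empty line and fill it in per frame
ax.set(xlim=(0, 2 * np.pi), ylim=(-1.1, 1.1), title="Traveling sine wave")


def update(frame):
    """Redraw the line for the given frame number."""
    line.set_data(x, np.sin(x + 0.1 * frame))
    return (line,)  # with blit=True, return the artists that changed


anim = FuncAnimation(fig, update, frames=60, interval=50, blit=True)

# In a notebook, one way to display the result is:
#   from IPython.display import HTML
#   HTML(anim.to_jshtml())
```

The animation is just a function that Matplotlib calls once per frame to modify existing plot components, which is why understanding the underlying artists (lines, bars, points) matters.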

Section 3: Building Interactive Visualizations for Data Exploration

When exploring our data, interactive visualizations can provide the most value. Without having to create multiple iterations of the same plot, we can use mouse actions (e.g., click, hover, zoom) to explore different aspects and subsets of the data. In this section, we will learn how to use a few of the libraries in the HoloViz ecosystem to create interactive visualizations for exploring our data using the Bokeh backend.


Prerequisites

You should have basic knowledge of Python and be comfortable working in Jupyter Notebooks. Check out this notebook for a crash course in Python or work through the official Python tutorial for a more formal introduction. The environment we will use for this workshop comes with JupyterLab, which is fairly intuitive, but be sure to familiarize yourself with using notebooks in JupyterLab and with its additional functionality. In addition, a basic understanding of pandas will be beneficial, but is not required; reviewing the first section of my pandas workshop will be sufficient.


Setup Instructions

  1. Install Anaconda/Miniconda. Note that you can use this Binder environment instead if you don't want to install anything on your machine.

  2. Fork this repository:

    (Screenshot: location of fork button in GitHub)

  3. Clone your forked repository:

    (Screenshot: location of clone button in GitHub)

  4. Create and activate a conda virtual environment (on Windows, these commands should be run in Anaconda Prompt):

    $ cd python-data-viz-workshop
    ~/python-data-viz-workshop$ conda install mamba -n base -c conda-forge
    ~/python-data-viz-workshop$ mamba env create --file environment.yml
    ~/python-data-viz-workshop$ conda activate data_viz_workshop
    (data_viz_workshop) ~/python-data-viz-workshop$
  5. Launch JupyterLab:

    (data_viz_workshop) ~/python-data-viz-workshop$ jupyter lab
  6. Navigate to the 0-check_your_env.ipynb notebook in the notebooks/ folder:

    (Screenshot: opening 0-check_your_env.ipynb)

  7. Run the notebook to confirm everything is set up properly:

    (Screenshot: output of the environment check)
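For a quick sanity check outside the notebook, something like the following sketch could be run in the activated environment. It is not the contents of `0-check_your_env.ipynb`; the `check_packages()` helper is hypothetical, and the package list is only a guess based on the libraries this README mentions rather than on `environment.yml`.

```python
from importlib.util import find_spec

# Assumption: import names guessed from the libraries named in this README
PACKAGES = ["matplotlib", "pandas", "seaborn", "bokeh", "holoviews"]


def check_packages(names):
    """Map each import name to True if it can be found, else False."""
    return {name: find_spec(name) is not None for name in names}


if __name__ == "__main__":
    for name, found in check_packages(PACKAGES).items():
        status = "OK     " if found else "MISSING"
        print(f"{status} {name}")
```

This only confirms that the packages can be located; the notebook remains the reference for confirming that the setup is correct.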


About the Author

Stefanie Molin (@stefmolin) is a software engineer and data scientist at Bloomberg in New York City, where she tackles tough problems in information security, particularly those revolving around data wrangling/visualization, building tools for gathering data, and knowledge sharing. She is also the author of Hands-On Data Analysis with Pandas, which is currently in its second edition. She holds a Bachelor of Science degree in operations research from Columbia University's Fu Foundation School of Engineering and Applied Science. She is currently pursuing a master’s degree in computer science, with a specialization in machine learning, from Georgia Tech. In her free time, she enjoys traveling the world, inventing new recipes, and learning new languages spoken among both people and computers.

Related Content

All examples herein were developed exclusively for this workshop. Hands-On Data Analysis with Pandas contains additional examples and exercises, as do this blog post and this workshop on pandas.