All Projects → krishnan-r → Sparkmonitor

krishnan-r / Sparkmonitor

Licence: apache-2.0
Monitor Apache Spark from Jupyter Notebook

Programming Languages

javascript
184084 projects - #8 most used programming language

Projects that are alternatives of or similar to Sparkmonitor

Enterprise gateway
A lightweight, multi-tenant, scalable and secure gateway that enables Jupyter Notebooks to share resources across distributed clusters such as Apache Spark, Kubernetes and others.
Stars: ✭ 412 (+167.53%)
Mutual labels:  spark, jupyter
Elasticsearch Spark Recommender
Use Jupyter Notebooks to demonstrate how to build a Recommender with Apache Spark & Elasticsearch
Stars: ✭ 707 (+359.09%)
Mutual labels:  spark, jupyter
Spark Jupyter Aws
A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
Stars: ✭ 259 (+68.18%)
Mutual labels:  spark, jupyter
Spark Scala Tutorial
A free tutorial for Apache Spark.
Stars: ✭ 907 (+488.96%)
Mutual labels:  spark, jupyter
Kamu Cli
Next generation tool for decentralized exchange and transformation of semi-structured data
Stars: ✭ 69 (-55.19%)
Mutual labels:  spark, jupyter
Spark On Kubernetes Helm
Spark on Kubernetes infrastructure Helm charts repo
Stars: ✭ 92 (-40.26%)
Mutual labels:  spark, jupyter
Justenoughscalaforspark
A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.
Stars: ✭ 538 (+249.35%)
Mutual labels:  spark, jupyter
Sparkmagic
Jupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+519.48%)
Mutual labels:  spark, jupyter
Vagrant Projects
Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR
Stars: ✭ 34 (-77.92%)
Mutual labels:  spark, jupyter
Ds Cheatsheets
List of Data Science Cheatsheets to rule the world
Stars: ✭ 9,452 (+6037.66%)
Mutual labels:  spark, jupyter
Almond
A Scala kernel for Jupyter
Stars: ✭ 1,354 (+779.22%)
Mutual labels:  spark, jupyter
Uinavigationitem Margin
Margins for UINavigationItem
Stars: ✭ 151 (-1.95%)
Mutual labels:  extension
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-2.6%)
Mutual labels:  spark
Jetson
Helmut Hoffer von Ankershoffen experimenting with arm64 based NVIDIA Jetson (Nano and AGX Xavier) edge devices running Kubernetes (K8s) for machine learning (ML) including Jupyter Notebooks, TensorFlow Training and TensorFlow Serving using CUDA for smart IoT.
Stars: ✭ 151 (-1.95%)
Mutual labels:  jupyter
Pg qualstats
A PostgreSQL extension for collecting statistics about predicates, helping find what indices are missing
Stars: ✭ 150 (-2.6%)
Mutual labels:  extension
Spark.jl
Julia binding for Apache Spark
Stars: ✭ 153 (-0.65%)
Mutual labels:  spark
Jupyter themes
A plugin to select syntax highlighting on Jupyter
Stars: ✭ 151 (-1.95%)
Mutual labels:  jupyter
Cc Pyspark
Process Common Crawl data with Python and Spark
Stars: ✭ 147 (-4.55%)
Mutual labels:  spark
Ml Workspace
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Stars: ✭ 2,337 (+1417.53%)
Mutual labels:  jupyter
Spacy Course
👩‍🏫 Advanced NLP with spaCy: A free online course
Stars: ✭ 1,920 (+1146.75%)
Mutual labels:  jupyter

Build Status

Spark Monitor - An extension for Jupyter Notebook

Note: This project is now maintained at https://github.com/swan-cern/jupyter-extensions

Google Summer of Code - Final Report

For the google summer of code final report of this project click here

About

+ =
SparkMonitor is an extension for Jupyter Notebook that enables the live monitoring of Apache Spark Jobs spawned from a notebook. The extension provides several features to monitor and debug a Spark job from within the notebook interface itself.

jobdisplay

Features

  • Automatically displays a live monitoring tool below cells that run Spark jobs in a Jupyter notebook
  • A table of jobs and stages with progressbars
  • A timeline which shows jobs, stages, and tasks
  • A graph showing number of active tasks & executor cores vs time
  • A notebook server extension that proxies the Spark UI and displays it in an iframe popup for more details
  • For a detailed list of features see the use case notebooks
  • How it Works

Quick Installation

pip install sparkmonitor
jupyter nbextension install sparkmonitor --py --user --symlink 
jupyter nbextension enable sparkmonitor --py --user            
jupyter serverextension enable --py --user sparkmonitor
ipython profile create && echo "c.InteractiveShellApp.extensions.append('sparkmonitor.kernelextension')" >>  $(ipython profile locate default)/ipython_kernel_config.py

For more detailed instructions click here

To do a quick test of the extension:

docker run -it -p 8888:8888 krishnanr/sparkmonitor

Integration with ROOT and SWAN

At CERN, the SparkMonitor extension would find two main use cases:

  • Distributed analysis with ROOT and Apache Spark using the DistROOT module. Here is an example demonstrating this use case.
  • Integration with SWAN, A service for web based analysis, via a modified container image for SWAN user sessions.
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].