All Projects β†’ sabman β†’ Pysparkgeoanalysis

sabman / Pysparkgeoanalysis

🌐 Interactive Workshop on GeoAnalysis using PySpark

Projects that are alternatives of or similar to Pysparkgeoanalysis

Pyspark Learning
Updated repository
Stars: ✭ 147 (+133.33%)
Mutual labels:  jupyter-notebook, spark, pyspark
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+2023.81%)
Mutual labels:  jupyter-notebook, spark, pyspark
W2v
Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (+1.59%)
Mutual labels:  jupyter-notebook, spark, pyspark
Azure Cosmosdb Spark
Apache Spark Connector for Azure Cosmos DB
Stars: ✭ 165 (+161.9%)
Mutual labels:  jupyter-notebook, spark, pyspark
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+1465.08%)
Mutual labels:  jupyter-notebook, spark, pyspark
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+138.1%)
Mutual labels:  jupyter-notebook, spark, pyspark
Spark Tdd Example
A simple Spark TDD example
Stars: ✭ 23 (-63.49%)
Mutual labels:  jupyter-notebook, spark, pyspark
Sparkmagic
Jupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+1414.29%)
Mutual labels:  jupyter-notebook, spark, pyspark
Handyspark
HandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (+150.79%)
Mutual labels:  jupyter-notebook, spark, pyspark
Spark Practice
Apache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (+217.46%)
Mutual labels:  jupyter-notebook, spark, pyspark
Scriptis
Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.
Stars: ✭ 696 (+1004.76%)
Mutual labels:  spark, pyspark
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+904.76%)
Mutual labels:  spark, pyspark
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+8877.78%)
Mutual labels:  jupyter-notebook, spark
Justenoughscalaforspark
A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.
Stars: ✭ 538 (+753.97%)
Mutual labels:  jupyter-notebook, spark
Spark Movie Lens
An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+1082.54%)
Mutual labels:  jupyter-notebook, spark
Elasticsearch Spark Recommender
Use Jupyter Notebooks to demonstrate how to build a Recommender with Apache Spark & Elasticsearch
Stars: ✭ 707 (+1022.22%)
Mutual labels:  jupyter-notebook, spark
Yandex Big Data Engineering
Stars: ✭ 17 (-73.02%)
Mutual labels:  jupyter-notebook, spark
Pyspark Setup Demo
Demo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks
Stars: ✭ 24 (-61.9%)
Mutual labels:  jupyter-notebook, pyspark
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+555.56%)
Mutual labels:  jupyter-notebook, spark
Spark Scala Tutorial
A free tutorial for Apache Spark.
Stars: ✭ 907 (+1339.68%)
Mutual labels:  jupyter-notebook, spark

Docker Image Test Status:

CircleCI

A Small Course on Big Data - GeoAnalysis using PySpark

House Keeping

Who's Here?

I love staying in touch here's a link to a form where you can add your details for me to stay in touch with you. I also love feedback good and bad! I love to get better at my job. So as we go though this course I want you to keep in mind that I will ask you to provide some feedback afterwards. You can keep it anonymous of choose to tell me who you are. See feedback form here: Feedback Form

  • Who is using Spark in Production?
  • Who is doing Geospatial Analysis using Spark?
  • Who is a programmer?
  • Who is a Data Janitor... err I mean Scientist πŸ˜„
  • Who is a hedge fund manager? ... here's my number 181821113 (bank account number, that is!)
  • Who is doing something else? I have missed?

Introduction

This workshop will introduce you to Apache Spark via the exciting domain of Geospatial Analysis.

Setup

Dependencies:

See: docker/README.md

Data

If you use docker the data will automatically downloaded into the work-flow folder. See docker/README.md

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].