sabman / Pysparkgeoanalysis
Interactive Workshop on GeoAnalysis using PySpark
Stars: ★ 63
Projects that are alternatives of or similar to Pysparkgeoanalysis
Pyspark Learning
Updated repository
Stars: ★ 147 (+133.33%)
Mutual labels: jupyter-notebook, spark, pyspark
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ★ 1,338 (+2023.81%)
Mutual labels: jupyter-notebook, spark, pyspark
W2v
Word2Vec models with Twitter data using Spark. Blog:
Stars: ★ 64 (+1.59%)
Mutual labels: jupyter-notebook, spark, pyspark
Azure Cosmosdb Spark
Apache Spark Connector for Azure Cosmos DB
Stars: ★ 165 (+161.9%)
Mutual labels: jupyter-notebook, spark, pyspark
Optimus
Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ★ 986 (+1465.08%)
Mutual labels: jupyter-notebook, spark, pyspark
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ★ 150 (+138.1%)
Mutual labels: jupyter-notebook, spark, pyspark
Spark Tdd Example
A simple Spark TDD example
Stars: ★ 23 (-63.49%)
Mutual labels: jupyter-notebook, spark, pyspark
Sparkmagic
Jupyter magics and kernels for working with remote Spark clusters
Stars: ★ 954 (+1414.29%)
Mutual labels: jupyter-notebook, spark, pyspark
Handyspark
HandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ★ 158 (+150.79%)
Mutual labels: jupyter-notebook, spark, pyspark
Spark Practice
Apache Spark (PySpark) Practice on Real Data
Stars: ★ 200 (+217.46%)
Mutual labels: jupyter-notebook, spark, pyspark
Scriptis
Scriptis is for interactive data analysis with script development (SQL, Pyspark, HiveQL), task submission (Spark, Hive), UDF, function, resource management and intelligent diagnosis.
Stars: ★ 696 (+1004.76%)
Mutual labels: spark, pyspark
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ★ 633 (+904.76%)
Mutual labels: spark, pyspark
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ★ 5,656 (+8877.78%)
Mutual labels: jupyter-notebook, spark
Justenoughscalaforspark
A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.
Stars: ★ 538 (+753.97%)
Mutual labels: jupyter-notebook, spark
Spark Movie Lens
An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ★ 745 (+1082.54%)
Mutual labels: jupyter-notebook, spark
Elasticsearch Spark Recommender
Use Jupyter Notebooks to demonstrate how to build a Recommender with Apache Spark & Elasticsearch
Stars: ★ 707 (+1022.22%)
Mutual labels: jupyter-notebook, spark
Pyspark Setup Demo
Demo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks
Stars: ★ 24 (-61.9%)
Mutual labels: jupyter-notebook, pyspark
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ★ 413 (+555.56%)
Mutual labels: jupyter-notebook, spark
Spark Scala Tutorial
A free tutorial for Apache Spark.
Stars: ★ 907 (+1339.68%)
Mutual labels: jupyter-notebook, spark
Docker Image Test Status:
A Small Course on Big Data - GeoAnalysis using PySpark
House Keeping
Who's Here?
I love staying in touch: here's a link to a form where you can add your details so I can stay in touch with you. I also love feedback, good and bad! I love to get better at my job. So as we go through this course, please keep in mind that I will ask you to provide some feedback afterwards. You can keep it anonymous or choose to tell me who you are. See the feedback form here: Feedback Form
- Who is using Spark in Production?
- Who is doing Geospatial Analysis using Spark?
- Who is a programmer?
- Who is a Data Janitor... err, I mean Scientist?
- Who is a hedge fund manager? ... here's my number 181821113 (bank account number, that is!)
- Who is doing something else that I have missed?
Introduction
This workshop will introduce you to Apache Spark via the exciting domain of Geospatial Analysis.
Setup
Dependencies:
See: docker/README.md
Data
If you use Docker, the data will be downloaded automatically into the work-flow folder. See docker/README.md