All Projects → Pyspark Learning → Similar Projects or Alternatives

6372 Open source projects that are alternatives of or similar to Pyspark Learning

Spark Tdd Example
A simple Spark TDD example
Stars: ✭ 23 (-84.35%)
Mutual labels:  jupyter-notebook, spark, pyspark
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+2.04%)
Mutual labels:  jupyter-notebook, spark, pyspark
Spark Practice
Apache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (+36.05%)
Mutual labels:  jupyter-notebook, spark, pyspark
W2v
Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-56.46%)
Mutual labels:  jupyter-notebook, spark, pyspark
Pysparkgeoanalysis
🌐 Interactive Workshop on GeoAnalysis using PySpark
Stars: ✭ 63 (-57.14%)
Mutual labels:  jupyter-notebook, spark, pyspark
Gimel
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (+46.94%)
Mutual labels:  spark, pyspark, spark-streaming
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+810.2%)
Mutual labels:  jupyter-notebook, spark, pyspark
Handyspark
HandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (+7.48%)
Mutual labels:  jupyter-notebook, spark, pyspark
Sparkmagic
Jupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+548.98%)
Mutual labels:  jupyter-notebook, spark, pyspark
Optimus
🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+570.75%)
Mutual labels:  jupyter-notebook, spark, pyspark
Azure Cosmosdb Spark
Apache Spark Connector for Azure Cosmos DB
Stars: ✭ 165 (+12.24%)
Mutual labels:  jupyter-notebook, spark, pyspark
Pyspark Examples
Code examples on Apache Spark using python
Stars: ✭ 58 (-60.54%)
Enterprise gateway
A lightweight, multi-tenant, scalable and secure gateway that enables Jupyter Notebooks to share resources across distributed clusters such as Apache Spark, Kubernetes and others.
Stars: ✭ 412 (+180.27%)
Mutual labels:  jupyter-notebook, spark
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+180.95%)
Mutual labels:  jupyter-notebook, spark
Cdap
An open source framework for building data analytic applications.
Stars: ✭ 509 (+246.26%)
Mutual labels:  spark, spark-streaming
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+330.61%)
Mutual labels:  spark, pyspark
Sparta
Real Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (+248.98%)
Mutual labels:  spark, spark-streaming
Scriptis
Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.
Stars: ✭ 696 (+373.47%)
Mutual labels:  spark, pyspark
Spark Movie Lens
An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+406.8%)
Mutual labels:  jupyter-notebook, spark
Pyspark Setup Demo
Demo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks
Stars: ✭ 24 (-83.67%)
Mutual labels:  jupyter-notebook, pyspark
Mobius
C# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+531.97%)
Mutual labels:  spark, spark-streaming
Live log analyzer spark
Spark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.
Stars: ✭ 14 (-90.48%)
Mutual labels:  spark, pyspark
Real Time Stream Processing Engine
This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.
Stars: ✭ 37 (-74.83%)
Mutual labels:  spark, spark-streaming
Angel
A Flexible and Powerful Parameter Server for large-scale machine learning
Stars: ✭ 6,458 (+4293.2%)
Mutual labels:  spark, spark-streaming
Tedsds
Apache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-90.48%)
Mutual labels:  jupyter-notebook, spark
Pixiedust
Python Helper library for Jupyter Notebooks
Stars: ✭ 998 (+578.91%)
Mutual labels:  jupyter-notebook, spark
Utils4s
scala、spark使用过程中,各种测试用例以及相关资料整理
Stars: ✭ 1,070 (+627.89%)
Mutual labels:  spark, spark-streaming
Spark Syntax
This is a repo documenting the best practices in PySpark.
Stars: ✭ 412 (+180.27%)
Mutual labels:  jupyter-notebook, pyspark
Devops Python Tools
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+176.19%)
Mutual labels:  spark, pyspark
Learningspark
Scala examples for learning to use Spark
Stars: ✭ 421 (+186.39%)
Mutual labels:  spark, spark-streaming
Coolplayspark
酷玩 Spark: Spark 源代码解析、Spark 类库等
Stars: ✭ 3,318 (+2157.14%)
Mutual labels:  spark, spark-streaming
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+3747.62%)
Mutual labels:  jupyter-notebook, spark
Justenoughscalaforspark
A tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.
Stars: ✭ 538 (+265.99%)
Mutual labels:  jupyter-notebook, spark
Elasticsearch Spark Recommender
Use Jupyter Notebooks to demonstrate how to build a Recommender with Apache Spark & Elasticsearch
Stars: ✭ 707 (+380.95%)
Mutual labels:  jupyter-notebook, spark
Zat
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (+106.12%)
Mutual labels:  jupyter-notebook, spark
Spark Scala Tutorial
A free tutorial for Apache Spark.
Stars: ✭ 907 (+517.01%)
Mutual labels:  jupyter-notebook, spark
Sparkling Titanic
Training models with Apache Spark, PySpark for Titanic Kaggle competition
Stars: ✭ 12 (-91.84%)
Mutual labels:  spark, pyspark
Yandex Big Data Engineering
Stars: ✭ 17 (-88.44%)
Mutual labels:  jupyter-notebook, spark
Learning Spark
零基础学习spark,大数据学习
Stars: ✭ 37 (-74.83%)
Mutual labels:  spark, spark-streaming
Spark States
Custom state store providers for Apache Spark
Stars: ✭ 83 (-43.54%)
Mutual labels:  spark, spark-streaming
Helk
The Hunting ELK
Stars: ✭ 3,097 (+2006.8%)
Mutual labels:  jupyter-notebook, spark
Hops Examples
Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Stars: ✭ 84 (-42.86%)
Mutual labels:  jupyter-notebook, spark
Spark Nlp Models
Models and Pipelines for the Spark NLP library
Stars: ✭ 88 (-40.14%)
Mutual labels:  jupyter-notebook, spark
Big Data Engineering Coursera Yandex
Big Data for Data Engineers Coursera Specialization from Yandex
Stars: ✭ 71 (-51.7%)
Mutual labels:  jupyter-notebook, spark
Spark python ml examples
Spark 2.0 Python Machine Learning examples
Stars: ✭ 87 (-40.82%)
Mutual labels:  spark, pyspark
Pyspark Tutorial
PySpark Code for Hands-on Learners
Stars: ✭ 91 (-38.1%)
Mutual labels:  jupyter-notebook, pyspark
Bitcoin Value Predictor
[NOT MAINTAINED] Predicting Bit coin price using Time series analysis and sentiment analysis of tweets on bitcoin
Stars: ✭ 91 (-38.1%)
Mutual labels:  jupyter-notebook, pyspark
Udacity Data Engineering
Udacity Data Engineering Nano Degree (DEND)
Stars: ✭ 89 (-39.46%)
Mutual labels:  jupyter-notebook, spark
Relation extraction
Relation Extraction using Deep learning(CNN)
Stars: ✭ 96 (-34.69%)
Mutual labels:  spark, pyspark
Hnswlib
Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (-26.53%)
Mutual labels:  spark, pyspark
Almond
A Scala kernel for Jupyter
Stars: ✭ 1,354 (+821.09%)
Mutual labels:  jupyter-notebook, spark
Data Science Cookbook
🎓 Jupyter notebooks from UFC data science course
Stars: ✭ 60 (-59.18%)
Mutual labels:  jupyter-notebook, spark
Spark Mllib Twitter Sentiment Analysis
🌟 ✨ Analyze and visualize Twitter Sentiment on a world map using Spark MLlib
Stars: ✭ 113 (-23.13%)
Mutual labels:  spark, spark-streaming
Python Bigdata
Data science and Big Data with Python
Stars: ✭ 112 (-23.81%)
Mutual labels:  jupyter-notebook, spark
Kinesis Sql
Kinesis Connector for Structured Streaming
Stars: ✭ 120 (-18.37%)
Mutual labels:  spark, spark-streaming
Eat pyspark in 10 days
pyspark🍒🥭 is delicious,just eat it!😋😋
Stars: ✭ 116 (-21.09%)
Mutual labels:  spark, pyspark
Waterdrop
Production Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+1162.59%)
Mutual labels:  spark, spark-streaming
Example Spark Kafka
Apache Spark and Apache Kafka integration example
Stars: ✭ 120 (-18.37%)
Mutual labels:  spark, spark-streaming
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+1070.75%)
Mutual labels:  spark, spark-streaming
basin
Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser
Stars: ✭ 25 (-82.99%)
Mutual labels:  spark, pyspark
1-60 of 6372 similar projects