Quinnpyspark methods to enhance developer productivity 📣 👯 🎉
Stars: ✭ 217 (+158.33%)
Mutual labels: apache-spark, pyspark
Spark GotchasSpark Gotchas. A subjective compilation of the Apache Spark tips and tricks
Stars: ✭ 308 (+266.67%)
Mutual labels: apache-spark, pyspark
pyspark-asyncactionsAsynchronous actions for PySpark
Stars: ✭ 30 (-64.29%)
Mutual labels: apache-spark, pyspark
spark3DSpark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteorology, …
Stars: ✭ 23 (-72.62%)
Mutual labels: apache-spark, pyspark
isarn-sketches-sparkRoutines and data structures for using isarn-sketches idiomatically in Apache Spark
Stars: ✭ 28 (-66.67%)
Mutual labels: apache-spark, pyspark
pyspark-cheatsheetPySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
Stars: ✭ 115 (+36.9%)
Mutual labels: apache-spark, pyspark
MmlsparkSimple and Distributed Machine Learning
Stars: ✭ 2,899 (+3351.19%)
Mutual labels: apache-spark, pyspark
SparkoraPowerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (-39.29%)
Mutual labels: apache-spark, pyspark
Awesome SparkA curated list of awesome Apache Spark packages and resources.
Stars: ✭ 1,061 (+1163.1%)
Mutual labels: apache-spark, pyspark
Live log analyzer sparkSpark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.
Stars: ✭ 14 (-83.33%)
Mutual labels: apache-spark, pyspark
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (+78.57%)
Mutual labels: apache-spark, pyspark
SynapseMLSimple and Distributed Machine Learning
Stars: ✭ 3,355 (+3894.05%)
Mutual labels: apache-spark, pyspark
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (+32.14%)
Mutual labels: apache-spark, pyspark
jupyterlab-sparkmonitorJupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook
Stars: ✭ 78 (-7.14%)
Mutual labels: apache-spark, pyspark
mmtf-workshop-2018Structural Bioinformatics Training Workshop & Hackathon 2018
Stars: ✭ 50 (-40.48%)
Mutual labels: apache-spark, pyspark
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-53.57%)
Mutual labels: apache-spark, pyspark
Pyspark BoilerplateA boilerplate for writing PySpark Jobs
Stars: ✭ 318 (+278.57%)
Mutual labels: apache-spark, pyspark
Pyspark StubsApache (Py)Spark type annotations (stub files).
Stars: ✭ 98 (+16.67%)
Mutual labels: apache-spark, pyspark