isarn-sketches-sparkRoutines and data structures for using isarn-sketches idiomatically in Apache Spark
Stars: ✭ 28 (-90.91%)
Bash Handbook📖 For those who wanna learn Bash
Stars: ✭ 4,691 (+1423.05%)
Awesome SparkA curated list of awesome Apache Spark packages and resources.
Stars: ✭ 1,061 (+244.48%)
pyspark-cheatsheetPySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster
Stars: ✭ 115 (-62.66%)
spark3DSpark extension for processing large-scale 3D data sets: Astrophysics, High Energy Physics, Meteorology, …
Stars: ✭ 23 (-92.53%)
MmlsparkSimple and Distributed Machine Learning
Stars: ✭ 2,899 (+841.23%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-87.34%)
Pure Bash Bible📖 A collection of pure bash alternatives to external processes.
Stars: ✭ 28,109 (+9026.3%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-51.3%)
Pure Sh Bible📖 A collection of pure POSIX sh alternatives to external processes.
Stars: ✭ 3,246 (+953.9%)
Live log analyzer sparkSpark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.
Stars: ✭ 14 (-95.45%)
Quinnpyspark methods to enhance developer productivity 📣 👯 🎉
Stars: ✭ 217 (-29.55%)
Pyspark StubsApache (Py)Spark type annotations (stub files).
Stars: ✭ 98 (-68.18%)
jupyterlab-sparkmonitorJupyterLab extension that enables monitoring launched Apache Spark jobs from within a notebook
Stars: ✭ 78 (-74.68%)
SynapseMLSimple and Distributed Machine Learning
Stars: ✭ 3,355 (+989.29%)
Pyspark Cheatsheet🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-64.94%)
SparkoraPowerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (-83.44%)
Python GuidePython best practices guidebook, written for humans.
Stars: ✭ 24,050 (+7708.44%)
Choo Handbook🚂✋📖 - Learn the choo framework through a set of exercises
Stars: ✭ 266 (-13.64%)
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-63.96%)
mmtf-workshop-2018Structural Bioinformatics Training Workshop & Hackathon 2018
Stars: ✭ 50 (-83.77%)
Hexo Douban 💿 A simple plugin for hexo that helps us generate pages for douban books ,movies and games.
Stars: ✭ 277 (-10.06%)
ProvisioningKubernetes cluster provisioning using Terraform.
Stars: ✭ 277 (-10.06%)
Golang TutorialsGo Tutorials - Let's get our hands really dirty by writing a lot of Golang code
Stars: ✭ 277 (-10.06%)
Read WeeklyThink Outside The Box And Monkey Reading / 每周一书
Stars: ✭ 300 (-2.6%)
Nodebook📖 Livre publié aux Éditions Eyrolles • Première édition : Node.js v10 et npm v6.
Stars: ✭ 286 (-7.14%)
Hott IntroAn introductory course to Homotopy Type Theory
Stars: ✭ 277 (-10.06%)
Tdigestt-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark
Stars: ✭ 274 (-11.04%)
Fe Guide🌈汇集了前端技术书籍、前端热门技术、前端发展等资料。
Stars: ✭ 276 (-10.39%)
MorpheusMorpheus brings the leading graph query language, Cypher, onto the leading distributed processing platform, Spark.
Stars: ✭ 303 (-1.62%)
TbdSource for TrunkBasedDevelopment.com
Stars: ✭ 299 (-2.92%)
Tooltip SequenceA simple step by step tooltip helper for any site
Stars: ✭ 287 (-6.82%)
Byte Of Vim"A Byte of Vim" is a book which aims to help you to learn how to use the Vim editor (version 7), even if all you know is how to use the computer keyboard.
Stars: ✭ 283 (-8.12%)
RwdtowRuby Web Dev: The Other Way. Personal best practices guide.
Stars: ✭ 267 (-13.31%)
SparkflowEasy to use library to bring Tensorflow on Apache Spark
Stars: ✭ 282 (-8.44%)
R4dsR for data science: a book
Stars: ✭ 3,231 (+949.03%)