japila-books / Apache Spark Internals

Licence: apache-2.0
The Internals of Apache Spark

The Internals of Apache Spark Online Book

The project contains the sources of The Internals of Apache Spark online book.

Tools

The project is based on or uses the following tools:

Previewing Book

Custom Docker Image

This project uses a custom Docker image (based on the Insiders image) because the official Docker image includes only a few plugins.

Build the custom Docker image using the following command:

docker build \
  -t jaceklaskowski/mkdocs-material-insiders \
  -t jaceklaskowski/mkdocs-material-insiders:6.2.3-insiders-1.15.0 \
  .

Building Book

Run the following command to build the book.

docker run \
  -it \
  -p 8000:8000 \
  -v "${PWD}":/docs \
  jaceklaskowski/mkdocs-material-insiders \
  build --clean

TIP: Consult the Material for MkDocs documentation to get started.

Live Editing

Start mkdocs serve (with --dirtyreload for faster reloads) as follows:

docker run \
  -it \
  -p 8000:8000 \
  -v "${PWD}":/docs \
  jaceklaskowski/mkdocs-material-insiders \
  serve --dirtyreload --verbose --dev-addr 0.0.0.0:8000

Run the above command from the project root (the folder with mkdocs.yml), because the current directory is what gets mounted into the container as /docs.
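
If you often work from a subdirectory, a small helper can locate the project root before you start the container. The following is only a sketch: find_mkdocs_root is a hypothetical function name, not part of this project or of MkDocs, and it assumes the root is the nearest parent directory that contains mkdocs.yml.

```shell
# Hypothetical helper (not part of this project): walk up from the current
# directory and print the first directory that contains mkdocs.yml.
find_mkdocs_root() {
  dir="$PWD"
  while :; do
    if [ -f "$dir/mkdocs.yml" ]; then
      printf '%s\n' "$dir"
      return 0
    fi
    # Stop at the filesystem root if nothing was found.
    [ "$dir" = "/" ] && return 1
    dir="$(dirname "$dir")"
  done
}

# Possible usage (assumption: run this before the docker command above):
#   root="$(find_mkdocs_root)" || { echo "mkdocs.yml not found" >&2; exit 1; }
#   cd "$root"
```

The function prints nothing and returns a non-zero status when no mkdocs.yml is found, so it can guard the docker run command in a script.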

No Sphinx?! Why?

Read "Giving up on Read the Docs, reStructuredText and Sphinx".
