All Projects → jaceklaskowski → Spark Structured Streaming Book

jaceklaskowski / Spark Structured Streaming Book

Licence: apache-2.0
The Internals of Spark Structured Streaming

Projects that are alternatives of or similar to Spark Structured Streaming Book

Apache Spark Internals
The Internals of Apache Spark
Stars: ✭ 1,045 (+181.67%)
Mutual labels:  spark, apache-spark, book
Mastering Spark Sql Book
The Internals of Spark SQL
Stars: ✭ 234 (-36.93%)
Mutual labels:  spark, apache-spark, book
Data Accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-33.42%)
Mutual labels:  spark, apache-spark
spark-gradle-template
Apache Spark in your IDE with gradle
Stars: ✭ 39 (-89.49%)
Mutual labels:  spark, apache-spark
spark-structured-streaming-examples
Spark structured streaming examples with using of version 3.0.0
Stars: ✭ 23 (-93.8%)
Mutual labels:  spark, apache-spark
Sparkrdma
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (-42.05%)
Mutual labels:  spark, apache-spark
Spark Workshop
Apache Spark™ and Scala Workshops
Stars: ✭ 224 (-39.62%)
Mutual labels:  spark, apache-spark
aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-70.08%)
Mutual labels:  spark, apache-spark
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-59.57%)
Mutual labels:  spark, apache-spark
Spark Gotchas
Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks
Stars: ✭ 308 (-16.98%)
Mutual labels:  apache-spark, book
Spark Notebook
Interactive and Reactive Data Science using Scala and Spark.
Stars: ✭ 3,081 (+730.46%)
Mutual labels:  spark, apache-spark
Learningsparkv2
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Stars: ✭ 307 (-17.25%)
Mutual labels:  spark, apache-spark
Mmlspark
Simple and Distributed Machine Learning
Stars: ✭ 2,899 (+681.4%)
Mutual labels:  spark, apache-spark
Azure Cosmosdb Spark
Apache Spark Connector for Azure Cosmos DB
Stars: ✭ 165 (-55.53%)
Mutual labels:  spark, apache-spark
Whylogs Java
Profile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-55.8%)
Mutual labels:  spark, apache-spark
leaflet heatmap
简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-96.5%)
Mutual labels:  spark, apache-spark
Wirbelsturm
Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.
Stars: ✭ 332 (-10.51%)
Mutual labels:  spark, apache-spark
Spark On Lambda
Apache Spark on AWS Lambda
Stars: ✭ 137 (-63.07%)
Mutual labels:  spark, apache-spark
Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-62.26%)
Mutual labels:  spark, apache-spark
Spark Jupyter Aws
A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
Stars: ✭ 259 (-30.19%)
Mutual labels:  spark, apache-spark

The Internals of Spark Structured Streaming Online Book

CI

The project contains the sources of The Internals of Spark Structured Streaming online book.

Tools

The project is based on or uses the following tools:

Previewing Book

Custom Docker Image

This project uses a custom Docker image (based on the Insiders image) since the official Docker image includes just a few plugins only.

Build the custom Docker image using the following command:

docker build \
  -t jaceklaskowski/mkdocs-material-insiders \
  -t jaceklaskowski/mkdocs-material-insiders:6.2.3-insiders-1.15.0 \
  .

Building Book

Run the following command to build the book.

docker run \
  -it \
  -p 8000:8000 \
  -v ${PWD}:/docs \
  jaceklaskowski/mkdocs-material-insiders \
  build --clean

TIP: Consult the Material for MkDocs documentation to get started.

Live Editing

Start mkdocs serve (with --dirtyreload for faster reloads) as follows:

docker run \
  -it \
  -p 8000:8000 \
  -v ${PWD}:/docs \
  jaceklaskowski/mkdocs-material-insiders \
  serve --dirtyreload --verbose --dev-addr 0.0.0.0:8000

You should start the above command in the project root (the folder with mkdocs.yml).

No Sphinx?! Why?

Read Giving up on Read the Docs, reStructuredText and Sphinx.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].