All Projects → Splash → Similar Projects or Alternatives

1006 Open source projects that are alternatives of or similar to Splash

Azure Event Hubs Spark

Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs

Stars: ✭ 140 (+33.33%)

Mutual labels: spark, bigdata, apache-spark

Spark

.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.

Stars: ✭ 1,721 (+1539.05%)

Mutual labels: spark, bigdata, apache-spark

Mobius

C# and F# language binding and extensions to Apache Spark

Stars: ✭ 929 (+784.76%)

Mutual labels: spark, bigdata, apache-spark

leaflet heatmap

简单的可视化湖州通话数据假设数据量很大，没法用浏览器直接绘制热力图，把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后，再使用Apache Spark绘制热力图，然后用leafletjs加载OpenStreetMap图层和热力图图层，以达到良好的交互效果。现在使用Apache Spark实现绘制，可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法，并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .

Stars: ✭ 13 (-87.62%)

Mutual labels: spark, apache-spark, bigdata

Sparkrdma

RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark

Stars: ✭ 215 (+104.76%)

Mutual labels: spark, bigdata, apache-spark

yuzhouwan

Code Library for My Blog

Stars: ✭ 39 (-62.86%)

Mutual labels: spark, bigdata

Big Data Rosetta Code

Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code

Stars: ✭ 254 (+141.9%)

Mutual labels: spark, bigdata

Docker Spark Cluster

A simple spark standalone cluster for your testing environment purposses

Stars: ✭ 261 (+148.57%)

Mutual labels: spark, bigdata

Bigdata Notes

大数据入门指南 ⭐

Stars: ✭ 10,991 (+10367.62%)

Mutual labels: spark, bigdata

Sidekick

High Performance HTTP Sidecar Load Balancer

Stars: ✭ 366 (+248.57%)

Mutual labels: spark, bigdata

Agile data code 2

Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition

Stars: ✭ 413 (+293.33%)

Mutual labels: spark, apache-spark

Cortx

CORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.

Stars: ✭ 426 (+305.71%)

Mutual labels: bigdata, storage

SparkProgrammingInScala

Apache Spark Course Material

Stars: ✭ 57 (-45.71%)

Mutual labels: apache-spark, bigdata

data processing course

Some class materials for a data processing course using PySpark

Stars: ✭ 50 (-52.38%)

Mutual labels: spark, bigdata

spark-structured-streaming-examples

Spark structured streaming examples with using of version 3.0.0

Stars: ✭ 23 (-78.1%)

Mutual labels: spark, apache-spark

Spark Jupyter Aws

A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support

Stars: ✭ 259 (+146.67%)

Mutual labels: spark, apache-spark

Spark Py Notebooks

Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks

Stars: ✭ 1,338 (+1174.29%)

Mutual labels: spark, bigdata

Wirbelsturm

Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.

Stars: ✭ 332 (+216.19%)

Mutual labels: spark, apache-spark

Big data architect skills

一个大数据架构师应该掌握的技能

Stars: ✭ 400 (+280.95%)

Mutual labels: spark, bigdata

Spark Notebook

Interactive and Reactive Data Science using Scala and Spark.

Stars: ✭ 3,081 (+2834.29%)

Mutual labels: spark, apache-spark

Sparklyr

R interface for Apache Spark

Stars: ✭ 775 (+638.1%)

Mutual labels: spark, apache-spark

Goodreads etl pipeline

An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.

Stars: ✭ 793 (+655.24%)

Mutual labels: spark, apache-spark

Spark States

Custom state store providers for Apache Spark

Stars: ✭ 83 (-20.95%)

Mutual labels: spark, apache-spark

Spark Streaming Monitoring With Lightning

Plot live-stats as graph from ApacheSpark application using Lightning-viz

Stars: ✭ 15 (-85.71%)

Mutual labels: bigdata, apache-spark

Real Time Stream Processing Engine

This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.

Stars: ✭ 37 (-64.76%)

Mutual labels: spark, apache-spark

Spark Flamegraph

Easy CPU Profiling for Apache Spark applications

Stars: ✭ 30 (-71.43%)

Mutual labels: spark, apache-spark

Optimus

🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark

Stars: ✭ 986 (+839.05%)

Mutual labels: spark, bigdata

SparkTwitterAnalysis

An Apache Spark standalone application using the Spark API in Scala. The application uses Simple Build Tool(SBT) for building the project.

Stars: ✭ 29 (-72.38%)

Mutual labels: apache-spark, bigdata

gan deeplearning4j

Automatic feature engineering using Generative Adversarial Networks using Deeplearning4j and Apache Spark.

Stars: ✭ 19 (-81.9%)

Mutual labels: apache-spark, bigdata

spark-gradle-template

Apache Spark in your IDE with gradle

Stars: ✭ 39 (-62.86%)

Mutual labels: spark, apache-spark

Every Single Day I Tldr

A daily digest of the articles or videos I've found interesting, that I want to share with you.

Stars: ✭ 249 (+137.14%)

Mutual labels: spark, bigdata

aut

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

Stars: ✭ 111 (+5.71%)

Mutual labels: spark, apache-spark

incubator-linkis

Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.

Stars: ✭ 2,459 (+2241.9%)

Mutual labels: spark, storage

Data Accelerator

Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.

Stars: ✭ 247 (+135.24%)

Mutual labels: spark, apache-spark

Coolplayspark

酷玩 Spark: Spark 源代码解析、Spark 类库等

Stars: ✭ 3,318 (+3060%)

Mutual labels: spark, apache-spark

Learningsparkv2

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

Stars: ✭ 307 (+192.38%)

Mutual labels: spark, apache-spark

Cuesheet

A framework for writing Spark 2.x applications in a pretty way

Stars: ✭ 86 (-18.1%)

Mutual labels: spark, apache-spark

Spline

Data Lineage Tracking And Visualization Solution

Stars: ✭ 306 (+191.43%)

Mutual labels: spark, bigdata

Spark Structured Streaming Book

The Internals of Spark Structured Streaming

Stars: ✭ 371 (+253.33%)

Mutual labels: spark, apache-spark

Sparkmeasure

This is the development repository of SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics data.

Stars: ✭ 368 (+250.48%)

Mutual labels: spark, apache-spark

Sparkle

Haskell on Apache Spark.

Stars: ✭ 419 (+299.05%)

Mutual labels: spark, apache-spark

Dpark

Python clone of Spark, a MapReduce alike framework in Python

Stars: ✭ 2,668 (+2440.95%)

Mutual labels: spark, bigdata

Coding Now

学习记录的一些笔记，以及所看得一些电子书eBooks、视频资源和平常收纳的一些自己认为比较好的博客、网站、工具。涉及大数据几大组件、Python机器学习和数据分析、Linux、操作系统、算法、网络等

Stars: ✭ 750 (+614.29%)

Mutual labels: spark, bigdata

Spark Movie Lens

An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset

Stars: ✭ 745 (+609.52%)

Mutual labels: spark, bigdata

Bigdataguide

大数据学习，从零开始学习大数据，包含大数据学习各阶段学习视频、面试资料

Stars: ✭ 817 (+678.1%)

Mutual labels: spark, bigdata

Kafka Storm Starter

Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.

Stars: ✭ 728 (+593.33%)

Mutual labels: spark, apache-spark

Live log analyzer spark

Spark Application for analysis of Apache Access logs and detect anamolies! Along with Medium Article.

Stars: ✭ 14 (-86.67%)

Mutual labels: spark, apache-spark

Bigdata Interview

🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结

Stars: ✭ 857 (+716.19%)

Mutual labels: spark, bigdata

Bigdataie

大数据博客、笔试题、教程、项目、面经的整理