2515 open source projects that are alternatives to or similar to Data Accelerator

Azure Event Hubs Spark

Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs

Stars: ✭ 140 (-43.32%)

Mutual labels: azure, kafka, spark, apache-spark, spark-streaming, streaming

Spark

.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.

Stars: ✭ 1,721 (+596.76%)

Mutual labels: azure, spark, apache-spark, spark-streaming, streaming

Sparta

Real Time Analytics and Data Pipelines based on Spark Streaming

Stars: ✭ 513 (+107.69%)

Mutual labels: kafka, spark, spark-streaming, streaming-data, streaming

Bigdata Playground

A complete example of a big data application using: Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLlib, Apache Flink, Scala, Python, Apache Kafka, Apache HBase, Apache Parquet, Apache Avro, Apache Storm, Twitter API, MongoDB, NodeJS, Angular, GraphQL

Stars: ✭ 177 (-28.34%)

Mutual labels: kafka, big-data, apache-spark, spark-streaming

Logisland

Scalable stream processing platform for advanced real-time analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink is on the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of ready-to-use processors, data sources and sinks is available.

Stars: ✭ 97 (-60.73%)

Mutual labels: kafka, spark, big-data, kafka-streams

Seldon Server

Machine Learning Platform and Recommendation Engine built on Kubernetes

Stars: ✭ 1,435 (+480.97%)

Mutual labels: azure, kafka, spark, kafka-streams

Awesome Kafka

A list about Apache Kafka

Stars: ✭ 397 (+60.73%)

Mutual labels: kafka, apache-spark, kafka-streams, streaming-data

Mobius

C# and F# language binding and extensions to Apache Spark

Stars: ✭ 929 (+276.11%)

Mutual labels: spark, apache-spark, spark-streaming, streaming

Real Time Stream Processing Engine

This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.

Stars: ✭ 37 (-85.02%)

Mutual labels: kafka, spark, apache-spark, spark-streaming

Streamline

StreamLine - Streaming Analytics

Stars: ✭ 151 (-38.87%)

Mutual labels: kafka, kafka-streams, spark-streaming, streaming

Mmlspark

Simple and Distributed Machine Learning

Stars: ✭ 2,899 (+1073.68%)

Mutual labels: azure, spark, big-data, apache-spark

Gimel

Big Data Processing Framework - Unified Data API or SQL on Any Storage

Stars: ✭ 216 (-12.55%)

Mutual labels: kafka, spark, big-data, spark-streaming

Spark On Lambda

Apache Spark on AWS Lambda

Stars: ✭ 137 (-44.53%)

Mutual labels: spark, big-data, apache-spark

Azure Event Hubs

☁️ Cloud-scale telemetry ingestion from any stream of data with Azure Event Hubs

Stars: ✭ 233 (-5.67%)

Mutual labels: azure, spark, streaming

Spark States

Custom state store providers for Apache Spark

Stars: ✭ 83 (-66.4%)

Mutual labels: spark, apache-spark, spark-streaming

Streaming Readings

Papers and reading material on streaming systems

Stars: ✭ 554 (+124.29%)

Mutual labels: apache-spark, spark-streaming, streaming

Kafka Ui

Open-Source Web GUI for Apache Kafka Management

Stars: ✭ 230 (-6.88%)

Mutual labels: kafka, big-data, kafka-streams

Kafka Connect Hdfs

Kafka Connect HDFS connector

Stars: ✭ 400 (+61.94%)

Mutual labels: kafka, big-data, streaming

Bigdata Notes

A beginner's guide to big data ⭐

Stars: ✭ 10,991 (+4349.8%)

Mutual labels: kafka, spark, big-data

Spark With Python

Fundamentals of Spark with Python (using PySpark), code examples

Stars: ✭ 150 (-39.27%)

Mutual labels: spark, big-data, apache-spark

leaflet heatmap

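A simple visualization of call data from Huzhou. Assuming the data volume is too large to draw a heatmap directly in the browser, the heatmap rendering step is moved to offline computation and analysis. The data is first processed in parallel with Apache Spark, the heatmap is then drawn with Apache Spark, and leafletjs loads the OpenStreetMap layer and the heatmap layer for good interactivity. The rendering is currently implemented with Apache Spark; either Apache Spark is not well suited to this kind of computation or I did not design the algorithm well, because the parallel computation is slower than the single-machine computation. The Apache Spark heatmap rendering and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .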

Stars: ✭ 13 (-94.74%)

Mutual labels: big-data, spark, apache-spark

Agile data code 2

Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition

Stars: ✭ 413 (+67.21%)

Mutual labels: kafka, spark, apache-spark

aut

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

Stars: ✭ 111 (-55.06%)

Mutual labels: big-data, spark, apache-spark

Coolplayspark

Coolplay Spark: Spark source code analysis, Spark libraries, and more

Stars: ✭ 3,318 (+1243.32%)

Mutual labels: spark, apache-spark, spark-streaming

Kafka Streams

equivalent to kafka-streams 🐙 for nodejs ✨🐢🚀✨

Stars: ✭ 613 (+148.18%)

Mutual labels: kafka, big-data, kafka-streams

Go Streams

A lightweight stream processing library for Go

Stars: ✭ 615 (+148.99%)

Mutual labels: kafka, kafka-streams, streaming-data

Spark Streaming With Kafka

Self-contained examples of Apache Spark streaming integrated with Apache Kafka.

Stars: ✭ 180 (-27.13%)

Mutual labels: kafka, spark, spark-streaming

Bandar Log

Monitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.

Stars: ✭ 19 (-92.31%)

Mutual labels: kafka, big-data, spark-streaming

Kafka Storm Starter

Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.

Stars: ✭ 728 (+194.74%)

Mutual labels: kafka, spark, apache-spark

Thingsboard

Open-source IoT Platform - Device management, data collection, processing and visualization.

Stars: ✭ 10,526 (+4161.54%)

Mutual labels: kafka, spark, iot

Kafka Streams In Action

Source code for the Kafka Streams in Action Book

Stars: ✭ 167 (-32.39%)

Mutual labels: kafka, streaming-data, streaming

Bigdata Notebook

Stars: ✭ 100 (-59.51%)

Mutual labels: kafka, spark, streaming

Sparkrdma

RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark

Stars: ✭ 215 (-12.96%)

Mutual labels: spark, big-data, apache-spark

Wirbelsturm

Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.

Stars: ✭ 332 (+34.41%)

Mutual labels: kafka, spark, apache-spark

Streamx

kafka-connect-s3: Ingest data from Kafka to object stores (S3)

Stars: ✭ 96 (-61.13%)

Mutual labels: kafka, big-data, streaming

Flink Learning

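Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and more, plus case studies of large production Flink projects (PV/UV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). Support for my column 《大数据实时计算引擎 Flink 实战与性能优化》 (Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization) is welcome.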

Stars: ✭ 11,378 (+4506.48%)

Mutual labels: kafka, spark, streaming

Example Spark Kafka

Apache Spark and Apache Kafka integration example

Stars: ✭ 120 (-51.42%)

Mutual labels: kafka, spark, spark-streaming

Flogo

Project Flogo is an open source ecosystem of opinionated event-driven capabilities to simplify building efficient & modern serverless functions, microservices & edge apps.

Stars: ✭ 1,891 (+665.59%)

Mutual labels: iot, streaming

Sparkling Graph

SparklingGraph provides an easy-to-use set of features for processing large-scale graphs using Spark and GraphX.

Stars: ✭ 139 (-43.72%)

Mutual labels: spark, big-data

Aliyun Emapreduce Datasources

Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.

Stars: ✭ 132 (-46.56%)

Mutual labels: kafka, spark

Eel Sdk

Big Data Toolkit for the JVM

Stars: ✭ 140 (-43.32%)

Mutual labels: kafka, big-data

Kafka Tutorials

Kafka Tutorials microsite

Stars: ✭ 144 (-41.7%)

Mutual labels: kafka, kafka-streams

Oryx

Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning

Stars: ✭ 1,785 (+622.67%)

Mutual labels: kafka, apache-spark

Mastering Spark Sql Book

The Internals of Spark SQL

Stars: ✭ 234 (-5.26%)

Mutual labels: spark, apache-spark

Samsara

Samsara is a real-time analytics platform

Stars: ✭ 132 (-46.56%)

Mutual labels: kafka, iot

Hydrograph

A visual ETL development and debugging tool for big data

Stars: ✭ 144 (-41.7%)

Mutual labels: big-data, apache-spark

Technology Talk

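A collection of knowledge on the Java ecosystem: commonly used frameworks, open-source middleware, system architecture, databases, architecture case studies from large companies, common third-party libraries, project management, production troubleshooting, personal growth, and reflections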

Stars: ✭ 12,136 (+4813.36%)

Mutual labels: kafka, spark

Supersafebank

Sample Event Sourcing implementation with .NET Core

Stars: ✭ 142 (-42.51%)

Mutual labels: azure, kafka

A Kafka Story

Kafka ecosystem ... but step by step!

Stars: ✭ 148 (-40.08%)

Mutual labels: kafka, kafka-streams

Azkarra Streams

🚀 Azkarra is a lightweight java framework to make it easy to develop, deploy and manage cloud-native streaming microservices based on Apache Kafka Streams.

Stars: ✭ 146 (-40.89%)

Mutual labels: kafka, kafka-streams

Tributary

Streaming reactive and dataflow graphs in Python

Stars: ✭ 231 (-6.48%)

Mutual labels: kafka, streaming

Pyspark Learning

Updated repository

Stars: ✭ 147 (-40.49%)

Mutual labels: spark, spark-streaming

Netty Learning Example

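🥚 Hands-on Netty learning examples that show the big picture through small details! Bring your heart and follow the tutorial. I believe you can do it.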