2515 open source projects that are alternatives to or similar to Data Accelerator

Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-43.32%)
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+596.76%)
Sparta
Real Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (+107.69%)
Bigdata Playground
A complete example of a big data application using: Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLlib, Apache Flink, Scala, Python, Apache Kafka, Apache HBase, Apache Parquet, Apache Avro, Apache Storm, Twitter API, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (-28.34%)
Logisland
Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-60.73%)
Mutual labels:  kafka, spark, big-data, kafka-streams
Seldon Server
Machine Learning Platform and Recommendation Engine built on Kubernetes
Stars: ✭ 1,435 (+480.97%)
Mutual labels:  azure, kafka, spark, kafka-streams
Awesome Kafka
A list about Apache Kafka
Stars: ✭ 397 (+60.73%)
Mobius
C# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+276.11%)
Real Time Stream Processing Engine
This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.
Stars: ✭ 37 (-85.02%)
Mutual labels:  kafka, spark, apache-spark, spark-streaming
Streamline
StreamLine - Streaming Analytics
Stars: ✭ 151 (-38.87%)
Mmlspark
Simple and Distributed Machine Learning
Stars: ✭ 2,899 (+1073.68%)
Mutual labels:  azure, spark, big-data, apache-spark
Gimel
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-12.55%)
Mutual labels:  kafka, spark, big-data, spark-streaming
Spark On Lambda
Apache Spark on AWS Lambda
Stars: ✭ 137 (-44.53%)
Mutual labels:  spark, big-data, apache-spark
Azure Event Hubs
☁️ Cloud-scale telemetry ingestion from any stream of data with Azure Event Hubs
Stars: ✭ 233 (-5.67%)
Mutual labels:  azure, spark, streaming
Spark States
Custom state store providers for Apache Spark
Stars: ✭ 83 (-66.4%)
Mutual labels:  spark, apache-spark, spark-streaming
Streaming Readings
Papers and readings related to streaming systems
Stars: ✭ 554 (+124.29%)
Kafka Ui
Open-Source Web GUI for Apache Kafka Management
Stars: ✭ 230 (-6.88%)
Mutual labels:  kafka, big-data, kafka-streams
Kafka Connect Hdfs
Kafka Connect HDFS connector
Stars: ✭ 400 (+61.94%)
Mutual labels:  kafka, big-data, streaming
Bigdata Notes
A beginner's guide to big data ⭐
Stars: ✭ 10,991 (+4349.8%)
Mutual labels:  kafka, spark, big-data
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-39.27%)
Mutual labels:  spark, big-data, apache-spark
leaflet heatmap
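A simple visualization of Huzhou call data. Assuming the data volume is too large to draw a heatmap directly in the browser, the heatmap-drawing step is moved offline for computation and analysis. The data is first computed in parallel with Apache Spark, the heatmap is then drawn with Apache Spark, and leafletjs loads the OpenStreetMap layer and the heatmap layer to give good interactivity. The drawing is currently implemented with Apache Spark; either Apache Spark is not well suited to this kind of computation or I did not design the algorithm well, because the parallel computation is slower than single-machine computation. The Apache Spark heatmap drawing and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .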
Stars: ✭ 13 (-94.74%)
Mutual labels:  big-data, spark, apache-spark
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+67.21%)
Mutual labels:  kafka, spark, apache-spark
aut
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-55.06%)
Mutual labels:  big-data, spark, apache-spark
Coolplayspark
Coolplay Spark: Spark source code analysis, Spark libraries, and more
Stars: ✭ 3,318 (+1243.32%)
Mutual labels:  spark, apache-spark, spark-streaming
Kafka Streams
equivalent to kafka-streams 🐙 for nodejs ✨🐢🚀✨
Stars: ✭ 613 (+148.18%)
Mutual labels:  kafka, big-data, kafka-streams
Go Streams
A lightweight stream processing library for Go
Stars: ✭ 615 (+148.99%)
Mutual labels:  kafka, kafka-streams, streaming-data
Spark Streaming With Kafka
Self-contained examples of Apache Spark streaming integrated with Apache Kafka.
Stars: ✭ 180 (-27.13%)
Mutual labels:  kafka, spark, spark-streaming
Bandar Log
Monitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (-92.31%)
Mutual labels:  kafka, big-data, spark-streaming
Kafka Storm Starter
Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Stars: ✭ 728 (+194.74%)
Mutual labels:  kafka, spark, apache-spark
Thingsboard
Open-source IoT Platform - Device management, data collection, processing and visualization.
Stars: ✭ 10,526 (+4161.54%)
Mutual labels:  kafka, spark, iot
Kafka Streams In Action
Source code for the Kafka Streams in Action Book
Stars: ✭ 167 (-32.39%)
Mutual labels:  kafka, streaming-data, streaming
Bigdata Notebook
Stars: ✭ 100 (-59.51%)
Mutual labels:  kafka, spark, streaming
Sparkrdma
RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark
Stars: ✭ 215 (-12.96%)
Mutual labels:  spark, big-data, apache-spark
Wirbelsturm
Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.
Stars: ✭ 332 (+34.41%)
Mutual labels:  kafka, spark, apache-spark
Streamx
kafka-connect-s3: Ingest data from Kafka to Object Stores (s3)
Stars: ✭ 96 (-61.13%)
Mutual labels:  kafka, big-data, streaming
Flink Learning
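Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink introduction, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, Table API & SQL, and more, plus case studies of large production Flink projects (PVUV, log storage, real-time deduplication at the ten-billion-record scale, monitoring and alerting). You are welcome to support my column "Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization".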
Stars: ✭ 11,378 (+4506.48%)
Mutual labels:  kafka, spark, streaming
Example Spark Kafka
Apache Spark and Apache Kafka integration example
Stars: ✭ 120 (-51.42%)
Mutual labels:  kafka, spark, spark-streaming
Flogo
Project Flogo is an open source ecosystem of opinionated event-driven capabilities to simplify building efficient & modern serverless functions, microservices & edge apps.
Stars: ✭ 1,891 (+665.59%)
Mutual labels:  iot, streaming
Sparkling Graph
SparklingGraph provides an easy-to-use set of features that let you process large-scale graphs using Spark and GraphX.
Stars: ✭ 139 (-43.72%)
Mutual labels:  spark, big-data
Aliyun Emapreduce Datasources
Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.
Stars: ✭ 132 (-46.56%)
Mutual labels:  kafka, spark
Eel Sdk
Big Data Toolkit for the JVM
Stars: ✭ 140 (-43.32%)
Mutual labels:  kafka, big-data
Kafka Tutorials
Kafka Tutorials microsite
Stars: ✭ 144 (-41.7%)
Mutual labels:  kafka, kafka-streams
Oryx
Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
Stars: ✭ 1,785 (+622.67%)
Mutual labels:  kafka, apache-spark
Mastering Spark Sql Book
The Internals of Spark SQL
Stars: ✭ 234 (-5.26%)
Mutual labels:  spark, apache-spark
Samsara
Samsara is a real-time analytics platform
Stars: ✭ 132 (-46.56%)
Mutual labels:  kafka, iot
Hydrograph
A visual ETL development and debugging tool for big data
Stars: ✭ 144 (-41.7%)
Mutual labels:  big-data, apache-spark
Technology Talk
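A collection of knowledge on commonly used technical frameworks in the Java ecosystem, open-source middleware, system architecture, databases, architecture case studies from large companies, common third-party libraries, project management, production troubleshooting, personal growth, and reflections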
Stars: ✭ 12,136 (+4813.36%)
Mutual labels:  kafka, spark
Supersafebank
Sample Event Sourcing implementation with .NET Core
Stars: ✭ 142 (-42.51%)
Mutual labels:  azure, kafka
A Kafka Story
Kafka ecosystem ... but step by step!
Stars: ✭ 148 (-40.08%)
Mutual labels:  kafka, kafka-streams
Azkarra Streams
🚀 Azkarra is a lightweight java framework to make it easy to develop, deploy and manage cloud-native streaming microservices based on Apache Kafka Streams.
Stars: ✭ 146 (-40.89%)
Mutual labels:  kafka, kafka-streams
Tributary
Streaming reactive and dataflow graphs in Python
Stars: ✭ 231 (-6.48%)
Mutual labels:  kafka, streaming
Pyspark Learning
Updated repository
Stars: ✭ 147 (-40.49%)
Mutual labels:  spark, spark-streaming
Netty Learning Example
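🥚 Netty hands-on learning examples: see the big picture through small details! Bring your heart and follow the tutorial. I believe you can do it.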
Stars: ✭ 2,146 (+768.83%)
Mutual labels:  kafka, iot
Redpanda
Redpanda is the real-time engine for modern apps. Kafka API Compatible; 10x faster 🚀 See more at vectorized.io/redpanda
Stars: ✭ 3,114 (+1160.73%)
Mutual labels:  kafka, streaming
Spark.jl
Julia binding for Apache Spark
Stars: ✭ 153 (-38.06%)
Mutual labels:  spark, big-data
Video Stream Analytics
Stars: ✭ 240 (-2.83%)
Mutual labels:  kafka, spark
Whylogs Java
Profile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-33.6%)
Mutual labels:  spark, apache-spark
Iot Traffic Monitor
Stars: ✭ 131 (-46.96%)
Mutual labels:  kafka, spark
Parquetviewer
Simple windows desktop application for viewing & querying Apache Parquet files
Stars: ✭ 145 (-41.3%)
Mutual labels:  big-data, apache-spark
Datasciencevm
Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Stars: ✭ 153 (-38.06%)
Mutual labels:  azure, big-data