HazelcastOpen-source distributed computation and storage platform
Stars: ✭ 4,662 (+440.21%)
ottlaAn opinionated clojure framework for writing kafka machines
Stars: ✭ 14 (-98.38%)
FaustPython Stream Processing
Stars: ✭ 5,899 (+583.55%)
spark-utilsBasic framework utilities to quickly start writing production ready Apache Spark applications
Stars: ✭ 25 (-97.1%)
BistroA general-purpose data analysis engine radically changing the way batch and stream data is processed
Stars: ✭ 333 (-61.41%)
artmlARTML- Real time learning
Stars: ✭ 20 (-97.68%)
bandar-logMonitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 20 (-97.68%)
CdapAn open source framework for building data analytic applications.
Stars: ✭ 509 (-41.02%)
transform-hubFlexible and efficient data processing engine and an evolution of the popular Scramjet Framework based on node.js. Our Transform Hub was designed specifically for data processing and has its own unique algorithms included.
Stars: ✭ 38 (-95.6%)
GearpumpLightweight real-time big data streaming engine over Akka
Stars: ✭ 745 (-13.67%)
dspatchThe Refreshingly Simple Cross-Platform C++ Dataflow / Pipelining / Stream Processing / Reactive Programming Framework
Stars: ✭ 124 (-85.63%)
LearningsparkScala examples for learning to use Spark
Stars: ✭ 421 (-51.22%)
trafficMassively real-time traffic streaming application
Stars: ✭ 25 (-97.1%)
Json MachineEfficient, easy-to-use, and fast PHP JSON stream parser
Stars: ✭ 376 (-56.43%)
mageMAGE - Memgraph Advanced Graph Extensions 🔮
Stars: ✭ 89 (-89.69%)
Go StreamsA lightweight stream processing library for Go
Stars: ✭ 615 (-28.74%)
talariaTalariaDB is a distributed, highly available, and low latency time-series database for Presto
Stars: ✭ 148 (-82.85%)
Yomo🦖 Streaming-Serverless Framework for Low-latency Edge Computing applications, running atop QUIC protocol, engaging 5G technology.
Stars: ✭ 279 (-67.67%)
awesome-bigdataA curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 11,093 (+1185.4%)
keralaDistributed KV Streams
Stars: ✭ 16 (-98.15%)
SpartaReal Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (-40.56%)
JPStreamJPStream: JSONPath Stream Processing in Parallel
Stars: ✭ 19 (-97.8%)
Spring Cloud DataflowA microservices-based Streaming and Batch data processing in Cloud Foundry and Kubernetes
Stars: ✭ 753 (-12.75%)
Stream JsonThe micro-library of Node.js stream components for creating custom JSON processing pipelines with a minimal memory footprint. It can parse JSON files far exceeding available memory streaming individual primitives using a SAX-inspired API.
Stars: ✭ 462 (-46.47%)
theodoliteTheodolite is a framework for benchmarking the horizontal and vertical scalability of cloud-native applications.
Stars: ✭ 20 (-97.68%)
MobiusC# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (+7.65%)
KsqlThe database purpose-built for stream processing applications.
Stars: ✭ 4,668 (+440.9%)
litemall-dw基于开源Litemall电商项目的大数据项目,包含前端埋点(openresty+lua)、后端埋点;数据仓库(五层)、实时计算和用户画像。大数据平台采用CDH6.3.2(已使用vagrant+ansible脚本化),同时也包含了Azkaban的workflow。
Stars: ✭ 36 (-95.83%)
NYC Taxi PipelineDesign/Implement stream/batch architecture on NYC taxi data | #DE
Stars: ✭ 16 (-98.15%)
KasperKasper is a lightweight library for processing Kafka topics.
Stars: ✭ 413 (-52.14%)
swirlHigh-Performance Erlang Stream Processor
Stars: ✭ 52 (-93.97%)
Hazelcast JetDistributed Stream and Batch Processing
Stars: ✭ 855 (-0.93%)
stream-registryStream Discovery and Stream Orchestration
Stars: ✭ 105 (-87.83%)
Awesome System DesignA curated list of awesome System Design (A.K.A. Distributed Systems) resources.
Stars: ✭ 4,999 (+479.26%)
EsperIoTSmall and simple stream-based CEP tool for IoT devices connected to an MQTT broker
Stars: ✭ 18 (-97.91%)
AutomiA stream processing API for Go (alpha)
Stars: ✭ 617 (-28.51%)
storm-mlan online learning algorithm library for Storm
Stars: ✭ 18 (-97.91%)
SylphStream computing platform for bigdata
Stars: ✭ 362 (-58.05%)
cassandra.realtimeDifferent ways to process data into Cassandra in realtime with technologies such as Kafka, Spark, Akka, Flink
Stars: ✭ 25 (-97.1%)
Bandar LogMonitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (-97.8%)
godsendA simple and eloquent workflow for streaming messages to micro-services.
Stars: ✭ 15 (-98.26%)
Coolplayspark酷玩 Spark: Spark 源代码解析、Spark 类库等
Stars: ✭ 3,318 (+284.47%)
rippleSimple shared surface streaming application
Stars: ✭ 17 (-98.03%)
Kafka Streamsequivalent to kafka-streams 🐙 for nodejs ✨🐢🚀✨
Stars: ✭ 613 (-28.97%)
Tuna🐟 A streaming ETL for fish
Stars: ✭ 11 (-98.73%)
VectorA reliable, high-performance tool for building observability data pipelines.
Stars: ✭ 8,736 (+912.28%)
AngelA Flexible and Powerful Parameter Server for large-scale machine learning
Stars: ✭ 6,458 (+648.32%)
SmooksAn extensible Java framework for building XML and non-XML streaming applications
Stars: ✭ 293 (-66.05%)
gostreamStream Processing Library for Go
Stars: ✭ 51 (-94.09%)