fdp-modelserverAn umbrella project for multiple implementations of model serving
Stars: ✭ 47 (+27.03%)
StreamlineStreamLine - Streaming Analytics
Stars: ✭ 151 (+308.11%)
RegistrySchema Registry
Stars: ✭ 184 (+397.3%)
KsppA high performance/ real-time C++ Kafka streams framework (C++17)
Stars: ✭ 80 (+116.22%)
Coolplayspark酷玩 Spark: Spark 源代码解析、Spark 类库等
Stars: ✭ 3,318 (+8867.57%)
SANSA-StackBig Data RDF Processing and Analytics Stack built on Apache Spark and Apache Jena http://sansa-stack.github.io/SANSA-Stack/
Stars: ✭ 130 (+251.35%)
football-eventsEvent-Driven microservices with Kafka Streams
Stars: ✭ 57 (+54.05%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+4916.22%)
Flink Sql CookbookThe Apache Flink SQL Cookbook is a curated collection of examples, patterns, and use cases of Apache Flink SQL. Many of the recipes are completely self-contained and can be run in Ververica Platform as is.
Stars: ✭ 189 (+410.81%)
Data AcceleratorData Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (+567.57%)
WormholeWormhole is a SPaaS (Stream Processing as a Service) Platform
Stars: ✭ 863 (+2232.43%)
daggerDagger is an easy-to-use, configuration over code, cloud-native framework built on top of Apache Flink for stateful processing of real-time streaming data.
Stars: ✭ 238 (+543.24%)
Flink Learningflink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Stars: ✭ 11,378 (+30651.35%)
LogislandScalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (+162.16%)
litemall-dw基于开源Litemall电商项目的大数据项目,包含前端埋点(openresty+lua)、后端埋点;数据仓库(五层)、实时计算和用户画像。大数据平台采用CDH6.3.2(已使用vagrant+ansible脚本化),同时也包含了Azkaban的workflow。
Stars: ✭ 36 (-2.7%)
Go StreamsA lightweight stream processing library for Go
Stars: ✭ 615 (+1562.16%)
product-spAn open source, cloud-native streaming data integration and analytics product optimized for agile digital businesses
Stars: ✭ 80 (+116.22%)
FaustPython Stream Processing
Stars: ✭ 5,899 (+15843.24%)
ExamplesDemo applications and code examples for Confluent Platform and Apache Kafka
Stars: ✭ 571 (+1443.24%)
Pulsar FlinkElastic data processing with Apache Pulsar and Apache Flink
Stars: ✭ 126 (+240.54%)
HazelcastOpen-source distributed computation and storage platform
Stars: ✭ 4,662 (+12500%)
Kafka Streamsequivalent to kafka-streams 🐙 for nodejs ✨🐢🚀✨
Stars: ✭ 613 (+1556.76%)
cassandra.realtimeDifferent ways to process data into Cassandra in realtime with technologies such as Kafka, Spark, Akka, Flink
Stars: ✭ 25 (-32.43%)
SylphStream computing platform for bigdata
Stars: ✭ 362 (+878.38%)
Pulsar SparkWhen Apache Pulsar meets Apache Spark
Stars: ✭ 55 (+48.65%)
rippleSimple shared surface streaming application
Stars: ✭ 17 (-54.05%)
AmadeusHarmonious distributed data analysis in Rust.
Stars: ✭ 240 (+548.65%)
yakutSimple CLI tool for diagnostics and debugging of Cyphal networks
Stars: ✭ 29 (-21.62%)
openPDCOpen Source Phasor Data Concentrator
Stars: ✭ 109 (+194.59%)
CrowdFlowOptical Flow Dataset and Benchmark for Visual Crowd Analysis
Stars: ✭ 87 (+135.14%)
spStream Processors on Kafka in Golang
Stars: ✭ 29 (-21.62%)
TiBigDataTiDB connectors for Flink/Hive/Presto
Stars: ✭ 192 (+418.92%)
flink-deployerA tool that help automate deployment to an Apache Flink cluster
Stars: ✭ 143 (+286.49%)
dislibThe Distributed Computing library for python implemented using PyCOMPSs programming model for HPC.
Stars: ✭ 39 (+5.41%)
Elkeid-HUBElkeid HUB is a rule/event processing engine maintained by the Elkeid Team that supports streaming/offline (not yet supported by the community edition) data processing. The original intention is to solve complex data/event processing and external system linkage requirements through standardized rules.
Stars: ✭ 62 (+67.57%)
Theano-MPIMPI Parallel framework for training deep learning models built in Theano
Stars: ✭ 55 (+48.65%)
dlinkDinky is an out of the box one-stop real-time computing platform dedicated to the construction and practice of Unified Streaming & Batch and Unified Data Lake & Data Warehouse. Based on Apache Flink, Dinky provides the ability to connect many big data frameworks including OLAP and Data Lake.
Stars: ✭ 1,535 (+4048.65%)
flink-connector-kudu基于Apache-bahir-kudu-connector的flink-connector-kudu,支持Flink1.11.x DynamicTableSource/Sink,支持Range分区等
Stars: ✭ 40 (+8.11%)
dpkb大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse
Stars: ✭ 123 (+232.43%)
fink-brokerAstronomy Broker based on Apache Spark
Stars: ✭ 18 (-51.35%)
flink-clientJava library for managing Apache Flink via the Monitoring REST API
Stars: ✭ 48 (+29.73%)
Spark ALS基于spark-ml,spark-mllib,spark-streaming的推荐算法实现
Stars: ✭ 89 (+140.54%)
kafka-scala-examplesExamples of Avro, Kafka, Schema Registry, Kafka Streams, Interactive Queries, KSQL, Kafka Connect in Scala
Stars: ✭ 53 (+43.24%)
fahclientDockerized Folding@home client with NVIDIA GPU support to help battle COVID-19
Stars: ✭ 38 (+2.7%)
distogramA library to compute histograms on distributed environments, on streaming data
Stars: ✭ 19 (-48.65%)
spark-gdeltBinding the GDELT universe in a Spark environment
Stars: ✭ 20 (-45.95%)
fleexFleex makes it easy to create multiple VPS on cloud providers and use them to distribute workloads.
Stars: ✭ 181 (+389.19%)
logparserEasy parsing of Apache HTTPD and NGINX access logs with Java, Hadoop, Hive, Pig, Flink, Beam, Storm, Drill, ...
Stars: ✭ 139 (+275.68%)
Tweet-Analysis-With-Kafka-and-SparkA real time analytics dashboard to analyze the trending hashtags and @ mentions at any location using kafka and spark streaming.
Stars: ✭ 18 (-51.35%)