Awesome PulsarA curated list of Pulsar tools, integrations and resources.
Stars: ✭ 57 (-76.92%)
River🌊 Online machine learning in Python
Stars: ✭ 2,980 (+1106.48%)
CmakCMAK is a tool for managing Apache Kafka clusters
Stars: ✭ 10,544 (+4168.83%)
Pulsar SparkWhen Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-77.73%)
UsqlU-SQL Examples and Issue Tracking
Stars: ✭ 221 (-10.53%)
IotdbApache IoTDB
Stars: ✭ 1,221 (+394.33%)
CuesheetA framework for writing Spark 2.x applications in a pretty way
Stars: ✭ 86 (-65.18%)
Utils4sscala、spark使用过程中,各种测试用例以及相关资料整理
Stars: ✭ 1,070 (+333.2%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+441.7%)
Kinesis SqlKinesis Connector for Structured Streaming
Stars: ✭ 120 (-51.42%)
SlimmessagebusLightweight message bus interface for .NET (pub/sub and request-response) with transport plugins for popular message brokers.
Stars: ✭ 120 (-51.42%)
Griffon VmGriffon Data Science Virtual Machine
Stars: ✭ 128 (-48.18%)
WillaA Clojure DSL for Kafka Streams
Stars: ✭ 97 (-60.73%)
SplashSplash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-57.49%)
Awesome Azure IotA curated list of awesome Azure Internet of Things projects and resources.
Stars: ✭ 104 (-57.89%)
WaterdropProduction Ready Data Integration Product, documentation:
Stars: ✭ 1,856 (+651.42%)
Awesome KafkaA collection of kafka-resources
Stars: ✭ 116 (-53.04%)
BigdataclassTwo-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-55.47%)
SamsaraSamsara is a real-time analytics platform
Stars: ✭ 132 (-46.56%)
TeddySpark Streaming监控平台,支持任务部署与告警、自启动
Stars: ✭ 120 (-51.42%)
Spark NkpNatural Korean Processor for Apache Spark
Stars: ✭ 50 (-79.76%)
FlogoProject Flogo is an open source ecosystem of opinionated event-driven capabilities to simplify building efficient & modern serverless functions, microservices & edge apps.
Stars: ✭ 1,891 (+665.59%)
AbrisAvro SerDe for Apache Spark structured APIs.
Stars: ✭ 130 (-47.37%)
HydrographA visual ETL development and debugging tool for big data
Stars: ✭ 144 (-41.7%)
Sparkling GraphSparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.
Stars: ✭ 139 (-43.72%)
OryxOryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning
Stars: ✭ 1,785 (+622.67%)
Technology Talk汇总java生态圈常用技术框架、开源中间件,系统架构、数据库、大公司架构案例、常用三方类库、项目管理、线上问题排查、个人成长、思考等知识
Stars: ✭ 12,136 (+4813.36%)
Eel SdkBig Data Toolkit for the JVM
Stars: ✭ 140 (-43.32%)
GafferA large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+564.78%)
Azkarra Streams🚀 Azkarra is a lightweight java framework to make it easy to develop, deploy and manage cloud-native streaming microservices based on Apache Kafka Streams.
Stars: ✭ 146 (-40.89%)
A Kafka StoryKafka ecosystem ... but step by step!
Stars: ✭ 148 (-40.08%)
ParquetviewerSimple windows desktop application for viewing & querying Apache Parquet files
Stars: ✭ 145 (-41.3%)
RedpandaRedpanda is the real-time engine for modern apps. Kafka API Compatible; 10x faster 🚀 See more at vectorized.io/redpanda
Stars: ✭ 3,114 (+1160.73%)
Spark.jlJulia binding for Apache Spark
Stars: ✭ 153 (-38.06%)
GeniA Clojure dataframe library that runs on Spark
Stars: ✭ 152 (-38.46%)
AzuredatalakeSamples and Docs for Azure Data Lake Store and Analytics
Stars: ✭ 128 (-48.18%)
SupersafebankSample Event Sourcing implementation with .NET Core
Stars: ✭ 142 (-42.51%)
DatasciencevmTools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Stars: ✭ 153 (-38.06%)
Whylogs JavaProfile and monitor your ML data pipeline end-to-end
Stars: ✭ 164 (-33.6%)
MaterializeMaterialize lets you ask questions of your live data, which it answers and then maintains for you as your data continue to change. The moment you need a refreshed answer, you can get it in milliseconds. Materialize is designed to help you interactively explore your streaming data, perform data warehousing analytics against live relational data, or just increase the freshness and reduce the load of your dashboard and monitoring tasks.
Stars: ✭ 3,341 (+1252.63%)
Smart openUtils for streaming large files (S3, HDFS, gzip, bz2...)
Stars: ✭ 2,306 (+833.6%)
Sparkstreaming💥 🚀 封装sparkstreaming动态调节batch time(有数据就执行计算);🚀 支持运行过程中增删topic;🚀 封装sparkstreaming 1.6 - kafka 010 用以支持 SSL。
Stars: ✭ 179 (-27.53%)
RegistrySchema Registry
Stars: ✭ 184 (-25.51%)
MockedstreamsScala DSL for Unit-Testing Processing Topologies in Kafka Streams
Stars: ✭ 184 (-25.51%)