Fast Data DevKafka Docker for development. Kafka, Zookeeper, Schema Registry, Kafka-Connect, Landoop Tools, 20+ connectors
firehoseFirehose is an extensible, no-code, and cloud-native service to load real-time streaming data from Kafka to data stores, data lakes, and analytical storage systems.
beneathBeneath is a serverless real-time data platform ⚡️
noronhaDataOps framework for Machine Learning projects.
chartsThis repository is home to the original helm charts for products throughout the open data platform ecosystem.
lenses-goLenses.io CLI (command-line interface)
datatileA library for managing, validating, summarizing, and visualizing data.
vulknLove your Data. Love the Environment. Love VULKИ.
shieldShield is a role-based cloud-native user management system, identity & access proxy, and authorization server for your applications and API endpoints.
versatile-data-kitVersatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
sirenSiren provides an easy-to-use universal alert, notification, channels management framework for the entire observability infrastructure.
cliPolyaxon Core Client & CLI to streamline MLOps
guardianGuardian is a tool for extensible and universal data access with automated access workflows and security controls across data stores, analytical systems, and cloud products.
daggerDagger is an easy-to-use, configuration over code, cloud-native framework built on top of Apache Flink for stateful processing of real-time streaming data.
raccoonRaccoon is a high-throughput, low-latency service to collect events in real-time from your web, mobile apps, and services using multiple network protocols.