Vscode Data Preview
Data Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files
StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Mu (μ) is a purely functional framework for building micro services.
Command Line Tool for managing Apache Kafka
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Gradle Avro Plugin
A Gradle plugin to allow easily performing Java code generation for Apache Avro. It supports JSON schema declaration files, JSON protocol declaration files, and Avro IDL files.
Apache Avro is a data serialization system.
☀️ A tool for validating data using JSON Schema and converting JSON Schema documents into different data-interchange formats
Flexible, Fast & Compact Serialization with RPC
Kafka Connect Mongodb
**Unofficial / Community** Kafka Connect MongoDB Sink Connector - Find the official MongoDB Kafka Connector here: https://www.mongodb.com/kafka-connector
Record Query - A tool for doing record analysis and transformation
Avro SerDe for Apache Spark structured APIs.
Lightweight message bus interface for .NET (pub/sub and request-response) with transport plugins for popular message brokers.
Scala library to eliminate boilerplate
Avro Hadoop Starter
Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
A collection of Magnolia add-on modules
Avro support for kotlinx.serialization
Bigdata File Viewer
A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Open Bank Mark
A bank simulation application using mainly Clojure, which can be used to end-to-end test and show some graphs.
⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
GCS support for avro-tools, parquet-tools and protobuf
Demo applications and code examples for Confluent Platform and Apache Kafka
Go Kafka Avro
A library provides consumer/producer to work with kafka, avro and schema registry
Avrocado is a convenience library to handle Avro in Golang
☀️ Avro, Protobuf, Thrift on Swagger
Kafka Storm Starter
Code examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
pmacct is a small set of multi-purpose passive network monitoring tools [NetFlow IPFIX sFlow libpcap BGP BMP RPKI IGP Streaming Telemetry].
Avro schema generation and serialization / deserialization for Scala
Benchmark comparing various data serialization libraries (thrift, protobuf etc.) for C++
Devops Python Tools
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Iceberg is a table format for large, slow-moving tabular data
ETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
A tool for data sampling, data generation, and data diffing
A SQL-like language for performing ETL transformations.
Goal: create a Spring Boot application that handles users using Event Sourcing. So, whenever a user is created, updated, or deleted, an event informing this change is sent to Kafka. Also, we will implement another application that listens to those events and saves them in Cassandra. Finally, we will use Testcontainers for integration testing.
A convenient Elixir library to work with Avro schemas and Confluent® Schema Registry
A client UI to inspect Kafka topics, consume, produce and much more
Goavro translates between Go native types and binary and textual Avro data
An Avro Library that emphasizes testability and ease of use.