KebsScala library to eliminate boilerplate
Stars: ✭ 113 (+88.33%)
Avro Hadoop StarterExample MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Stars: ✭ 110 (+83.33%)
Schema RegistryA CLI and Go client for Kafka Schema Registry
Stars: ✭ 105 (+75%)
SchemerSchema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (+61.67%)
MagnolifyA collection of Magnolia add-on modules
Stars: ✭ 81 (+35%)
Kaufmann exKafka backed service library.
Stars: ✭ 86 (+43.33%)
Avro4kAvro support for kotlinx.serialization
Stars: ✭ 82 (+36.67%)
Bigdata File ViewerA cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Stars: ✭ 86 (+43.33%)
Open Bank MarkA bank simulation application using mainly Clojure, which can be used to end-to-end test and show some graphs.
Stars: ✭ 81 (+35%)
Avro BuilderRuby DSL to create Avro schemas
Stars: ✭ 82 (+36.67%)
Rumble⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (-3.33%)
Gcs ToolsGCS support for avro-tools, parquet-tools and protobuf
Stars: ✭ 57 (-5%)
Dcos MetricsThe metrics pipeline for DC/OS 1.9-1.11
Stars: ✭ 57 (-5%)
ExamplesDemo applications and code examples for Confluent Platform and Apache Kafka
Stars: ✭ 571 (+851.67%)
Go Kafka AvroA library provides consumer/producer to work with kafka, avro and schema registry
Stars: ✭ 39 (-35%)
AvrocadoAvrocado is a convenience library to handle Avro in Golang
Stars: ✭ 21 (-65%)
AvscAvro for JavaScript ⚡️
Stars: ✭ 930 (+1450%)
Aptos☀️ Avro, Protobuf, Thrift on Swagger
Stars: ✭ 17 (-71.67%)
Kafka Storm StarterCode examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Stars: ✭ 728 (+1113.33%)
Pmacctpmacct is a small set of multi-purpose passive network monitoring tools [NetFlow IPFIX sFlow libpcap BGP BMP RPKI IGP Streaming Telemetry].
Stars: ✭ 677 (+1028.33%)
Avro4sAvro schema generation and serialization / deserialization for Scala
Stars: ✭ 593 (+888.33%)
Cpp SerializersBenchmark comparing various data serialization libraries (thrift, protobuf etc.) for C++
Stars: ✭ 533 (+788.33%)
Devops Python Tools80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+576.67%)
IcebergIceberg is a table format for large, slow-moving tabular data
Stars: ✭ 393 (+555%)
ChoetlETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Stars: ✭ 372 (+520%)
RatatoolA tool for data sampling, data generation, and data diffing
Stars: ✭ 279 (+365%)
qweryA SQL-like language for performing ETL transformations.
Stars: ✭ 28 (-53.33%)
avro-schema-generatorLibrary for generating avro schema files (.avsc) based on DB tables structure
Stars: ✭ 38 (-36.67%)
AvroConvertApache Avro serializer for .NET
Stars: ✭ 44 (-26.67%)
Kafka RestConfluent REST Proxy for Kafka
Stars: ✭ 1,863 (+3005%)
Fast Data DevKafka Docker for development. Kafka, Zookeeper, Schema Registry, Kafka-Connect, Landoop Tools, 20+ connectors
Stars: ✭ 1,707 (+2745%)