parquet-extraA collection of Apache Parquet add-on modules
Stars: ✭ 30 (+3.45%)
Vscode Data PreviewData Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files
Stars: ✭ 245 (+744.83%)
NoprotoFlexible, Fast & Compact Serialization with RPC
Stars: ✭ 138 (+375.86%)
Rumble⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (+100%)
Bigdata PlaygroundA complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (+510.34%)
Gcs ToolsGCS support for avro-tools, parquet-tools and protobuf
Stars: ✭ 57 (+96.55%)
Bigdata File ViewerA cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Stars: ✭ 86 (+196.55%)
Devops Python Tools80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+1300%)
Mu HaskellMu (μ) is a purely functional framework for building micro services.
Stars: ✭ 215 (+641.38%)
RatatoolA tool for data sampling, data generation, and data diffing
Stars: ✭ 279 (+862.07%)
DaFlowApache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (-17.24%)
ChoetlETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Stars: ✭ 372 (+1182.76%)
SchemerSchema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (+234.48%)
columnifyMake record oriented data to columnar format.
Stars: ✭ 28 (-3.45%)
PucketBucketing and partitioning system for Parquet
Stars: ✭ 29 (+0%)
IcebergIceberg is a table format for large, slow-moving tabular data
Stars: ✭ 393 (+1255.17%)
Cpp SerializersBenchmark comparing various data serialization libraries (thrift, protobuf etc.) for C++
Stars: ✭ 533 (+1737.93%)
Kafka Connect Mongodb**Unofficial / Community** Kafka Connect MongoDB Sink Connector - Find the official MongoDB Kafka Connector here: https://www.mongodb.com/kafka-connector
Stars: ✭ 137 (+372.41%)
sbt-avroPlugin SBT to Generate Scala classes from Apache Avro schemas hosted on a remote Confluent Schema Registry.
Stars: ✭ 15 (-48.28%)
AvroA fast Go Avro codec
Stars: ✭ 132 (+355.17%)
SlimmessagebusLightweight message bus interface for .NET (pub/sub and request-response) with transport plugins for popular message brokers.
Stars: ✭ 120 (+313.79%)
protobluffA modular Protocol Buffers implementation for C
Stars: ✭ 66 (+127.59%)
openmrs-fhir-analyticsA collection of tools for extracting FHIR resources and analytics services on top of that data.
Stars: ✭ 55 (+89.66%)
Avro Hadoop StarterExample MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Stars: ✭ 110 (+279.31%)
Schema RegistryA CLI and Go client for Kafka Schema Registry
Stars: ✭ 105 (+262.07%)
schema-registry-php-clientA PHP 7.3+ API client for the Confluent Schema Registry REST API based on Guzzle 6 - http://docs.confluent.io/current/schema-registry/docs/index.html
Stars: ✭ 40 (+37.93%)
MagnolifyA collection of Magnolia add-on modules
Stars: ✭ 81 (+179.31%)
rules proto grpcBazel rules for building Protobuf and gRPC code and libraries from proto_library targets
Stars: ✭ 201 (+593.1%)
RqRecord Query - A tool for doing record analysis and transformation
Stars: ✭ 1,808 (+6134.48%)
kafka-scala-examplesExamples of Avro, Kafka, Schema Registry, Kafka Streams, Interactive Queries, KSQL, Kafka Connect in Scala
Stars: ✭ 53 (+82.76%)
AbrisAvro SerDe for Apache Spark structured APIs.
Stars: ✭ 130 (+348.28%)
avro-serde-phpAvro Serialisation/Deserialisation (SerDe) library for PHP 7.3+ & 8.0 with a Symfony Serializer integration
Stars: ✭ 43 (+48.28%)
KebsScala library to eliminate boilerplate
Stars: ✭ 113 (+289.66%)
qsvCSVs sliced, diced & analyzed.
Stars: ✭ 438 (+1410.34%)
Schema RegistryConfluent Schema Registry for Kafka
Stars: ✭ 1,647 (+5579.31%)
rasterA micro server framework, support coroutine, and parallel-computing, used for building flatbuffers/thrift/protobuf/http protocol service.
Stars: ✭ 19 (-34.48%)
j2cl-protobufProtocol Buffers implementation for J2CL
Stars: ✭ 23 (-20.69%)
Kaufmann exKafka backed service library.
Stars: ✭ 86 (+196.55%)
Avro4kAvro support for kotlinx.serialization
Stars: ✭ 82 (+182.76%)
miniparquetLibrary to read a subset of Parquet files
Stars: ✭ 38 (+31.03%)
StoragetapperStorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
Stars: ✭ 232 (+700%)
Open Bank MarkA bank simulation application using mainly Clojure, which can be used to end-to-end test and show some graphs.
Stars: ✭ 81 (+179.31%)
vscode-bufVisual Studio Code integration for Buf.
Stars: ✭ 40 (+37.93%)
Jackson Dataformats BinaryUber-project for standard Jackson binary format backends: avro, cbor, ion, protobuf, smile
Stars: ✭ 221 (+662.07%)
Avro BuilderRuby DSL to create Avro schemas
Stars: ✭ 82 (+182.76%)
Dcos MetricsThe metrics pipeline for DC/OS 1.9-1.11
Stars: ✭ 57 (+96.55%)
ExamplesDemo applications and code examples for Confluent Platform and Apache Kafka
Stars: ✭ 571 (+1868.97%)
KafkactlCommand Line Tool for managing Apache Kafka
Stars: ✭ 177 (+510.34%)
Go Kafka AvroA library provides consumer/producer to work with kafka, avro and schema registry
Stars: ✭ 39 (+34.48%)
thrift-parserA Thrift Parser built in TypeScript that generates a TypeScript AST that retains the Thrift grammar
Stars: ✭ 84 (+189.66%)
AvrocadoAvrocado is a convenience library to handle Avro in Golang
Stars: ✭ 21 (-27.59%)
AvscAvro for JavaScript ⚡️
Stars: ✭ 930 (+3106.9%)
Aptos☀️ Avro, Protobuf, Thrift on Swagger
Stars: ✭ 17 (-41.38%)
Gradle Avro PluginA Gradle plugin to allow easily performing Java code generation for Apache Avro. It supports JSON schema declaration files, JSON protocol declaration files, and Avro IDL files.
Stars: ✭ 176 (+506.9%)
Kafka Storm StarterCode examples that show to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Stars: ✭ 728 (+2410.34%)
Pmacctpmacct is a small set of multi-purpose passive network monitoring tools [NetFlow IPFIX sFlow libpcap BGP BMP RPKI IGP Streaming Telemetry].
Stars: ✭ 677 (+2234.48%)