RatatoolA tool for data sampling, data generation, and data diffing
Stars: ✭ 279 (+830%)
Mutual labels: avro, parquet
DaFlowApache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (-20%)
Mutual labels: avro, parquet
columnifyMake record oriented data to columnar format.
Stars: ✭ 28 (-6.67%)
Mutual labels: avro, parquet
Gcs ToolsGCS support for avro-tools, parquet-tools and protobuf
Stars: ✭ 57 (+90%)
Mutual labels: avro, parquet
Bigdata PlaygroundA complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (+490%)
Mutual labels: avro, parquet
ChoetlETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Stars: ✭ 372 (+1140%)
Mutual labels: avro, parquet
IcebergIceberg is a table format for large, slow-moving tabular data
Stars: ✭ 393 (+1210%)
Mutual labels: avro, parquet
Vscode Data PreviewData Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files
Stars: ✭ 245 (+716.67%)
Mutual labels: avro, parquet
Devops Python Tools80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+1253.33%)
Mutual labels: avro, parquet
Bigdata File ViewerA cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Stars: ✭ 86 (+186.67%)
Mutual labels: avro, parquet
Rumble⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (+93.33%)
Mutual labels: avro, parquet
SchemerSchema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (+223.33%)
Mutual labels: avro, parquet
sbt-avroPlugin SBT to Generate Scala classes from Apache Avro schemas hosted on a remote Confluent Schema Registry.
Stars: ✭ 15 (-50%)
Mutual labels: avro
KafkactlCommand Line Tool for managing Apache Kafka
Stars: ✭ 177 (+490%)
Mutual labels: avro
Gradle Avro PluginA Gradle plugin to allow easily performing Java code generation for Apache Avro. It supports JSON schema declaration files, JSON protocol declaration files, and Avro IDL files.
Stars: ✭ 176 (+486.67%)
Mutual labels: avro
qsvCSVs sliced, diced & analyzed.
Stars: ✭ 438 (+1360%)
Mutual labels: parquet
avro-serde-phpAvro Serialisation/Deserialisation (SerDe) library for PHP 7.3+ & 8.0 with a Symfony Serializer integration
Stars: ✭ 43 (+43.33%)
Mutual labels: avro
Mongo KafkaMongoDB Kafka Connector
Stars: ✭ 166 (+453.33%)
Mutual labels: avro
AvroApache Avro is a data serialization system.
Stars: ✭ 2,005 (+6583.33%)
Mutual labels: avro