All Projects → nevillelyh → parquet-extra

nevillelyh / parquet-extra

Licence: Apache-2.0 license
A collection of Apache Parquet add-on modules

Programming Languages

scala
5932 projects
java
68154 projects - #9 most used programming language

Projects that are alternatives of or similar to parquet-extra

Ratatool
A tool for data sampling, data generation, and data diffing
Stars: ✭ 279 (+830%)
Mutual labels:  avro, parquet
parquet-flinktacular
How to use Parquet in Flink
Stars: ✭ 29 (-3.33%)
Mutual labels:  avro, parquet
DaFlow
Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (-20%)
Mutual labels:  avro, parquet
columnify
Make record oriented data to columnar format.
Stars: ✭ 28 (-6.67%)
Mutual labels:  avro, parquet
Gcs Tools
GCS support for avro-tools, parquet-tools and protobuf
Stars: ✭ 57 (+90%)
Mutual labels:  avro, parquet
Bigdata Playground
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (+490%)
Mutual labels:  avro, parquet
Choetl
ETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Stars: ✭ 372 (+1140%)
Mutual labels:  avro, parquet
Iceberg
Iceberg is a table format for large, slow-moving tabular data
Stars: ✭ 393 (+1210%)
Mutual labels:  avro, parquet
Vscode Data Preview
Data Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files
Stars: ✭ 245 (+716.67%)
Mutual labels:  avro, parquet
Devops Python Tools
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+1253.33%)
Mutual labels:  avro, parquet
Bigdata File Viewer
A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Stars: ✭ 86 (+186.67%)
Mutual labels:  avro, parquet
Rumble
⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (+93.33%)
Mutual labels:  avro, parquet
Schemer
Schema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (+223.33%)
Mutual labels:  avro, parquet
sbt-avro
Plugin SBT to Generate Scala classes from Apache Avro schemas hosted on a remote Confluent Schema Registry.
Stars: ✭ 15 (-50%)
Mutual labels:  avro
Kafkactl
Command Line Tool for managing Apache Kafka
Stars: ✭ 177 (+490%)
Mutual labels:  avro
Gradle Avro Plugin
A Gradle plugin to allow easily performing Java code generation for Apache Avro. It supports JSON schema declaration files, JSON protocol declaration files, and Avro IDL files.
Stars: ✭ 176 (+486.67%)
Mutual labels:  avro
qsv
CSVs sliced, diced & analyzed.
Stars: ✭ 438 (+1360%)
Mutual labels:  parquet
avro-serde-php
Avro Serialisation/Deserialisation (SerDe) library for PHP 7.3+ & 8.0 with a Symfony Serializer integration
Stars: ✭ 43 (+43.33%)
Mutual labels:  avro
Mongo Kafka
MongoDB Kafka Connector
Stars: ✭ 166 (+453.33%)
Mutual labels:  avro
Avro
Apache Avro is a data serialization system.
Stars: ✭ 2,005 (+6583.33%)
Mutual labels:  avro

parquet-extra

Build Status codecov.io GitHub license Maven Central Scala Steward badge

A collection of Apache Parquet add-on modules.

  • parquet-avro - Scala macros for generating column projections and filter predicates from lambda functions.
  • parquet-tensorflow - TensorFlow Example read/write support.

License

Copyright 2019 Neville Li.

Licensed under the Apache License, Version 2.0: http://www.apache.org/licenses/LICENSE-2.0

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].