Gcs ToolsGCS support for avro-tools, parquet-tools and protobuf
Stars: ✭ 57 (-79.57%)
MagnolifyA collection of Magnolia add-on modules
Stars: ✭ 81 (-70.97%)
Rumble⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (-79.21%)
Bigdata File ViewerA cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Stars: ✭ 86 (-69.18%)
Vscode Data PreviewData Preview 🈸 extension for importing 📤 viewing 🔎 slicing 🔪 dicing 🎲 charting 📊 & exporting 📥 large JSON array/config, YAML, Apache Arrow, Avro, Parquet & Excel data files
Stars: ✭ 245 (-12.19%)
Jackson Dataformats BinaryUber-project for standard Jackson binary format backends: avro, cbor, ion, protobuf, smile
Stars: ✭ 221 (-20.79%)
RqRecord Query - A tool for doing record analysis and transformation
Stars: ✭ 1,808 (+548.03%)
IcebergIceberg is a table format for large, slow-moving tabular data
Stars: ✭ 393 (+40.86%)
Bigdata PlaygroundA complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (-36.56%)
Devops Python Tools80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (+45.52%)
Cpp SerializersBenchmark comparing various data serialization libraries (thrift, protobuf etc.) for C++
Stars: ✭ 533 (+91.04%)
SchemerSchema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (-65.23%)
ChoetlETL Framework for .NET / c# (Parser / Writer for CSV, Flat, Xml, JSON, Key-Value, Parquet, Yaml, Avro formatted files)
Stars: ✭ 372 (+33.33%)
parquet-extraA collection of Apache Parquet add-on modules
Stars: ✭ 30 (-89.25%)
Schema RegistryConfluent Schema Registry for Kafka
Stars: ✭ 1,647 (+490.32%)
columnifyMake record oriented data to columnar format.
Stars: ✭ 28 (-89.96%)
DaFlowApache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple categories of transformation rules.
Stars: ✭ 24 (-91.4%)
dbddbd is a database prototyping tool that enables data analysts and engineers to quickly load and transform data in SQL databases.
Stars: ✭ 30 (-89.25%)
googleapisgoogleapis generated with gogoprotobuf
Stars: ✭ 28 (-89.96%)
GitQuerySync files and directories from a remote Git repo. CLI and Gradle Plugin.
Stars: ✭ 25 (-91.04%)
confluent-spark-avroSpark UDFs to deserialize Avro messages with schemas stored in Schema Registry.
Stars: ✭ 18 (-93.55%)
ob google-bigqueryThis service is meant to simplify running Google Cloud operations, especially BigQuery tasks. This means you do not have to worry about installation, configuration or ongoing maintenance related to an SDK environment. This can be helpful to those who would prefer to not to be responsible for those activities.
Stars: ✭ 43 (-84.59%)
BeetleX.RedisA high-performance async/non-blocking redis client components for dotnet core,default data formater json protobuf and messagepack,support ssl
Stars: ✭ 174 (-37.63%)
goflow2High performance sFlow/IPFIX/NetFlow Collector
Stars: ✭ 125 (-55.2%)
RoapiCreate full-fledged APIs for static datasets without writing a single line of code.
Stars: ✭ 253 (-9.32%)
growthbookOpen Source Feature Flagging and A/B Testing Platform
Stars: ✭ 2,342 (+739.43%)
UschemaIt is used to define the format of the data and validate it in the Ulord blockchain.
Stars: ✭ 33 (-88.17%)
Docker ProtobufAll inclusive Protocol Buffer and gRPC suite, powered by Docker and Alpine
Stars: ✭ 270 (-3.23%)
AndTTT🎲 Simple tic tac toe game for Android
Stars: ✭ 15 (-94.62%)
spring-cloud-stream-event-sourcing-testcontainersGoal: create a Spring Boot application that handles users using Event Sourcing. So, whenever a user is created, updated, or deleted, an event informing this change is sent to Kafka. Also, we will implement another application that listens to those events and saves them in Cassandra. Finally, we will use Testcontainers for integration testing.
Stars: ✭ 16 (-94.27%)
qweryA SQL-like language for performing ETL transformations.
Stars: ✭ 28 (-89.96%)
firehoseFirehose is an extensible, no-code, and cloud-native service to load real-time streaming data from Kafka to data stores, data lakes, and analytical storage systems.
Stars: ✭ 213 (-23.66%)
bigquery fdwBigQuery Foreign Data Wrapper for PostgreSQL
Stars: ✭ 65 (-76.7%)
ronyFast and Scalable RPC Framework
Stars: ✭ 41 (-85.3%)
ocaml-pb-pluginA protoc plugin for generating OCaml code from protobuf (.proto) files.
Stars: ✭ 18 (-93.55%)
Protoc Gen MicroProtobuf code generation for Micro. Moved to go-micro/cmd/protoc-gen-micro.
Stars: ✭ 270 (-3.23%)
alphasqlAlphaSQL provides Integrated Type and Schema Check and Parallelization for SQL file set mainly for BigQuery
Stars: ✭ 35 (-87.46%)
docarrayThe data structure for unstructured data
Stars: ✭ 561 (+101.08%)
telleryTellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Stars: ✭ 219 (-21.51%)
laravel-bigGoogle BigQuery for Laravel
Stars: ✭ 14 (-94.98%)
xrgrpcgRPC library for Cisco IOS XR
Stars: ✭ 40 (-85.66%)
pre-commit-dbt🎣 List of `pre-commit` hooks to ensure the quality of your `dbt` projects.
Stars: ✭ 149 (-46.59%)