Bigdata File ViewerA cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
Stars: ✭ 86 (-95.71%)
columnifyMake record oriented data to columnar format.
Stars: ✭ 28 (-98.6%)
Schema RegistryConfluent Schema Registry for Kafka
Stars: ✭ 1,647 (-17.86%)
MlsqlThe Programming Language Designed For Big Data and AI
Stars: ✭ 1,262 (-37.06%)
Liteflowliteflow是一个基于任务版本来实现的分布式任务流调度系统
Stars: ✭ 112 (-94.41%)
Ignite Book Code SamplesAll code samples, scripts and more in-depth examples for the book high performance in-memory computing with Apache Ignite. Please use the repository "the-apache-ignite-book" for Ignite version 2.6 or above.
Stars: ✭ 86 (-95.71%)
AbrisAvro SerDe for Apache Spark structured APIs.
Stars: ✭ 130 (-93.52%)
Daudit🌲 Configuration flaws detector for Hadoop, MongoDB, MySQL, and more!
Stars: ✭ 108 (-94.61%)
SparktutorialSource code for James Lee's Aparch Spark with Java course
Stars: ✭ 105 (-94.76%)
SlimmessagebusLightweight message bus interface for .NET (pub/sub and request-response) with transport plugins for popular message brokers.
Stars: ✭ 120 (-94.01%)
SchemerSchema registry for CSV, TSV, JSON, AVRO and Parquet schema. Supports schema inference and GraphQL API.
Stars: ✭ 97 (-95.16%)
TwitworkMonitor twitter stream
Stars: ✭ 133 (-93.37%)
MnemonicApache Mnemonic - A non-volatile hybrid memory storage oriented library
Stars: ✭ 91 (-95.46%)
Lambda ArchApplying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (-94.46%)
Avro4kAvro support for kotlinx.serialization
Stars: ✭ 82 (-95.91%)
Open Bank MarkA bank simulation application using mainly Clojure, which can be used to end-to-end test and show some graphs.
Stars: ✭ 81 (-95.96%)
Flinkstreamsql基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法
Stars: ✭ 1,682 (-16.11%)
Cleanframestype-class based data cleansing library for Apache Spark SQL
Stars: ✭ 75 (-96.26%)
Spark.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (-14.16%)
Rumble⛈️ Rumble 1.11.0 "Banyan Tree"🌳 for Apache Spark | Run queries on your large-scale, messy JSON-like data (JSON, text, CSV, Parquet, ROOT, AVRO, SVM...) | No install required (just a jar to download) | Declarative Machine Learning and more
Stars: ✭ 58 (-97.11%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-94.66%)
Dcos MetricsThe metrics pipeline for DC/OS 1.9-1.11
Stars: ✭ 57 (-97.16%)
Schema RegistryA CLI and Go client for Kafka Schema Registry
Stars: ✭ 105 (-94.76%)
HadoopcryptoledgerHadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (-93.72%)
SplashSplash, a flexible Spark shuffle manager that supports user-defined storage backends for shuffle data storage and exchange
Stars: ✭ 105 (-94.76%)
RqRecord Query - A tool for doing record analysis and transformation
Stars: ✭ 1,808 (-9.83%)
KebsScala library to eliminate boilerplate
Stars: ✭ 113 (-94.36%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (-33.27%)
Azure Event Hubs SparkEnabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-93.02%)
MagnolifyA collection of Magnolia add-on modules
Stars: ✭ 81 (-95.96%)
GenieDistributed Big Data Orchestration Service
Stars: ✭ 1,544 (-22.99%)
Biglassobiglasso: Extending Lasso Model Fitting to Big Data in R
Stars: ✭ 87 (-95.66%)
AvroA fast Go Avro codec
Stars: ✭ 132 (-93.42%)
Kaufmann exKafka backed service library.
Stars: ✭ 86 (-95.71%)
Avro Hadoop StarterExample MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Stars: ✭ 110 (-94.51%)
Aptos☀️ A tool for validating data using JSON Schema and converting JSON Schema documents into different data-interchange formats
Stars: ✭ 144 (-92.82%)
Athena CliPresto-like CLI tool for AWS Athena
Stars: ✭ 85 (-95.76%)
Books技术书籍等
Stars: ✭ 110 (-94.51%)
Avro BuilderRuby DSL to create Avro schemas
Stars: ✭ 82 (-95.91%)
TipdmTipDM建模平台,开源的数据挖掘工具。
Stars: ✭ 130 (-93.52%)
Uproot4ROOT I/O in pure Python and NumPy.
Stars: ✭ 80 (-96.01%)
Spark R Notebooks R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-94.56%)
Apache Spark Hands OnEducational notes,Hands on problems w/ solutions for hadoop ecosystem
Stars: ✭ 74 (-96.31%)
NoprotoFlexible, Fast & Compact Serialization with RPC
Stars: ✭ 138 (-93.12%)
Countly Sdk CordovaCountly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-96.56%)
Awesome BigdataA curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 10,478 (+422.59%)
Gcs ToolsGCS support for avro-tools, parquet-tools and protobuf
Stars: ✭ 57 (-97.16%)
FpartSort files and pack them into partitions
Stars: ✭ 127 (-93.67%)
ExamplesDemo applications and code examples for Confluent Platform and Apache Kafka
Stars: ✭ 571 (-71.52%)
GriddbGridDB is a next-generation open source database that makes time series IoT and big data fast,and easy.
Stars: ✭ 1,587 (-20.85%)
PoliAn easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain faster business insights.
Stars: ✭ 1,850 (-7.73%)
Kafka Connect Mongodb**Unofficial / Community** Kafka Connect MongoDB Sink Connector - Find the official MongoDB Kafka Connector here: https://www.mongodb.com/kafka-connector
Stars: ✭ 137 (-93.17%)
VolcanoA Cloud Native Batch System (Project under CNCF)
Stars: ✭ 2,114 (+5.44%)