All Projects → Moonbox → Similar Projects or Alternatives

494 Open source projects that are alternatives of or similar to Moonbox

WeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!

Stars: ✭ 372 (-12.26%)

Mutual labels: spark, hive

Kyuubi

Kyuubi is a unified multi-tenant JDBC interface for large-scale data processing and analytics, built on top of Apache Spark

Stars: ✭ 363 (-14.39%)

Mutual labels: spark, hive

Linkis

Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.

Stars: ✭ 2,323 (+447.88%)

Mutual labels: spark, hive

Hadoop cookbook

Cookbook to install Hadoop 2.0+ using Chef

Stars: ✭ 82 (-80.66%)

Mutual labels: spark, hive

Cube.js

📊 Cube — Open-Source Analytics API for Building Data Apps

Stars: ✭ 11,983 (+2726.18%)

Mutual labels: spark, hive

Dataspherestudio

DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

Stars: ✭ 1,195 (+181.84%)

Mutual labels: spark, hive

Yanagishima

Web UI for Trino, Presto, Hive, Elasticsearch, SparkSQL

Stars: ✭ 424 (+0%)

Mutual labels: spark, hive

Bigdata Notes

大数据入门指南 ⭐

Stars: ✭ 10,991 (+2492.22%)

Mutual labels: spark, hive

Luigi Warehouse

A luigi powered analytics / warehouse stack

Stars: ✭ 72 (-83.02%)

Mutual labels: spark, hive

Repository

个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。

Stars: ✭ 92 (-78.3%)

Mutual labels: spark, hive

Szt Bigdata

深圳地铁大数据客流分析系统🚇🚄🌟

Stars: ✭ 826 (+94.81%)

Mutual labels: spark, hive

swordfish

Open-source distribute workflow schedule tools, also support streaming task.

Stars: ✭ 35 (-91.75%)

Mutual labels: spark, hive

Hadoop Docker

基于Docker构建的Hadoop开发测试环境，包含Hadoop，Hive，HBase，Spark

Stars: ✭ 238 (-43.87%)

Mutual labels: spark, hive

Spark Authorizer

A Spark SQL extension which provides SQL Standard Authorization for Apache Spark

Stars: ✭ 141 (-66.75%)

Mutual labels: spark, hive

BigData-News

基于Spark2.2新闻网大数据实时系统项目

Stars: ✭ 36 (-91.51%)

Mutual labels: spark, hive

Bdp Dataplatform

大数据生态解决方案数据平台：基于大数据、数据平台、微服务、机器学习、商城、自动化运维、DevOps、容器部署平台、数据平台采集、数据平台存储、数据平台计算、数据平台开发、数据平台应用搭建的大数据解决方案。

Stars: ✭ 456 (+7.55%)

Mutual labels: spark, hive

God Of Bigdata

专注大数据学习面试，大数据成神之路开启。Flink/Spark/Hadoop/Hbase/Hive...

Stars: ✭ 6,008 (+1316.98%)

Mutual labels: spark, hive

Quicksql

A Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources

Stars: ✭ 1,821 (+329.48%)

Mutual labels: spark, hive

Xsql

Unified SQL Analytics Engine Based on SparkSQL

Stars: ✭ 176 (-58.49%)

Mutual labels: spark, hive

Scriptis

Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.

Stars: ✭ 696 (+64.15%)

Mutual labels: spark, hive

Bigdataguide

大数据学习，从零开始学习大数据，包含大数据学习各阶段学习视频、面试资料

Stars: ✭ 817 (+92.69%)

Mutual labels: spark, hive

Hops Examples

Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops

Stars: ✭ 84 (-80.19%)

Mutual labels: spark, hive

Apache Spark Hands On

Educational notes,Hands on problems w/ solutions for hadoop ecosystem

Stars: ✭ 74 (-82.55%)

Mutual labels: spark, hive

Bigdata docker

Big Data Ecosystem Docker

Stars: ✭ 161 (-62.03%)

Mutual labels: spark, hive

Hadoopcryptoledger

Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive

Stars: ✭ 126 (-70.28%)

Mutual labels: spark, hive

spark-acid

ACID Data Source for Apache Spark based on Hive ACID

Stars: ✭ 91 (-78.54%)

Mutual labels: spark, hive

incubator-linkis

Stars: ✭ 2,459 (+479.95%)

Mutual labels: spark, hive

Wirbelsturm

Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.

Stars: ✭ 332 (-21.7%)

Mutual labels: spark

Bigdl

Building Large-Scale AI Applications for Distributed Big Data

Stars: ✭ 3,813 (+799.29%)

Mutual labels: spark

Datafaker

Datafaker is a large-scale test data and flow test data generation tool. Datafaker fakes data and inserts to varied data sources. 测试数据生成工具

Stars: ✭ 327 (-22.88%)

Mutual labels: hive

Sparklint

A tool for monitoring and tuning Spark jobs for efficiency.

Stars: ✭ 316 (-25.47%)

Mutual labels: spark

Spark Solr

Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.

Stars: ✭ 411 (-3.07%)

Mutual labels: spark

Tensorflowonspark

TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.

Stars: ✭ 3,748 (+783.96%)

Mutual labels: spark

Cook

Fair job scheduler on Kubernetes and Mesos for batch workloads and Spark

Stars: ✭ 314 (-25.94%)

Mutual labels: spark

Clickhouse Native Jdbc

ClickHouse Native Protocol JDBC implementation

Stars: ✭ 310 (-26.89%)

Mutual labels: spark

Hive

Apache Hive

Stars: ✭ 4,031 (+850.71%)

Mutual labels: hive

Coolplayspark

酷玩 Spark: Spark 源代码解析、Spark 类库等

Stars: ✭ 3,318 (+682.55%)

Mutual labels: spark

Sparkle

Haskell on Apache Spark.

Stars: ✭ 419 (-1.18%)

Mutual labels: spark

Devops Python Tools

80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.

Stars: ✭ 406 (-4.25%)

Mutual labels: spark

Learningsparkv2

This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

Stars: ✭ 307 (-27.59%)

Mutual labels: spark

Crayon

Simple framework agnostic UI router for SPAs

Stars: ✭ 310 (-26.89%)

Mutual labels: spark

Delta

An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.

Stars: ✭ 3,903 (+820.52%)

Mutual labels: spark

Spark Structured Streaming Book

The Internals of Spark Structured Streaming

Stars: ✭ 371 (-12.5%)

Mutual labels: spark

Spline

Data Lineage Tracking And Visualization Solution

Stars: ✭ 306 (-27.83%)

Mutual labels: spark

Zat

Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark

Stars: ✭ 303 (-28.54%)

Mutual labels: spark

Tutorial

Java全栈知识架构体系总结

Stars: ✭ 407 (-4.01%)

Mutual labels: spark

Sparkmeasure

This is the development repository of SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics data.

Stars: ✭ 368 (-13.21%)

Mutual labels: spark

Awesome Ada

A curated list of awesome resources related to the Ada and SPARK programming language

Stars: ✭ 299 (-29.48%)

Mutual labels: spark

Elasticluster

Create clusters of VMs on the cloud and configure them with Ansible.

Stars: ✭ 298 (-29.72%)

Mutual labels: spark

Sidekick

High Performance HTTP Sidecar Load Balancer

Stars: ✭ 366 (-13.68%)

Mutual labels: spark

Spark Hbase Connector

Connect Spark to HBase for reading and writing data with ease

Stars: ✭ 299 (-29.48%)

Mutual labels: spark

Learningspark

Scala examples for learning to use Spark

Stars: ✭ 421 (-0.71%)

Mutual labels: spark

Agile data code 2

Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition

Stars: ✭ 413 (-2.59%)

Mutual labels: spark

Big data architect skills

一个大数据架构师应该掌握的技能

Stars: ✭ 400 (-5.66%)

Mutual labels: spark

Spark Notebook

Interactive and Reactive Data Science using Scala and Spark.

Stars: ✭ 3,081 (+626.65%)

Mutual labels: spark

Spark Druid Olap

Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit.ly/2oBJSpP) an Integrated BI platform on Apache Spark.

Stars: ✭ 282 (-33.49%)

Mutual labels: spark

Trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Stars: ✭ 4,581 (+980.42%)

Mutual labels: hive

Metorikku

A simplified, lightweight ETL Framework based on Apache Spark

Stars: ✭ 361 (-14.86%)

Mutual labels: spark

Cloudflow

Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.