494 open source projects that are alternatives to or similar to Moonbox

Wedatasphere
WeDataSphere is a financial-grade, one-stop open-source suite for big data platforms. The source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (-12.26%)
Mutual labels:  spark, hive
Kyuubi
Kyuubi is a unified multi-tenant JDBC interface for large-scale data processing and analytics, built on top of Apache Spark
Stars: ✭ 363 (-14.39%)
Mutual labels:  spark, hive
Linkis
Linkis helps easily connect to various back-end computation/storage engines (Spark, Python, TiDB...), exposes various interfaces (REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+447.88%)
Mutual labels:  spark, hive
Hadoop cookbook
Cookbook to install Hadoop 2.0+ using Chef
Stars: ✭ 82 (-80.66%)
Mutual labels:  spark, hive
Cube.js
📊 Cube — Open-Source Analytics API for Building Data Apps
Stars: ✭ 11,983 (+2726.18%)
Mutual labels:  spark, hive
Dataspherestudio
DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+181.84%)
Mutual labels:  spark, hive
Yanagishima
Web UI for Trino, Presto, Hive, Elasticsearch, SparkSQL
Stars: ✭ 424 (+0%)
Mutual labels:  spark, hive
Bigdata Notes
A beginner's guide to big data ⭐
Stars: ✭ 10,991 (+2492.22%)
Mutual labels:  spark, hive
Luigi Warehouse
A luigi powered analytics / warehouse stack
Stars: ✭ 72 (-83.02%)
Mutual labels:  spark, hive
Repository
Personal study knowledge base covering data warehouse modeling, real-time computing, big data, Java, algorithms, and more.
Stars: ✭ 92 (-78.3%)
Mutual labels:  spark, hive
Szt Bigdata
Shenzhen Metro big data passenger flow analysis system 🚇🚄🌟
Stars: ✭ 826 (+94.81%)
Mutual labels:  spark, hive
swordfish
Open-source distributed workflow scheduling tool that also supports streaming tasks.
Stars: ✭ 35 (-91.75%)
Mutual labels:  spark, hive
Hadoop Docker
Docker-based Hadoop development and test environment, including Hadoop, Hive, HBase, and Spark
Stars: ✭ 238 (-43.87%)
Mutual labels:  spark, hive
Spark Authorizer
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark
Stars: ✭ 141 (-66.75%)
Mutual labels:  spark, hive
BigData-News
Real-time big data system project for a news website, based on Spark 2.2
Stars: ✭ 36 (-91.51%)
Mutual labels:  spark, hive
Bdp Dataplatform
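Big data ecosystem solution data platform: a big data solution built on big data, data platforms, microservices, machine learning, e-commerce, automated operations, DevOps, a container deployment platform, and data platform collection, storage, computing, development, and application building.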
Stars: ✭ 456 (+7.55%)
Mutual labels:  spark, hive
God Of Bigdata
Focused on big data learning and interviews; the road to big data mastery starts here. Flink/Spark/Hadoop/Hbase/Hive...
Stars: ✭ 6,008 (+1316.98%)
Mutual labels:  spark, hive
Quicksql
A Flexible, Fast, Federated (3F) SQL Analysis Middleware for Multiple Data Sources
Stars: ✭ 1,821 (+329.48%)
Mutual labels:  spark, hive
Xsql
Unified SQL Analytics Engine Based on SparkSQL
Stars: ✭ 176 (-58.49%)
Mutual labels:  spark, hive
Scriptis
Scriptis is for interactive data analysis with script development (SQL, Pyspark, HiveQL), task submission (Spark, Hive), UDF, function, resource management and intelligent diagnosis.
Stars: ✭ 696 (+64.15%)
Mutual labels:  spark, hive
Bigdataguide
Big data learning: learn big data from scratch, including study videos and interview materials for each stage of learning
Stars: ✭ 817 (+92.69%)
Mutual labels:  spark, hive
Hops Examples
Examples for Deep Learning/Feature Store/Spark/Flink/Hive/Kafka jobs and Jupyter notebooks on Hops
Stars: ✭ 84 (-80.19%)
Mutual labels:  spark, hive
Apache Spark Hands On
Educational notes and hands-on problems with solutions for the Hadoop ecosystem
Stars: ✭ 74 (-82.55%)
Mutual labels:  spark, hive
Bigdata docker
Big Data Ecosystem Docker
Stars: ✭ 161 (-62.03%)
Mutual labels:  spark, hive
Hadoopcryptoledger
Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive
Stars: ✭ 126 (-70.28%)
Mutual labels:  spark, hive
spark-acid
ACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (-78.54%)
Mutual labels:  spark, hive
incubator-linkis
Linkis helps easily connect to various back-end computation/storage engines (Spark, Python, TiDB...), exposes various interfaces (REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,459 (+479.95%)
Mutual labels:  spark, hive
Wirbelsturm
Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a focus on big data tech like Kafka.
Stars: ✭ 332 (-21.7%)
Mutual labels:  spark
Bigdl
Building Large-Scale AI Applications for Distributed Big Data
Stars: ✭ 3,813 (+799.29%)
Mutual labels:  spark
Datafaker
Datafaker is a large-scale test data and flow test data generation tool. Datafaker fakes data and inserts it into various data sources. A test data generation tool.
Stars: ✭ 327 (-22.88%)
Mutual labels:  hive
Sparklint
A tool for monitoring and tuning Spark jobs for efficiency.
Stars: ✭ 316 (-25.47%)
Mutual labels:  spark
Spark Solr
Tools for reading data from Solr as a Spark RDD and indexing objects from Spark into Solr using SolrJ.
Stars: ✭ 411 (-3.07%)
Mutual labels:  spark
Tensorflowonspark
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
Stars: ✭ 3,748 (+783.96%)
Mutual labels:  spark
Cook
Fair job scheduler on Kubernetes and Mesos for batch workloads and Spark
Stars: ✭ 314 (-25.94%)
Mutual labels:  spark
Clickhouse Native Jdbc
ClickHouse Native Protocol JDBC implementation
Stars: ✭ 310 (-26.89%)
Mutual labels:  spark
Hive
Apache Hive
Stars: ✭ 4,031 (+850.71%)
Mutual labels:  hive
Coolplayspark
Coolplay Spark (酷玩 Spark): Spark source code analysis, Spark libraries, and more
Stars: ✭ 3,318 (+682.55%)
Mutual labels:  spark
Sparkle
Haskell on Apache Spark.
Stars: ✭ 419 (-1.18%)
Mutual labels:  spark
Devops Python Tools
80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.
Stars: ✭ 406 (-4.25%)
Mutual labels:  spark
Learningsparkv2
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
Stars: ✭ 307 (-27.59%)
Mutual labels:  spark
Crayon
Simple framework-agnostic UI router for SPAs
Stars: ✭ 310 (-26.89%)
Mutual labels:  spark
Delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
Stars: ✭ 3,903 (+820.52%)
Mutual labels:  spark
Spark Structured Streaming Book
The Internals of Spark Structured Streaming
Stars: ✭ 371 (-12.5%)
Mutual labels:  spark
Spline
Data Lineage Tracking And Visualization Solution
Stars: ✭ 306 (-27.83%)
Mutual labels:  spark
Zat
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (-28.54%)
Mutual labels:  spark
Tutorial
Summary of a full-stack Java knowledge architecture
Stars: ✭ 407 (-4.01%)
Mutual labels:  spark
Sparkmeasure
This is the development repository of SparkMeasure, a tool for performance troubleshooting of Apache Spark workloads. It simplifies the collection and analysis of Spark task metrics data.
Stars: ✭ 368 (-13.21%)
Mutual labels:  spark
Awesome Ada
A curated list of awesome resources related to the Ada and SPARK programming language
Stars: ✭ 299 (-29.48%)
Mutual labels:  spark
Elasticluster
Create clusters of VMs on the cloud and configure them with Ansible.
Stars: ✭ 298 (-29.72%)
Mutual labels:  spark
Sidekick
High Performance HTTP Sidecar Load Balancer
Stars: ✭ 366 (-13.68%)
Mutual labels:  spark
Spark Hbase Connector
Connect Spark to HBase for reading and writing data with ease
Stars: ✭ 299 (-29.48%)
Mutual labels:  spark
Learningspark
Scala examples for learning to use Spark
Stars: ✭ 421 (-0.71%)
Mutual labels:  spark
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (-2.59%)
Mutual labels:  spark
Big data architect skills
Skills a big data architect should master
Stars: ✭ 400 (-5.66%)
Mutual labels:  spark
Spark Notebook
Interactive and Reactive Data Science using Scala and Spark.
Stars: ✭ 3,081 (+626.65%)
Mutual labels:  spark
Spark Druid Olap
Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform (http://bit.ly/2oBJSpP), an integrated BI platform on Apache Spark.
Stars: ✭ 282 (-33.49%)
Mutual labels:  spark
Trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+980.42%)
Mutual labels:  hive
Metorikku
A simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (-14.86%)
Mutual labels:  spark
Cloudflow
Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
Stars: ✭ 278 (-34.43%)
Mutual labels:  spark
Hbase Rdd
Spark RDD to read, write and delete from HBase
Stars: ✭ 277 (-34.67%)
Mutual labels:  spark
1-60 of 494 similar projects