LinkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+233.76%)
Mutual labels: sql, spark, hive, pyspark
incubator-linkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,459 (+253.3%)
Mutual labels: spark, hive, pyspark
DataspherestudioDataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+71.7%)
Mutual labels: spark, hive, hue
WedatasphereWeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (-46.55%)
Mutual labels: spark, hive, ide
Bigdata dockerBig Data Ecosystem Docker
Stars: ✭ 161 (-76.87%)
Mutual labels: spark, hive, hue
QuicksqlA Flexible, Fast, Federated(3F) SQL Analysis Middleware for Multiple Data Sources
Stars: ✭ 1,821 (+161.64%)
Mutual labels: sql, spark, hive
KyuubiKyuubi is a unified multi-tenant JDBC interface for large-scale data processing and analytics, built on top of Apache Spark
Stars: ✭ 363 (-47.84%)
Mutual labels: sql, spark, hive
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-78.45%)
Mutual labels: sql, spark, pyspark
XsqlUnified SQL Analytics Engine Based on SparkSQL
Stars: ✭ 176 (-74.71%)
Mutual labels: sql, spark, hive
spark-extensionA library that provides useful extensions to Apache Spark and PySpark.
Stars: ✭ 25 (-96.41%)
Mutual labels: spark, pyspark
data-algorithms-with-sparkO'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
Stars: ✭ 34 (-95.11%)
Mutual labels: spark, pyspark
TrinoOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+558.19%)
Mutual labels: sql, hive
bigdata-funA complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-97.99%)
Mutual labels: spark, hue
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-84.05%)
Mutual labels: spark, pyspark
basinBasin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser
Stars: ✭ 25 (-96.41%)
Mutual labels: spark, pyspark
BigData-News基于Spark2.2新闻网大数据实时系统项目
Stars: ✭ 36 (-94.83%)
Mutual labels: spark, hive
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (-48.13%)
Mutual labels: sql, spark
HiveApache Hive
Stars: ✭ 4,031 (+479.17%)
Mutual labels: sql, hive
YanagishimaWeb UI for Trino, Presto, Hive, Elasticsearch, SparkSQL
Stars: ✭ 424 (-39.08%)
Mutual labels: spark, hive
MoonboxMoonbox is a DVtaaS (Data Virtualization as a Service) Platform
Stars: ✭ 424 (-39.08%)
Mutual labels: spark, hive