1658 open source projects that are alternatives to or similar to Scriptis

Linkis helps easily connect to various back-end computation/storage engines (Spark, Python, TiDB...) and exposes various interfaces (REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.

Stars: ✭ 2,323 (+233.76%)

Mutual labels: sql, spark, hive, pyspark

Xsql

Unified SQL Analytics Engine Based on SparkSQL

Stars: ✭ 176 (-74.71%)

Mutual labels: sql, spark, hive

Kyuubi

Kyuubi is a unified multi-tenant JDBC interface for large-scale data processing and analytics, built on top of Apache Spark

Stars: ✭ 363 (-47.84%)

Mutual labels: sql, spark, hive

incubator-linkis

Stars: ✭ 2,459 (+253.3%)

Mutual labels: spark, hive, pyspark

Bigdata docker

Big Data Ecosystem Docker

Stars: ✭ 161 (-76.87%)

Mutual labels: spark, hive, hue

Quicksql

A Flexible, Fast, Federated (3F) SQL Analysis Middleware for Multiple Data Sources

Stars: ✭ 1,821 (+161.64%)

Mutual labels: sql, spark, hive

Spark With Python

Fundamentals of Spark with Python (using PySpark), code examples

Stars: ✭ 150 (-78.45%)

Mutual labels: sql, spark, pyspark

Wedatasphere

WeDataSphere is a financial-grade, one-stop open-source suite for big data platforms. The source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!

Stars: ✭ 372 (-46.55%)

Mutual labels: spark, hive, ide

Dataspherestudio

DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

Stars: ✭ 1,195 (+71.7%)

Mutual labels: spark, hive, hue

Cc Pyspark

Process Common Crawl data with Python and Spark

Stars: ✭ 147 (-78.88%)

Mutual labels: spark, pyspark

Spark Iforest

Isolation Forest on Spark

Stars: ✭ 166 (-76.15%)

Mutual labels: spark, pyspark

Spark Practice

Apache Spark (PySpark) Practice on Real Data

Stars: ✭ 200 (-71.26%)

Mutual labels: spark, pyspark

Spark Authorizer

A Spark SQL extension which provides SQL Standard Authorization for Apache Spark

Stars: ✭ 141 (-79.74%)

Mutual labels: spark, hive

Pyspark Learning

Updated repository

Stars: ✭ 147 (-78.88%)

Mutual labels: spark, pyspark

Azure Cosmosdb Spark

Apache Spark Connector for Azure Cosmos DB

Stars: ✭ 165 (-76.29%)

Mutual labels: spark, pyspark

Learningapachespark

LearningApacheSpark

Stars: ✭ 155 (-77.73%)

Mutual labels: spark, pyspark

Databook

A facebook for data

Stars: ✭ 26 (-96.26%)

Mutual labels: sql, hive

Spark

Apache Spark - A unified analytics engine for large-scale data processing

Stars: ✭ 31,618 (+4442.82%)

Mutual labels: sql, spark

Spark Website

Apache Spark Website

Stars: ✭ 75 (-89.22%)

Mutual labels: sql, spark

Pyspark Example Project

Example project implementing best practices for PySpark ETL jobs and applications.

Stars: ✭ 633 (-9.05%)

Mutual labels: spark, pyspark

Hnswlib

Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs

Stars: ✭ 108 (-84.48%)

Mutual labels: spark, pyspark

Mmlspark

Simple and Distributed Machine Learning

Stars: ✭ 2,899 (+316.52%)

Mutual labels: spark, pyspark

Maha

A framework for rapid reporting API development, with out-of-the-box support for high-cardinality dimension lookups with Druid.

Stars: ✭ 101 (-85.49%)

Mutual labels: sql, hive

Presto

The official home of the Presto distributed SQL query engine for big data

Stars: ✭ 12,957 (+1761.64%)

Mutual labels: sql, hive

swordfish

Open-source distributed workflow scheduling tool that also supports streaming tasks.

Stars: ✭ 35 (-94.97%)

Mutual labels: spark, hive

ODSC India 2018

My presentation at ODSC India 2018 about Deep Learning with Apache Spark

Stars: ✭ 26 (-96.26%)

Mutual labels: spark, pyspark

spark-acid

ACID Data Source for Apache Spark based on Hive ACID

Stars: ✭ 91 (-86.93%)

Mutual labels: spark, hive

Hadoopcryptoledger

Hadoop Crypto Ledger - Analyzing CryptoLedgers, such as Bitcoin Blockchain, on Big Data platforms, such as Hadoop/Spark/Flink/Hive

Stars: ✭ 126 (-81.9%)

Mutual labels: spark, hive

Eat pyspark in 10 days

pyspark🍒🥭 is delicious, just eat it!😋😋

Stars: ✭ 116 (-83.33%)

Mutual labels: spark, pyspark

Datafusion

DataFusion has now been donated to the Apache Arrow project

Stars: ✭ 611 (-12.21%)

Mutual labels: sql, spark

Cube.js

📊 Cube — Open-Source Analytics API for Building Data Apps

Stars: ✭ 11,983 (+1621.7%)

Mutual labels: spark, hive

Handyspark

HandySpark - bringing pandas-like capabilities to Spark dataframes

Stars: ✭ 158 (-77.3%)

Mutual labels: spark, pyspark

Spark Nlp

State of the Art Natural Language Processing

Stars: ✭ 2,518 (+261.78%)

Mutual labels: spark, pyspark

Pyspark Cheatsheet

🐍 Quick reference guide to common patterns & functions in PySpark.

Stars: ✭ 108 (-84.48%)

Mutual labels: spark, pyspark

Parquet Generator

Parquet file generator

Stars: ✭ 16 (-97.7%)

Mutual labels: sql, spark

Hadoop Docker

A Docker-based Hadoop development and test environment, including Hadoop, Hive, HBase, and Spark

Stars: ✭ 238 (-65.8%)

Mutual labels: spark, hive

Kamu Cli

Next generation tool for decentralized exchange and transformation of semi-structured data

Stars: ✭ 69 (-90.09%)

Mutual labels: sql, spark

Gimel

Big Data Processing Framework - Unified Data API or SQL on Any Storage

Stars: ✭ 216 (-68.97%)

Mutual labels: spark, pyspark

Parquet Index

Spark SQL index for Parquet tables

Stars: ✭ 109 (-84.34%)

Mutual labels: sql, spark

Php Thrift Sql

A PHP library for connecting to Hive or Impala over Thrift

Stars: ✭ 107 (-84.63%)

Mutual labels: sql, hive

aut

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.

Stars: ✭ 111 (-84.05%)

Mutual labels: spark, pyspark

kafka-compose

🎼 Docker compose files for various kafka stacks

Stars: ✭ 32 (-95.4%)

Mutual labels: spark, pyspark

bigdata-fun

A complete (distributed) BigData stack, running in containers

Stars: ✭ 14 (-97.99%)

Mutual labels: spark, hue

data-algorithms-with-spark

O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian

Stars: ✭ 34 (-95.11%)

Mutual labels: spark, pyspark

cloud

Cloud computing: environment setup and configuration files for hadoop, hive, hue, oozie, sqoop, hbase, and zookeeper

Stars: ✭ 48 (-93.1%)

Mutual labels: hive, hue

dockerfiles

Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )

Stars: ✭ 29 (-95.83%)

Mutual labels: hive, hue

data processing course

Some class materials for a data processing course using PySpark

Stars: ✭ 50 (-92.82%)

Mutual labels: spark, pyspark

Ultimatepp

U++ is a C++ cross-platform rapid application development framework focused on programmer's productivity. It includes a set of libraries (GUI, SQL, Network etc.), and integrated development environment (TheIDE).

Stars: ✭ 237 (-65.95%)

Mutual labels: sql, ide

BigData-News

A real-time big data system project for a news website, based on Spark 2.2

Stars: ✭ 36 (-94.83%)

Mutual labels: spark, hive

spark-extension

A library that provides useful extensions to Apache Spark and PySpark.

Stars: ✭ 25 (-96.41%)

Mutual labels: spark, pyspark

Metorikku

A simplified, lightweight ETL Framework based on Apache Spark

Stars: ✭ 361 (-48.13%)

Mutual labels: sql, spark

Hive

Apache Hive

Stars: ✭ 4,031 (+479.17%)

Mutual labels: sql, hive

Moonbox

Moonbox is a DVtaaS (Data Virtualization as a Service) Platform

Stars: ✭ 424 (-39.08%)

Mutual labels: spark, hive

Devops Python Tools

80+ DevOps & Data CLI Tools - AWS, GCP, GCF Python Cloud Function, Log Anonymizer, Spark, Hadoop, HBase, Hive, Impala, Linux, Docker, Spark Data Converters & Validators (Avro/Parquet/JSON/CSV/INI/XML/YAML), Travis CI, AWS CloudFormation, Elasticsearch, Solr etc.

Stars: ✭ 406 (-41.67%)

Mutual labels: spark, pyspark

Trino

Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)

Stars: ✭ 4,581 (+558.19%)

Mutual labels: sql, hive

Relation extraction

Relation Extraction using Deep Learning (CNN)

Stars: ✭ 96 (-86.21%)

Mutual labels: spark, pyspark

Bigdata Notes

A beginner's guide to big data ⭐

Stars: ✭ 10,991 (+1479.17%)

Mutual labels: spark, hive

God Of Bigdata

Focused on big data learning and interview preparation; start your path to big data mastery. Flink/Spark/Hadoop/Hbase/Hive...

Stars: ✭ 6,008 (+763.22%)

Mutual labels: spark, hive

basin

Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser

Stars: ✭ 25 (-96.41%)

Mutual labels: spark, pyspark

Yanagishima

Web UI for Trino, Presto, Hive, Elasticsearch, SparkSQL

Stars: ✭ 424 (-39.08%)

Mutual labels: spark, hive

1-60 of 1658 similar projects
