HiveApache Hive
Stars: ✭ 4,031 (-12.01%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-96.73%)
PrestoThe official home of the Presto distributed SQL query engine for big data
Stars: ✭ 12,957 (+182.84%)
MahaA framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
Stars: ✭ 101 (-97.8%)
CrateCrateDB is a distributed SQL database that makes it simple to store and analyze
massive amounts of data in real-time.
Stars: ✭ 3,254 (-28.97%)
Griffon VmGriffon Data Science Virtual Machine
Stars: ✭ 128 (-97.21%)
RqliteThe lightweight, distributed relational database built on SQLite
Stars: ✭ 9,147 (+99.67%)
EventqlDistributed "massively parallel" SQL query engine
Stars: ✭ 1,121 (-75.53%)
LinkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (-49.29%)
ClickhouseClickHouse® is a free analytics DBMS for big data
Stars: ✭ 21,089 (+360.36%)
PachydermReproducible Data Science at Scale!
Stars: ✭ 5,305 (+15.8%)
DrillApache Drill is a distributed MPP query layer for self describing data
Stars: ✭ 1,619 (-64.66%)
KyuubiKyuubi is a unified multi-tenant JDBC interface for large-scale data processing and analytics, built on top of Apache Spark
Stars: ✭ 363 (-92.08%)
IgniteApache Ignite
Stars: ✭ 4,027 (-12.09%)
MigrateDatabase migrations. CLI and Golang library.
Stars: ✭ 7,712 (+68.35%)
PhoenixMirror of Apache Phoenix
Stars: ✭ 867 (-81.07%)
GoroseGoRose(go orm), a mini database ORM for golang, which inspired by the famous php framwork laravle's eloquent. It will be friendly for php developer and python or ruby developer. Currently provides six major database drivers: mysql,sqlite3,postgres,oracle,mssql, Clickhouse.
Stars: ✭ 947 (-79.33%)
EbeanEbean ORM
Stars: ✭ 1,172 (-74.42%)
hive-jdbc-driverAn alternative to the "hive standalone" jar for connecting Java applications to Apache Hive via JDBC
Stars: ✭ 31 (-99.32%)
RadonRadonDB is an open source, cloud-native MySQL database for building global, scalable cloud services
Stars: ✭ 1,584 (-65.42%)
NormAccess a database in one line of code.
Stars: ✭ 152 (-96.68%)
TidbTiDB is an open source distributed HTAP database compatible with the MySQL protocol
Stars: ✭ 29,871 (+552.06%)
JailerDatabase Subsetting and Relational Data Browsing Tool.
Stars: ✭ 576 (-87.43%)
SparkApache Spark - A unified analytics engine for large-scale data processing
Stars: ✭ 31,618 (+590.2%)
Yugabyte DbThe high-performance distributed SQL database for global, internet-scale apps.
Stars: ✭ 5,890 (+28.57%)
HerddbA JVM-embeddable Distributed Database
Stars: ✭ 192 (-95.81%)
Interferenceopensource distributed database with base JPA implementation and event processing support
Stars: ✭ 57 (-98.76%)
RagtimeDatabase-independent migration library
Stars: ✭ 519 (-88.67%)
Docker SupersetRepository for Docker Image of Apache-Superset. [Docker Image: https://hub.docker.com/r/abhioncbr/docker-superset]
Stars: ✭ 86 (-98.12%)
ShardingsphereBuild criterion and ecosystem above multi-model databases
Stars: ✭ 14,989 (+227.2%)
MigrateDatabase migrations. CLI and Golang library.
Stars: ✭ 2,315 (-49.47%)
CalciteApache Calcite
Stars: ✭ 2,816 (-38.53%)
Php Thrift SqlA PHP library for connecting to Hive or Impala over Thrift
Stars: ✭ 107 (-97.66%)
Jcabi JdbcFluent Wrapper of JDBC
Stars: ✭ 90 (-98.04%)
Presto Go ClientA Presto client for the Go programming language.
Stars: ✭ 183 (-96.01%)
CitusDistributed PostgreSQL as an extension
Stars: ✭ 5,580 (+21.81%)
DuckdbDuckDB is an in-process SQL OLAP Database Management System
Stars: ✭ 4,014 (-12.38%)
JaydebeapiJayDeBeApi module allows you to connect from Python code to databases using Java JDBC. It provides a Python DB-API v2.0 to that database.
Stars: ✭ 247 (-94.61%)
PreqlAn interpreted relational query language that compiles to SQL.
Stars: ✭ 257 (-94.39%)
Data Science Ipython NotebooksData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+381.29%)
SqlhelperSQL Tools ( Dialect, Pagination, DDL dump, UrlParser, SqlStatementParser, WallFilter, BatchExecutor for Test) based Java. it is easy to integration into any ORM frameworks
Stars: ✭ 242 (-94.72%)
Data Science CareerCareer Resources for Data Science, Machine Learning, Big Data and Business Analytics Career Repository
Stars: ✭ 630 (-86.25%)
H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+23.47%)
MaterializeMaterialize lets you ask questions of your live data, which it answers and then maintains for you as your data continue to change. The moment you need a refreshed answer, you can get it in milliseconds. Materialize is designed to help you interactively explore your streaming data, perform data warehousing analytics against live relational data, or just increase the freshness and reduce the load of your dashboard and monitoring tasks.
Stars: ✭ 3,341 (-27.07%)
AddaxAddax is an open source universal ETL tool that supports most of those RDBMS and NoSQLs on the planet, helping you transfer data from any one place to another.
Stars: ✭ 615 (-86.57%)
incubator-linkisLinkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,459 (-46.32%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-97.66%)
Awesome BigdataA curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 10,478 (+128.73%)
SaynData processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Stars: ✭ 79 (-98.28%)
Data Science Live BookAn open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (-95.79%)
H2databaseH2 is an embeddable RDBMS written in Java.
Stars: ✭ 3,078 (-32.81%)
the-apache-ignite-bookAll code samples, scripts and more in-depth examples for The Apache Ignite Book. Include Apache Ignite 2.6 or above
Stars: ✭ 65 (-98.58%)
CockroachCockroachDB - the open source, cloud-native distributed SQL database.
Stars: ✭ 22,700 (+395.52%)