All Projects → ExpediaGroup → apiary

ExpediaGroup / apiary

Licence: Apache-2.0 license
Apiary provides modules which can be combined to create a federated cloud data lake

Projects that are alternatives of or similar to apiary

hive-metastore-client
A client for connecting and running DDLs on hive metastore.
Stars: ✭ 37 (+23.33%)
Mutual labels:  hive, hive-metastore
Trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+15170%)
Mutual labels:  hive, datalake
waggle-dance
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
Stars: ✭ 194 (+546.67%)
Mutual labels:  hive, hive-metastore
beekeeper
Service for automatically managing and cleaning up unreferenced data
Stars: ✭ 43 (+43.33%)
Mutual labels:  hive, hive-metastore
hadoopoffice
HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (+86.67%)
Mutual labels:  hive
pan-cortex-data-lake-python
Python idiomatic SDK for Cortex™ Data Lake.
Stars: ✭ 36 (+20%)
Mutual labels:  datalake
fense
Fense is a database proxy written in Java, which can connect DB of different engines at the same time. The key features are: authority management, query cache, audit security, current limiting fuse, onesql and so on
Stars: ✭ 22 (-26.67%)
Mutual labels:  hive
zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Stars: ✭ 655 (+2083.33%)
Mutual labels:  datalake
HiveJdbcStorageHandler
No description or website provided.
Stars: ✭ 21 (-30%)
Mutual labels:  hive
liquibase-impala
Liquibase extension to add Impala Database support
Stars: ✭ 23 (-23.33%)
Mutual labels:  hive
data-profiling
a set of scripts to pull meta data and data profiling metrics from relational database systems
Stars: ✭ 57 (+90%)
Mutual labels:  hive
awesome-hive
A curated list of awesome Hive resources.
Stars: ✭ 20 (-33.33%)
Mutual labels:  hive
databricks-dbapi
DBAPI and SQLAlchemy dialect for Databricks Workspace and SQL Analytics clusters
Stars: ✭ 21 (-30%)
Mutual labels:  hive
datalake-etl-pipeline
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (+30%)
Mutual labels:  datalake
hive-cube
Data self exporting and monitoring platform based on Hive data warehouse. https://hc.smartloli.org
Stars: ✭ 34 (+13.33%)
Mutual labels:  hive
dockerfiles
Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, Hue, Mesos, ... )
Stars: ✭ 29 (-3.33%)
Mutual labels:  hive
xxhadoop
Data Analysis Using Hadoop/Spark/Storm/ElasticSearch/MachineLearning etc. This is My Daily Notes/Code/Demo. Don't fork, Just star !
Stars: ✭ 37 (+23.33%)
Mutual labels:  hive
hadoop-etl-udfs
The Hadoop ETL UDFs are the main way to load data from Hadoop into EXASOL
Stars: ✭ 17 (-43.33%)
Mutual labels:  hive
hiveql-parser
HiveQL Parser. Parse HiveQL code and print AST in JSON format if success, else print well formed syntax error message.
Stars: ✭ 25 (-16.67%)
Mutual labels:  hive
simple-ddl-parser
Simple DDL Parser to parse SQL (HQL, TSQL, AWS Redshift, BigQuery, Snowflake and other dialects) ddl files to json/python dict with full information about columns: types, defaults, primary keys, etc. & table properties, types, domains, etc.
Stars: ✭ 76 (+153.33%)
Mutual labels:  hive

Apiary.

Overview

Apiary provides modules which can be combined to create a federated cloud data lake. These include:

  • Read/write Hive metastore service
  • Read only Hive metastore service
  • Waggle Dance federated Hive metastore service
  • Beekeeper event-based data lifecycle service
  • Drone Fly decouples your Hive metastore (HMS) MetaStoreEventListener implementations from HMS.
  • Related infrastructure including load balancers
  • Various extensions and plugins for adding additional functionality to the Hive metastore

Components

Apiary consists of the following components which are managed in separate git repositories:

Contact

Mailing List

If you would like to ask any questions about or discuss Apiary please join our mailing list at

https://groups.google.com/forum/#!forum/apiary-user

Legal

This project is available under the Apache 2.0 License.

Copyright 2018-2019 Expedia, Inc.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].