388 open source projects that are alternatives to or similar to hive-metastore-client

beekeeper
Service for automatically managing and cleaning up unreferenced data
Stars: ✭ 43 (+16.22%)
Mutual labels:  hive, metastore, hive-metastore
waggle-dance
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
Stars: ✭ 194 (+424.32%)
Mutual labels:  hive, metastore, hive-metastore
beneath
Beneath is a serverless real-time data platform ⚡️
Stars: ✭ 65 (+75.68%)
Mutual labels:  etl, data-engineering
pangeo-forge-recipes
Python library for building Pangeo Forge recipes.
Stars: ✭ 64 (+72.97%)
Mutual labels:  etl, data-engineering
AirflowETL
Blog post on ETL pipelines with Airflow
Stars: ✭ 20 (-45.95%)
Mutual labels:  etl, data-engineering
Benthos
Fancy stream processing made operationally mundane
Stars: ✭ 3,705 (+9913.51%)
Mutual labels:  etl, data-engineering
Aws Data Wrangler
Pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+6345.95%)
Mutual labels:  etl, data-engineering
blockchain-etl-streaming
Streaming Ethereum and Bitcoin blockchain data to Google Pub/Sub or Postgres in Kubernetes
Stars: ✭ 57 (+54.05%)
Mutual labels:  etl, data-engineering
uptasticsearch
An Elasticsearch client tailored to data science workflows.
Stars: ✭ 47 (+27.03%)
Mutual labels:  etl, data-engineering
Sayn
Data processing and modelling framework for automating tasks (incl. Python & SQL transformations).
Stars: ✭ 79 (+113.51%)
Mutual labels:  etl, data-engineering
Butterfree
A tool for building feature stores.
Stars: ✭ 126 (+240.54%)
Mutual labels:  etl, data-engineering
Aws Serverless Data Lake Framework
Enterprise-grade, production-hardened, serverless data lake on AWS
Stars: ✭ 179 (+383.78%)
Mutual labels:  etl, data-engineering
hamilton
A scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+1554.05%)
Mutual labels:  etl, data-engineering
Dataspherestudio
DataSphereStudio is a one-stop data application development & management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
Stars: ✭ 1,195 (+3129.73%)
Mutual labels:  hive, etl
Pyetl
Python ETL framework
Stars: ✭ 33 (-10.81%)
Mutual labels:  hive, etl
etl manager
A python package to create a database on the platform using our moj data warehousing framework
Stars: ✭ 14 (-62.16%)
Mutual labels:  etl, data-engineering
etl
[READ-ONLY] PHP - ETL (Extract Transform Load) data processing library
Stars: ✭ 279 (+654.05%)
Mutual labels:  etl, data-engineering
morph-kgc
Powerful RDF Knowledge Graph Generation with [R2]RML Mappings
Stars: ✭ 77 (+108.11%)
Mutual labels:  etl, data-engineering
gallia-core
A schema-aware Scala library for data transformation
Stars: ✭ 44 (+18.92%)
Mutual labels:  etl, data-engineering
Luigi Warehouse
A luigi powered analytics / warehouse stack
Stars: ✭ 72 (+94.59%)
Mutual labels:  hive, etl
Dataform
Dataform is a framework for managing SQL based data operations in BigQuery, Snowflake, and Redshift
Stars: ✭ 342 (+824.32%)
Mutual labels:  etl, data-engineering
Eel Sdk
Big Data Toolkit for the JVM
Stars: ✭ 140 (+278.38%)
Mutual labels:  hive, etl
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (+113.51%)
Mutual labels:  etl, data-engineering
web-click-flow
Offline log analysis of website clickstreams
Stars: ✭ 14 (-62.16%)
Mutual labels:  hive, etl
DaFlow
Apache Spark-based data flow (ETL) framework which supports multiple read and write destinations of different types and also supports multiple categories of transformation rules.
Stars: ✭ 24 (-35.14%)
Mutual labels:  hive, etl
Addax
Addax is an open source universal ETL tool that supports most of those RDBMS and NoSQLs on the planet, helping you transfer data from any one place to another.
Stars: ✭ 615 (+1562.16%)
Mutual labels:  hive, etl
versatile-data-kit
Versatile Data Kit (VDK) is an open source framework that enables anybody with basic SQL or Python knowledge to create their own data pipelines.
Stars: ✭ 144 (+289.19%)
Mutual labels:  etl, data-engineering
Wedatasphere
WeDataSphere is a financial level one-stop open-source suitcase for big data platforms. Currently the source code of Scriptis and Linkis has already been released to the open-source community. WeDataSphere, Big Data Made Easy!
Stars: ✭ 372 (+905.41%)
Mutual labels:  hive, etl
apiary
Apiary provides modules which can be combined to create a federated cloud data lake
Stars: ✭ 30 (-18.92%)
Mutual labels:  hive, hive-metastore
polygon-etl
ETL (extract, transform and load) tools for ingesting Polygon blockchain data to Google BigQuery and Pub/Sub
Stars: ✭ 53 (+43.24%)
Mutual labels:  etl, data-engineering
AirflowDataPipeline
Example of an ETL Pipeline using Airflow
Stars: ✭ 24 (-35.14%)
Mutual labels:  etl, data-engineering
Pyspark Example Project
Example project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+1610.81%)
Mutual labels:  etl, data-engineering
Airbyte
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+13194.59%)
Mutual labels:  etl, data-engineering
simple-ddl-parser
Simple DDL Parser to parse SQL (HQL, TSQL, AWS Redshift, BigQuery, Snowflake and other dialects) ddl files to json/python dict with full information about columns: types, defaults, primary keys, etc. & table properties, types, domains, etc.
Stars: ✭ 76 (+105.41%)
Mutual labels:  hive, ddls
arthur-redshift-etl
ELT Code for your Data Warehouse
Stars: ✭ 22 (-40.54%)
Mutual labels:  etl, data-engineering
DataX-src
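DataX is a widely used offline data synchronization tool/platform for heterogeneous data, providing efficient data synchronization between heterogeneous data sources including MySQL, Oracle, SqlServer, Postgre, HDFS, Hive, ADS, HBase, OTS, ODPS, and others.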
Stars: ✭ 21 (-43.24%)
Mutual labels:  hive, etl
qwery
A SQL-like language for performing ETL transformations.
Stars: ✭ 28 (-24.32%)
Mutual labels:  hive, etl
Datax
DataX is an open source universal ETL tool that supports Cassandra, ClickHouse, DBF, Hive, InfluxDB, Kudu, MySQL, Oracle, Presto (Trino), PostgreSQL, and SQL Server
Stars: ✭ 116 (+213.51%)
Mutual labels:  hive, etl
Avro Hadoop Starter
Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.
Stars: ✭ 110 (+197.3%)
Mutual labels:  hive
Hive
Lightweight and blazing fast key-value database written in pure Dart.
Stars: ✭ 2,681 (+7145.95%)
Mutual labels:  hive
Haproxy Configs
80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.
Stars: ✭ 106 (+186.49%)
Mutual labels:  hive
Php Thrift Sql
A PHP library for connecting to Hive or Impala over Thrift
Stars: ✭ 107 (+189.19%)
Mutual labels:  hive
vixtract
www.vixtract.ru
Stars: ✭ 40 (+8.11%)
Mutual labels:  etl
Xsql
Unified SQL Analytics Engine Based on SparkSQL
Stars: ✭ 176 (+375.68%)
Mutual labels:  hive
Ecency Mobile
Ecency Mobile - reimagined social blogging, contribute and get rewarded (for Android and iOS)
Stars: ✭ 103 (+178.38%)
Mutual labels:  hive
Pyhive
Python interface to Hive and Presto. 🐝
Stars: ✭ 1,378 (+3624.32%)
Mutual labels:  hive
Bigdata practice
Big data analysis and visualization in practice
Stars: ✭ 166 (+348.65%)
Mutual labels:  hive
Maha
A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
Stars: ✭ 101 (+172.97%)
Mutual labels:  hive
Esteem Surfer
Ecency desktop formerly known as Esteem Surfer - reimagined desktop social wallet, contribute and get rewarded (for Windows, Mac, Linux)
Stars: ✭ 100 (+170.27%)
Mutual labels:  hive
id3c
Data logistics system enabling real-time pathogen surveillance. Built for the Seattle Flu Study.
Stars: ✭ 21 (-43.24%)
Mutual labels:  etl
thain
Thain is a distributed flow schedule platform.
Stars: ✭ 81 (+118.92%)
Mutual labels:  etl
Bigdata docker
Big Data Ecosystem Docker
Stars: ✭ 161 (+335.14%)
Mutual labels:  hive
Springboot Templates
Integration of springboot with dubbo and netty; NoSQL templates for redis and mongodb; MQ templates for kafka, rocketmq, and rabbit; solr, solrcloud, and elasticsearch query engines
Stars: ✭ 100 (+170.27%)
Mutual labels:  hive
Bigdata Notes
A beginner's guide to big data ⭐
Stars: ✭ 10,991 (+29605.41%)
Mutual labels:  hive
Linkis
Linkis helps easily connect to various back-end computation/storage engines (Spark, Python, TiDB...), exposes various interfaces (REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,323 (+6178.38%)
Mutual labels:  hive
Bitalarm
An app to keep track of different cryptocurrencies, written in dart + flutter
Stars: ✭ 94 (+154.05%)
Mutual labels:  hive
Repository
Personal study knowledge base covering data warehouse modeling, real-time computing, big data, Java, algorithms, and more.
Stars: ✭ 92 (+148.65%)
Mutual labels:  hive
airflow-dbt-python
A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
Stars: ✭ 111 (+200%)
Mutual labels:  data-engineering
Presto
The official home of the Presto distributed SQL query engine for big data
Stars: ✭ 12,957 (+34918.92%)
Mutual labels:  hive
Wifi
Big data query and analysis system based on information captured over wifi
Stars: ✭ 93 (+151.35%)
Mutual labels:  hive