All Projects → beekeeper → Similar Projects or Alternatives

917 Open source projects that are alternatives of or similar to beekeeper

waggle-dance
Hive federation service. Enables disparate tables to be concurrently accessed across multiple Hive deployments.
Stars: ✭ 194 (+351.16%)
Mutual labels:  hive, metastore, hive-metastore
hive-metastore-client
A client for connecting and running DDLs on hive metastore.
Stars: ✭ 37 (-13.95%)
Mutual labels:  hive, metastore, hive-metastore
Streamx
kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)
Stars: ✭ 96 (+123.26%)
Mutual labels:  big-data, s3
Cortx
CORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (+890.7%)
Mutual labels:  big-data, s3
Maha
A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
Stars: ✭ 101 (+134.88%)
Mutual labels:  big-data, hive
Bigdata Notes
大数据入门指南 ⭐
Stars: ✭ 10,991 (+25460.47%)
Mutual labels:  big-data, hive
Trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+10553.49%)
Mutual labels:  big-data, hive
Airflow Maintenance Dags
A series of DAGs/Workflows to help maintain the operation of Airflow
Stars: ✭ 914 (+2025.58%)
Mutual labels:  maintenance, cleanup
spark-acid
ACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (+111.63%)
Mutual labels:  big-data, hive
GooglePlay-Web-Crawler
Mapreduce project by Hadoop, Nutch, AWS EMR, Pig, Tez, Hive
Stars: ✭ 18 (-58.14%)
Mutual labels:  hive, s3
Hive
Apache Hive
Stars: ✭ 4,031 (+9274.42%)
Mutual labels:  big-data, hive
Cloud Volume
Read and write Neuroglancer datasets programmatically.
Stars: ✭ 63 (+46.51%)
Mutual labels:  big-data, s3
Drill
Apache Drill is a distributed MPP query layer for self describing data
Stars: ✭ 1,619 (+3665.12%)
Mutual labels:  big-data, hive
Amazon S3 Find And Forget
Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
Stars: ✭ 115 (+167.44%)
Mutual labels:  big-data, s3
qwery
A SQL-like language for performing ETL transformations.
Stars: ✭ 28 (-34.88%)
Mutual labels:  hive, s3
Eel Sdk
Big Data Toolkit for the JVM
Stars: ✭ 140 (+225.58%)
Mutual labels:  big-data, hive
Docker Registry Pruner
Tool to apply retention logic to docker images in a Docker Registry
Stars: ✭ 122 (+183.72%)
Mutual labels:  maintenance, cleanup
Ozone
Scalable, redundant, and distributed object store for Apache Hadoop
Stars: ✭ 330 (+667.44%)
Mutual labels:  big-data, s3
Presto
The official home of the Presto distributed SQL query engine for big data
Stars: ✭ 12,957 (+30032.56%)
Mutual labels:  big-data, hive
Docker Registry Manifest Cleanup
Cleans up docker registry by removing untagged manifests from the registry
Stars: ✭ 127 (+195.35%)
Mutual labels:  s3, cleanup
apiary
Apiary provides modules which can be combined to create a federated cloud data lake
Stars: ✭ 30 (-30.23%)
Mutual labels:  hive, hive-metastore
Dataengineeringproject
Example end to end data engineering project.
Stars: ✭ 82 (+90.7%)
Mutual labels:  big-data, s3
Helicalinsight
Helical Insight software is world’s first Open Source Business Intelligence framework which helps you to make sense out of your data and make well informed decisions.
Stars: ✭ 214 (+397.67%)
Mutual labels:  big-data, hive
nifi
Deploy a secured, clustered, auto-scaling NiFi service in AWS.
Stars: ✭ 37 (-13.95%)
Mutual labels:  big-data, s3
spark-records
Bulletproof Apache Spark jobs with fast root cause analysis of failures.
Stars: ✭ 67 (+55.81%)
Mutual labels:  big-data
terraform-aws-sftp
This terraform module is used to create sftp on AWS for S3.
Stars: ✭ 20 (-53.49%)
Mutual labels:  s3
common-datax
基于DataX的通用数据同步微服务,一个Restful接口搞定所有通用数据同步
Stars: ✭ 51 (+18.6%)
Mutual labels:  hive
hiveql-parser
HiveQL Parser. Parse HiveQL code and print AST in JSON format if success, else print well formed syntax error message.
Stars: ✭ 25 (-41.86%)
Mutual labels:  hive
azure-big-data-starter
A boilerplate project for Azure Big Data PaaS services
Stars: ✭ 13 (-69.77%)
Mutual labels:  big-data
go-localstack
Go Wrapper for using localstack
Stars: ✭ 56 (+30.23%)
Mutual labels:  s3
silly-android
Android plugins for Java, making core Android APIs easy to use
Stars: ✭ 40 (-6.98%)
Mutual labels:  cleanup
commentator
A simple commenting system for your blog.
Stars: ✭ 29 (-32.56%)
Mutual labels:  s3
CS Book
🔥 Latest computer science e-books。提供最新技术类电子书下载, “我无非就是想卷死各位,或者被各位卷死!”
Stars: ✭ 40 (-6.98%)
Mutual labels:  big-data
scarf
Toolkit for highly memory efficient analysis of single-cell RNA-Seq, scATAC-Seq and CITE-Seq data. Analyze atlas scale datasets with millions of cells on laptop.
Stars: ✭ 54 (+25.58%)
Mutual labels:  big-data
BigInsights-on-Apache-Hadoop
Example projects for 'BigInsights for Apache Hadoop' on IBM Bluemix
Stars: ✭ 21 (-51.16%)
Mutual labels:  hive
databricks-dbapi
DBAPI and SQLAlchemy dialect for Databricks Workspace and SQL Analytics clusters
Stars: ✭ 21 (-51.16%)
Mutual labels:  hive
mining-camp
Easy automated configuration and deployment of Minecraft servers on AWS spot instances, featuring automatic backups and restoration using S3.
Stars: ✭ 43 (+0%)
Mutual labels:  s3
simple-ddl-parser
Simple DDL Parser to parse SQL (HQL, TSQL, AWS Redshift, BigQuery, Snowflake and other dialects) ddl files to json/python dict with full information about columns: types, defaults, primary keys, etc. & table properties, types, domains, etc.
Stars: ✭ 76 (+76.74%)
Mutual labels:  hive
mlflow-docker
Ready to run docker-compose configuration for ML Flow with Mysql and Minio S3
Stars: ✭ 146 (+239.53%)
Mutual labels:  s3
RemoteShuffleService
Celeborn provides an elastic and high-performance service for shuffle and spilled data.
Stars: ✭ 262 (+509.3%)
Mutual labels:  big-data
terraform-modules
Terraform Modules by Peak
Stars: ✭ 16 (-62.79%)
Mutual labels:  s3
IoT-system-PLC-data-to-InfluxDB
This project aim is to provide free software to fetch data from plcs (Siemens S7-300/400/1200/1500) and store it. Used stack is completly opensource. I used InfluDB as data storage, so application principle is following Big Data paradigm.
Stars: ✭ 26 (-39.53%)
Mutual labels:  big-data
awesome-hive
A curated list of awesome Hive resources.
Stars: ✭ 20 (-53.49%)
Mutual labels:  hive
react-relay-appsync
AppSync for Relay
Stars: ✭ 19 (-55.81%)
Mutual labels:  s3
datajoint-python
Relational data pipelines for the science lab
Stars: ✭ 140 (+225.58%)
Mutual labels:  s3
athena-sqlite
A SQLite driver for S3 and Amazon Athena 😳
Stars: ✭ 82 (+90.7%)
Mutual labels:  s3
terraform-aws-kinesis-firehose
This code creates a Kinesis Firehose in AWS to send CloudWatch log data to S3.
Stars: ✭ 25 (-41.86%)
Mutual labels:  big-data
minio-dart
Unofficial MinIO Dart Client SDK that provides simple APIs to access any Amazon S3 compatible object storage server.
Stars: ✭ 42 (-2.33%)
Mutual labels:  s3
terraform-s3-user
A Terraform module that creates a tagged S3 bucket and an IAM user/key with access to the bucket
Stars: ✭ 20 (-53.49%)
Mutual labels:  s3
s3cr3t
A supercharged S3 reverse proxy
Stars: ✭ 55 (+27.91%)
Mutual labels:  s3
sparkucx
A high-performance, scalable and efficient ShuffleManager plugin for Apache Spark, utilizing UCX communication layer
Stars: ✭ 32 (-25.58%)
Mutual labels:  big-data
hadoopoffice
HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (+30.23%)
Mutual labels:  hive
airavata-php-gateway
Mirror of Apache Airavata PHP Gateway
Stars: ✭ 15 (-65.12%)
Mutual labels:  big-data
data-profiling
a set of scripts to pull meta data and data profiling metrics from relational database systems
Stars: ✭ 57 (+32.56%)
Mutual labels:  hive
django-s3file
A lightweight file upload input for Django and Amazon S3
Stars: ✭ 66 (+53.49%)
Mutual labels:  s3
spark-root
Apache Spark Data Source for ROOT File Format
Stars: ✭ 28 (-34.88%)
Mutual labels:  big-data
s3-practical-guide
A practical guide for Sociocracy 3.0.
Stars: ✭ 56 (+30.23%)
Mutual labels:  s3
dxram
A distributed in-memory key-value storage for billions of small objects.
Stars: ✭ 25 (-41.86%)
Mutual labels:  big-data
react-native-appsync-s3
React Native app for image uploads to S3 and storing their records in Amazon DynamoDB using AWS Amplify and AppSync SDK
Stars: ✭ 18 (-58.14%)
Mutual labels:  s3
pysorter
A command line utility for organizing files and directories according to regex patterns.
Stars: ✭ 40 (-6.98%)
Mutual labels:  cleanup
1-60 of 917 similar projects