All Projects → josep2 → Jathena

josep2 / Jathena

Licence: other
Proof of concept open source Amazon Athena

Projects that are alternatives of or similar to Jathena

adminio-api
simple admin API for min.io (minio) s3 server
Stars: ✭ 24 (+26.32%)
Mutual labels:  minio
skbn
Copy files and directories between Kubernetes and cloud storage
Stars: ✭ 68 (+257.89%)
Mutual labels:  minio
mlflow-docker
Ready to run docker-compose configuration for ML Flow with Mysql and Minio S3
Stars: ✭ 146 (+668.42%)
Mutual labels:  minio
frame-backend
Frame API Build on JSON Web Tokens
Stars: ✭ 15 (-21.05%)
Mutual labels:  minio
boxball
Prebuilt Docker images with Retrosheet's complete baseball history data for many analytical frameworks. Includes Postgres, cstore_fdw, MySQL, SQLite, Clickhouse, Drill, Parquet, and CSV.
Stars: ✭ 79 (+315.79%)
Mutual labels:  apache-drill
zookeeper-k8s-openshift
Zookeeper docker container, ready for deployments on kubernetes and openshift
Stars: ✭ 22 (+15.79%)
Mutual labels:  apache-zookeeper
Upic
📤uPic is a native, powerful, beautiful and simple picture and file upload tool for macOS.
Stars: ✭ 2,465 (+12873.68%)
Mutual labels:  minio
e-shop
Sample Spring Cloud microservices e-shop.
Stars: ✭ 48 (+152.63%)
Mutual labels:  minio
Minio.AspNetCore
AspNetCore integration for Minio client
Stars: ✭ 77 (+305.26%)
Mutual labels:  minio
minio
Minio Object Storage in Kubernetes, used by Deis Workflow.
Stars: ✭ 51 (+168.42%)
Mutual labels:  minio
k3s-minio-deployment
Instructions and manifest files for deploying MinIO Object Storage on K3s.
Stars: ✭ 46 (+142.11%)
Mutual labels:  minio
mindav
A self-hosted file backup server which bridges WebDAV protocol with @minio written in @totoval. Webdav ❤️ Minio
Stars: ✭ 64 (+236.84%)
Mutual labels:  minio
kdp
Kubernetes deployment of PrestoDB, Hive Metastore, and Minio S3-standard object store
Stars: ✭ 15 (-21.05%)
Mutual labels:  minio
vagrant-codes-in-practice
Vagrant code files similar to the real bare metal environments
Stars: ✭ 22 (+15.79%)
Mutual labels:  minio
beihu-boot
企业级快速开发框架集,提供各种组件的模板方法包装,简化使用成本,供参考学习!
Stars: ✭ 32 (+68.42%)
Mutual labels:  minio
Drone Cache
A Drone plugin for caching current workspace files between builds to reduce your build times
Stars: ✭ 194 (+921.05%)
Mutual labels:  minio
mall4cloud
⭐️⭐️⭐️ Springcloud商城 O2O商城 小程序商城 PC商城 H5商城 APP商城 Java商城 分销商城 多用户商城 uniapp商城 微服务商城
Stars: ✭ 3,915 (+20505.26%)
Mutual labels:  minio
gosquito
gosquito ("go" + "mosquito") is a pluggable tool for data gathering, data processing and data transmitting to various destinations.
Stars: ✭ 25 (+31.58%)
Mutual labels:  minio
minio-dart
Unofficial MinIO Dart Client SDK that provides simple APIs to access any Amazon S3 compatible object storage server.
Stars: ✭ 42 (+121.05%)
Mutual labels:  minio
data-exploration-with-apache-drill
Data Exploration with Apache Drill
Stars: ✭ 25 (+31.58%)
Mutual labels:  apache-drill

Jathena

An Open Source Amazon Athena Proof of Concept

Parts

  • Apache Drill
  • Apache Zookeeper
  • Minio
  • Kubernetes

Installation

Prereqs:

Kubernetes

Ordered Installation

Install Minio

//Run these three commands (Thanks: https://github.com/kubernetes/examples/blob/master/staging/storage/minio/README.md)
kubectl create -f https://github.com/kubernetes/kubernetes/blob/master/examples/storage/minio/minio-standalone-pvc.yaml?raw=true
kubectl create -f https://github.com/kubernetes/kubernetes/blob/master/examples/storage/minio/minio-standalone-deployment.yaml?raw=true
kubectl create -f https://github.com/kubernetes/kubernetes/blob/master/examples/storage/minio/minio-standalone-service.yaml?raw=true

//You can also use the distributed mode found in the linked README above. 

Install Zookeeper

kubectl create -f zookeeper.yaml
// thanks https://github.com/vitillo/minipipe

Install Apache Drill

kubectl create -f drill.yaml

Final Steps

  1. Add the bank-data.csv file to Minio
  2. Edit and enable s3 in Apache Drill Admin Console
  3. Query File
  • To add the CSV to Minio, you need to port-forward your minio pod 9000:9000, create a bucket named "data" and add the bank-data.csv file from the repo.

  • To edit and enable s3. You must port forward your Drill pod 8047:8047 and add the s3.json config to the s3 storage service.

  • Use the Drillbit interface to query the file
select * from s3.`bank-data.csv`;

Bonus

You can get the JDBC or ODBC connection for Drill by following the intrusctions here.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].