All Projects → dharmeshkakadia → presto-kubernetes

dharmeshkakadia / presto-kubernetes

Licence: other
Running Presto on k8s

Projects that are alternatives of or similar to presto-kubernetes

ML-DS-Guide
Complied Resources for learning Machine Learning & Data Science
Stars: ✭ 42 (+10.53%)
Mutual labels:  datascience
comet-for-mlflow
Comet-For-MLFlow Extension
Stars: ✭ 48 (+26.32%)
Mutual labels:  datascience
WikiChron
Data visualization tool for wikis evolution
Stars: ✭ 19 (-50%)
Mutual labels:  datascience
presto-chart
Highly configurable Helm Presto Chart
Stars: ✭ 23 (-39.47%)
Mutual labels:  presto
Statistical-Learning-using-R
This is a Statistical Learning application which will consist of various Machine Learning algorithms and their implementation in R done by me and their in depth interpretation.Documents and reports related to the below mentioned techniques can be found on my Rpubs profile.
Stars: ✭ 27 (-28.95%)
Mutual labels:  datascience
bayarea-2019-scikit-sprint
Bay Area WiMLDS scikit-learn open source sprint (Nov 2, 2019)
Stars: ✭ 16 (-57.89%)
Mutual labels:  datascience
xyr
Query any data source using SQL, works with the local filesystem, s3, and more. It should be a very tiny and lightweight alternative to AWS Athena, Presto ... etc.
Stars: ✭ 58 (+52.63%)
Mutual labels:  presto
metallicaRt
R package of colour palettes based on Metallica studio album covers.
Stars: ✭ 18 (-52.63%)
Mutual labels:  datascience
mlops-case-study
MLOps Case Study
Stars: ✭ 23 (-39.47%)
Mutual labels:  datascience
dpkb
大数据相关内容汇总,包括分布式存储引擎、分布式计算引擎、数仓建设等。关键词:Hadoop、HBase、ES、Kudu、Hive、Presto、Spark、Flink、Kylin、ClickHouse
Stars: ✭ 123 (+223.68%)
Mutual labels:  presto
blogpost codes
Repo of my blogpost articles codes
Stars: ✭ 41 (+7.89%)
Mutual labels:  datascience
dku-kaggle-class
단국대 SW중심대학 2020년도 오픈소스SW설계 - 캐글뽀개기 수업 일정 및 강의자료
Stars: ✭ 48 (+26.32%)
Mutual labels:  datascience
DGFraud-TF2
A Deep Graph-based Toolbox for Fraud Detection in TensorFlow 2.X
Stars: ✭ 84 (+121.05%)
Mutual labels:  datascience
assigner
Population assignment analysis using R
Stars: ✭ 17 (-55.26%)
Mutual labels:  datascience
TiBigData
TiDB connectors for Flink/Hive/Presto
Stars: ✭ 192 (+405.26%)
Mutual labels:  presto
schrutepy
The Entire Transcript from the Office in Tidy Format
Stars: ✭ 22 (-42.11%)
Mutual labels:  datascience
trino-query-formatter
Presto SQL query formatter
Stars: ✭ 16 (-57.89%)
Mutual labels:  presto
DataScienceTutorials.jl
A set of tutorials to show how to use Julia for data science (DataFrames, MLJ, ...)
Stars: ✭ 94 (+147.37%)
Mutual labels:  datascience
wildebeest
File processing pipelines
Stars: ✭ 86 (+126.32%)
Mutual labels:  datascience
data science portfolio
Portfolio of data science projects completed by me for academic, self learning, and hobby purposes.
Stars: ✭ 51 (+34.21%)
Mutual labels:  datascience

Presto-Kubernetes

Run Presto cluster on Kubernetes.

  1. Clone this.

    git clone https://github.com/dharmeshkakadia/presto-kubernetes/ && cd presto-kubernetes

  2. Start Coordinator.

    kubectl create -f coordinator-deployment.yaml 
    kubectl create -f presto-service.yaml
    
  3. Start Workers

    kubectl create -f worker-deployment.yaml

  4. Start using Presto. You can find out the address to connect using service URL.

    kubectl get service presto
    
    NAME      TYPE       CLUSTER-IP   EXTERNAL-IP   PORT(S)          AGE
    presto    NodePort   10.0.0.251   <none>        8080:30126/TCP   20h
    

    If you are using minikube, you can find out the address as follows

    minikube service presto --url
    
    http://192.168.64.4:30126
    

    Here is how you can use presto-cli to connect and start running queries. The presto UI is also available at the above URI.

    presto-cli --server http://192.168.64.4:30126 --catalog tpch --schema sf1

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].