All Projects → apache → Tajo

apache / Tajo

Licence: apache-2.0
Mirror of Apache Tajo

Programming Languages

java
68154 projects - #9 most used programming language

Labels

Projects that are alternatives of or similar to Tajo

Bigdataclass
Two-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-14.06%)
Mutual labels:  big-data
Drill
Apache Drill is a distributed MPP query layer for self describing data
Stars: ✭ 1,619 (+1164.84%)
Mutual labels:  big-data
Mobydq
🐳 Tool to automate data quality checks on data pipelines
Stars: ✭ 123 (-3.91%)
Mutual labels:  big-data
Ambari
Mirror of Apache Ambari
Stars: ✭ 1,576 (+1131.25%)
Mutual labels:  big-data
Amazon S3 Find And Forget
Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
Stars: ✭ 115 (-10.16%)
Mutual labels:  big-data
Sigmf
The Signal Metadata Format Specification
Stars: ✭ 120 (-6.25%)
Mutual labels:  big-data
Attic Predictionio Sdk Java
PredictionIO Java SDK
Stars: ✭ 107 (-16.41%)
Mutual labels:  big-data
Feast
Feature Store for Machine Learning
Stars: ✭ 2,576 (+1912.5%)
Mutual labels:  big-data
Cmak
CMAK is a tool for managing Apache Kafka clusters
Stars: ✭ 10,544 (+8137.5%)
Mutual labels:  big-data
Hazelcast Nodejs Client
Hazelcast IMDG Node.js Client
Stars: ✭ 124 (-3.12%)
Mutual labels:  big-data
Pythondata
repo for code published on pythondata.com
Stars: ✭ 113 (-11.72%)
Mutual labels:  big-data
Asakusafw
Asakusa Framework
Stars: ✭ 114 (-10.94%)
Mutual labels:  big-data
Scala Spark Tutorial
Project for James' Apache Spark with Scala course
Stars: ✭ 121 (-5.47%)
Mutual labels:  big-data
Genie
Distributed Big Data Orchestration Service
Stars: ✭ 1,544 (+1106.25%)
Mutual labels:  big-data
Richdem
High-performance Terrain and Hydrology Analysis
Stars: ✭ 127 (-0.78%)
Mutual labels:  big-data
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-14.84%)
Mutual labels:  big-data
Hdfs Shell
HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS
Stars: ✭ 117 (-8.59%)
Mutual labels:  big-data
Azuredatalake
Samples and Docs for Azure Data Lake Store and Analytics
Stars: ✭ 128 (+0%)
Mutual labels:  big-data
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (+0%)
Mutual labels:  big-data
Report
自动化配置报表平台。演示地址http://58.87.112.247/report 账号 visitor密码123456
Stars: ✭ 123 (-3.91%)
Mutual labels:  big-data

Apache Tajo

Tajo is a relational and distributed data warehouse system for Hadoop. Tajo is designed for low-latency and scalable ad-hoc queries, online aggregation and ETL on large-data sets by leveraging advanced database techniques. It supports SQL standards. It has its own query engine which allows direct control of distributed execution and data flow. As a result, Tajo has a variety of query evaluation strategies and more optimization opportunities. In addition, Tajo will have a native columnar execution and and its optimizer.

Project

License

Documents

Requirements

  • Java 1.8 or higher
  • Hadoop 2.3.0 or higher

Mailing lists

To subscribe to the mailing lists, please send an email to:

${listname}[email protected]

For example, to subscribe to dev, send an email from your desired subscription address to:

[email protected]

and follow the instructions from there.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].