Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)

Stars: ✭ 115 (-10.16%)

Mutual labels: big-data

Sigmf

The Signal Metadata Format Specification

Stars: ✭ 120 (-6.25%)

Mutual labels: big-data

Attic Predictionio Sdk Java

PredictionIO Java SDK

Stars: ✭ 107 (-16.41%)

Mutual labels: big-data

Feast

Feature Store for Machine Learning

Stars: ✭ 2,576 (+1912.5%)

Mutual labels: big-data

Cmak

CMAK is a tool for managing Apache Kafka clusters

Stars: ✭ 10,544 (+8137.5%)

Mutual labels: big-data

Hazelcast Nodejs Client

Hazelcast IMDG Node.js Client

Stars: ✭ 124 (-3.12%)

Mutual labels: big-data

Pythondata

repo for code published on pythondata.com

Stars: ✭ 113 (-11.72%)

Mutual labels: big-data

Asakusafw

Asakusa Framework

Stars: ✭ 114 (-10.94%)

Mutual labels: big-data

Scala Spark Tutorial

Project for James' Apache Spark with Scala course

Stars: ✭ 121 (-5.47%)

Mutual labels: big-data

Genie

Distributed Big Data Orchestration Service

Stars: ✭ 1,544 (+1106.25%)

Mutual labels: big-data

Richdem

High-performance Terrain and Hydrology Analysis

Stars: ✭ 127 (-0.78%)

Mutual labels: big-data

Spark R Notebooks

R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks

Stars: ✭ 109 (-14.84%)

Mutual labels: big-data

Hdfs Shell

HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS

Stars: ✭ 117 (-8.59%)

Mutual labels: big-data

Azuredatalake

Samples and Docs for Azure Data Lake Store and Analytics

Stars: ✭ 128 (+0%)

Mutual labels: big-data

Griffon Vm

Griffon Data Science Virtual Machine

Stars: ✭ 128 (+0%)

Mutual labels: big-data

Report

自动化配置报表平台。演示地址http://58.87.112.247/report 账号 visitor密码123456

Stars: ✭ 123 (-3.91%)

Mutual labels: big-data

View All Similar Projects ➔

Apache Tajo

Tajo is a relational and distributed data warehouse system for Hadoop. Tajo is designed for low-latency and scalable ad-hoc queries, online aggregation and ETL on large-data sets by leveraging advanced database techniques. It supports SQL standards. It has its own query engine which allows direct control of distributed execution and data flow. As a result, Tajo has a variety of query evaluation strategies and more optimization opportunities. In addition, Tajo will have a native columnar execution and and its optimizer.

Project

License

Apache License 2.0

Documents

Requirements

Java 1.8 or higher
Hadoop 2.3.0 or higher

Mailing lists

[email protected] - To discuss and ask general development issues.
[email protected] - To discuss and ask end-user questions/issues.
[email protected] - To see notifications made in the Tajo issue tracking system, review board, and Jenkins CI.
[email protected] - To monitor commits to the source repository.

To subscribe to the mailing lists, please send an email to:

${listname}[email protected]

For example, to subscribe to dev, send an email from your desired subscription address to:

[email protected]

and follow the instructions from there.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 128

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (34) 🔗