Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)

Stars: ✭ 115 (-40.1%)

Mutual labels: big-data

Hazelcast Go Client

Hazelcast IMDG Go Client

Stars: ✭ 140 (-27.08%)

Mutual labels: big-data

Ambari

Mirror of Apache Ambari

Stars: ✭ 1,576 (+720.83%)

Mutual labels: big-data

Spark.jl

Julia binding for Apache Spark

Stars: ✭ 153 (-20.31%)

Mutual labels: big-data

Attic Predictionio Sdk Java

PredictionIO Java SDK

Stars: ✭ 107 (-44.27%)

Mutual labels: big-data

Accelerator

The Accelerator is a tool for fast and reproducible processing of large amounts of data.

Stars: ✭ 137 (-28.65%)

Mutual labels: big-data

Calcite Avatica

Mirror of Apache Calcite - Avatica

Stars: ✭ 130 (-32.29%)

Mutual labels: big-data

Maha

A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.

Stars: ✭ 101 (-47.4%)

Mutual labels: big-data

Spark With Python

Fundamentals of Spark with Python (using PySpark), code examples

Stars: ✭ 150 (-21.87%)

Mutual labels: big-data

Gaffer

A large-scale entity and relation database supporting aggregation of properties

Stars: ✭ 1,642 (+755.21%)

Mutual labels: big-data

Geopyspark

GeoTrellis for PySpark

Stars: ✭ 167 (-13.02%)

Mutual labels: big-data

Tajo

Mirror of Apache Tajo

Stars: ✭ 128 (-33.33%)

Mutual labels: big-data

100daysofmlcode

My journey to learn and grow in the domain of Machine Learning and Artificial Intelligence by performing the #100DaysofMLCode Challenge.

Stars: ✭ 146 (-23.96%)

Mutual labels: big-data

Feast

Feature Store for Machine Learning

Stars: ✭ 2,576 (+1241.67%)

Mutual labels: big-data

Bigdata Playground

A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL

Stars: ✭ 177 (-7.81%)

Mutual labels: big-data

Richdem

High-performance Terrain and Hydrology Analysis

Stars: ✭ 127 (-33.85%)

Mutual labels: big-data

Metamodel

Mirror of Apache Metamodel

Stars: ✭ 143 (-25.52%)

Mutual labels: big-data

Hazelcast Nodejs Client

Hazelcast IMDG Node.js Client

Stars: ✭ 124 (-35.42%)

Mutual labels: big-data

Fluo

Apache Fluo

Stars: ✭ 159 (-17.19%)

Mutual labels: big-data

Scala Spark Tutorial

Project for James' Apache Spark with Scala course

Stars: ✭ 121 (-36.98%)

Mutual labels: big-data

Big Data Study

🐳 big data study

Stars: ✭ 141 (-26.56%)

Mutual labels: big-data

Hdfs Shell

HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS

Stars: ✭ 117 (-39.06%)

Mutual labels: big-data

Presto Go Client

A Presto client for the Go programming language.

Stars: ✭ 183 (-4.69%)

Mutual labels: big-data

Cmak

CMAK is a tool for managing Apache Kafka clusters

Stars: ✭ 10,544 (+5391.67%)

Mutual labels: big-data

Eel Sdk

Big Data Toolkit for the JVM

Stars: ✭ 140 (-27.08%)

Mutual labels: big-data

Asakusafw

Asakusa Framework

Stars: ✭ 114 (-40.62%)

Mutual labels: big-data

Geni

A Clojure dataframe library that runs on Spark

Stars: ✭ 152 (-20.83%)

Mutual labels: big-data

Pythondata

repo for code published on pythondata.com

Stars: ✭ 113 (-41.15%)

Mutual labels: big-data

Sparkling Graph

SparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.

Stars: ✭ 139 (-27.6%)

Mutual labels: big-data

Genie

Distributed Big Data Orchestration Service

Stars: ✭ 1,544 (+704.17%)

Mutual labels: big-data

Keyvi

Keyvi - a key value index that powers Cliqz search engine. It is an in-memory FST-based data structure highly optimized for size and lookup performance.

Stars: ✭ 171 (-10.94%)

Mutual labels: big-data

Spark R Notebooks

R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks

Stars: ✭ 109 (-43.23%)

Mutual labels: big-data

Spark On Lambda

Apache Spark on AWS Lambda

Stars: ✭ 137 (-28.65%)

Mutual labels: big-data

Tennis Crystal Ball

Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction