Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)

Stars: ✭ 115 (+144.68%)

Mutual labels: big-data

Presto

The official home of the Presto distributed SQL query engine for big data

Stars: ✭ 12,957 (+27468.09%)

Mutual labels: big-data

Just Dashboard

📊 📋 Dashboards using YAML or JSON files

Stars: ✭ 1,511 (+3114.89%)

Mutual labels: big-data

Clickhouse

ClickHouse® is a free analytics DBMS for big data

Stars: ✭ 21,089 (+44770.21%)

Mutual labels: big-data

Ambari

Mirror of Apache Ambari

Stars: ✭ 1,576 (+3253.19%)

Mutual labels: big-data

Spark.jl

Julia binding for Apache Spark

Stars: ✭ 153 (+225.53%)

Mutual labels: big-data

Bigdataclass

Two-day workshop that covers how to use R to interact databases and Spark

Stars: ✭ 110 (+134.04%)

Mutual labels: big-data

Awkward 0.x

Manipulate arrays of complex data structures as easily as Numpy.

Stars: ✭ 216 (+359.57%)

Mutual labels: big-data

Attic Predictionio Sdk Java

PredictionIO Java SDK

Stars: ✭ 107 (+127.66%)

Mutual labels: big-data

Fili

Easily make RESTful web services for time series reporting with Big Data analytics engines like Druid and SQL Databases.

Stars: ✭ 151 (+221.28%)

Mutual labels: big-data

Smart Array To Tree

Convert large amounts of data array to tree fastly

Stars: ✭ 91 (+93.62%)

Mutual labels: big-data

Vizuka

Explore high-dimensional datasets and how your algo handles specific regions.

Stars: ✭ 100 (+112.77%)

Mutual labels: big-data

Couchdb Docker

Semi-official Apache CouchDB Docker images

Stars: ✭ 194 (+312.77%)

Mutual labels: big-data

Eel Sdk

Big Data Toolkit for the JVM

Stars: ✭ 140 (+197.87%)

Mutual labels: big-data

Dataengineeringproject

Example end to end data engineering project.

Stars: ✭ 82 (+74.47%)

Mutual labels: big-data

Samza Hello Samza

Mirror of Apache Samza

Stars: ✭ 99 (+110.64%)

Mutual labels: big-data

Helicalinsight

Helical Insight software is world’s first Open Source Business Intelligence framework which helps you to make sense out of your data and make well informed decisions.

Stars: ✭ 214 (+355.32%)

Mutual labels: big-data

Kudu

Mirror of Apache Kudu

Stars: ✭ 1,360 (+2793.62%)

Mutual labels: big-data

Logisland

Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.

Stars: ✭ 97 (+106.38%)

Mutual labels: big-data

predictionio-template-recommender

PredictionIO Recommendation Engine Template (Scala-based parallelized engine)

Stars: ✭ 80 (+70.21%)

Mutual labels: big-data

Spark Py Notebooks

Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks

Stars: ✭ 1,338 (+2746.81%)

Mutual labels: big-data

Storm Doc Zh

Apache Storm 官方文档中文版

Stars: ✭ 142 (+202.13%)

Mutual labels: big-data

Reef

Mirror of Apache REEF

Stars: ✭ 92 (+95.74%)

Mutual labels: big-data

Attic Predictionio Sdk Python

PredictionIO Python SDK

Stars: ✭ 196 (+317.02%)

Mutual labels: big-data

Bitcoin Value Predictor

[NOT MAINTAINED] Predicting Bit coin price using Time series analysis and sentiment analysis of tweets on bitcoin

Stars: ✭ 91 (+93.62%)

Mutual labels: big-data

Belajarpython.com

Open Source Indonesian Python Programming Tutorial Site

Stars: ✭ 141 (+200%)

Mutual labels: big-data

Parquet Mr

Apache Parquet

Stars: ✭ 1,278 (+2619.15%)

Mutual labels: big-data

Kafka Ui

Open-Source Web GUI for Apache Kafka Management

Stars: ✭ 230 (+389.36%)

Mutual labels: big-data

Panoptes

A Global Scale Network Telemetry Ecosystem

Stars: ✭ 80 (+70.21%)

Mutual labels: big-data

Hazelcast Go Client

Hazelcast IMDG Go Client

Stars: ✭ 140 (+197.87%)

Mutual labels: big-data

Uproot4

ROOT I/O in pure Python and NumPy.

Stars: ✭ 80 (+70.21%)

Mutual labels: big-data

Iotdb

Apache IoTDB

Stars: ✭ 1,221 (+2497.87%)

Mutual labels: big-data

Data Science Live Book

An open source book to learn data science, data analysis and machine learning, suitable for all ages!

Stars: ✭ 193 (+310.64%)

Mutual labels: big-data

Sparkling Graph

SparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.

Stars: ✭ 139 (+195.74%)

Mutual labels: big-data

Setl

A simple Spark-powered ETL framework that just works 🍺

Stars: ✭ 79 (+68.09%)

Mutual labels: big-data

Attic Predictionio Template Recommender

PredictionIO Recommendation Engine Template (Scala-based parallelized engine)

Stars: ✭ 78 (+65.96%)

Mutual labels: big-data

Poseidon

A search engine which can hold 100 trillion lines of log data.

Stars: ✭ 1,793 (+3714.89%)

Mutual labels: big-data

Spark Website

Apache Spark Website

Stars: ✭ 75 (+59.57%)

Mutual labels: big-data

Vue Virtual Scroll List

⚡️A vue component support big amount data list with high render performance and efficient.

Stars: ✭ 3,201 (+6710.64%)

Mutual labels: big-data

Selinon

An advanced distributed task flow management on top of Celery

Stars: ✭ 237 (+404.26%)

Mutual labels: big-data

Attic Predictionio Sdk Ruby

PredictionIO Ruby SDK

Stars: ✭ 192 (+308.51%)

Mutual labels: big-data

Labs

Research on distributed system

Stars: ✭ 73 (+55.32%)

Mutual labels: big-data

Bookkeeper

Apache Bookkeeper

Stars: ✭ 1,178 (+2406.38%)

Mutual labels: big-data

My Journey In The Data Science World

📢 Ready to learn or review your knowledge!

Stars: ✭ 1,175 (+2400%)

Mutual labels: big-data

Accelerator

The Accelerator is a tool for fast and reproducible processing of large amounts of data.

Stars: ✭ 137 (+191.49%)

Mutual labels: big-data

Big Data Engineering Coursera Yandex

Big Data for Data Engineers Coursera Specialization from Yandex

Stars: ✭ 71 (+51.06%)

Mutual labels: big-data

Appdocs

Application Performance Optimization Summary

Stars: ✭ 1,169 (+2387.23%)

Mutual labels: big-data

Gun

An open source cybersecurity protocol for syncing decentralized graph data.

Stars: ✭ 15,172 (+32180.85%)

Mutual labels: big-data

Attic Apex Malhar

Mirror of Apache Apex malhar

Stars: ✭ 131 (+178.72%)

Mutual labels: big-data

Countly Sdk Cordova

Countly Product Analytics SDK for Cordova, Icenium and Phonegap

Stars: ✭ 69 (+46.81%)

Mutual labels: big-data

61-120 of 579 similar projects

‹

›

next*5