369 open source projects that are alternatives to or similar to Hama

Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)

Stars: ✭ 115 (-10.85%)

Mutual labels: big-data

Docker Spark Cluster

A Spark cluster setup running on Docker containers

Stars: ✭ 57 (-55.81%)

Mutual labels: big-data

Logisland

Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink is on the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of ready-to-use processors, data sources and sinks is available.

Stars: ✭ 97 (-24.81%)

Mutual labels: big-data

Lifion Kinesis

A native Node.js producer and consumer library for Amazon Kinesis Data Streams

Stars: ✭ 54 (-58.14%)

Mutual labels: big-data

Mobydq

🐳 Tool to automate data quality checks on data pipelines

Stars: ✭ 123 (-4.65%)

Mutual labels: big-data

Oodt

Mirror of Apache OODT

Stars: ✭ 52 (-59.69%)

Mutual labels: big-data

Spark Py Notebooks

Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks

Stars: ✭ 1,338 (+937.21%)

Mutual labels: big-data

Trck

Query engine for TrailDB

Stars: ✭ 48 (-62.79%)

Mutual labels: big-data

Just Dashboard

📊 📋 Dashboards using YAML or JSON files

Stars: ✭ 1,511 (+1071.32%)

Mutual labels: big-data

Moosefs

MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)

Stars: ✭ 1,025 (+694.57%)

Mutual labels: big-data

Reef

Mirror of Apache REEF

Stars: ✭ 92 (-28.68%)

Mutual labels: big-data

Attaca

Robust, distributed version control for large files.

Stars: ✭ 41 (-68.22%)

Mutual labels: big-data

Azuredatalake

Samples and Docs for Azure Data Lake Store and Analytics

Stars: ✭ 128 (-0.78%)

Mutual labels: big-data

Analysispreservation.cern.ch

Source code for the CERN Analysis Preservation portal

Stars: ✭ 37 (-71.32%)

Mutual labels: big-data

Bitcoin Value Predictor

[NOT MAINTAINED] Predicting Bitcoin price using time series analysis and sentiment analysis of tweets about Bitcoin

Stars: ✭ 91 (-29.46%)

Mutual labels: big-data

Metrics

Measure behavior of Java applications

Stars: ✭ 35 (-72.87%)

Mutual labels: big-data

Ambari

Mirror of Apache Ambari

Stars: ✭ 1,576 (+1121.71%)

Mutual labels: big-data

Skymap

High-throughput gene to knowledge mapping through massive integration of public sequencing data.

Stars: ✭ 29 (-77.52%)

Mutual labels: big-data

Parquet Mr

Apache Parquet

Stars: ✭ 1,278 (+890.7%)

Mutual labels: big-data

Awesome Scalability

The Patterns of Scalable, Reliable, and Performant Large-Scale Systems

Stars: ✭ 36,688 (+28340.31%)

Mutual labels: big-data

Report

Automated report configuration platform. Demo: http://58.87.112.247/report (account: visitor, password: 123456)

Stars: ✭ 123 (-4.65%)

Mutual labels: big-data

K8s Ingress Claim

An admission control policy that safeguards against accidental duplicate claiming of Hosts/Domains.

Stars: ✭ 14 (-89.15%)

Mutual labels: big-data

Panoptes

A Global Scale Network Telemetry Ecosystem

Stars: ✭ 80 (-37.98%)

Mutual labels: big-data

Dremio Oss

Dremio - the missing link in modern data

Stars: ✭ 862 (+568.22%)

Mutual labels: big-data

Bigdataclass

Two-day workshop that covers how to use R to interact with databases and Spark

Stars: ✭ 110 (-14.73%)

Mutual labels: big-data

Accumulo

Apache Accumulo

Stars: ✭ 857 (+564.34%)

Mutual labels: big-data

Iotdb

Apache IoTDB

Stars: ✭ 1,221 (+846.51%)

Mutual labels: big-data

Dataflowjavasdk

Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.

Stars: ✭ 854 (+562.02%)

Mutual labels: big-data

Couchdb Documentation

Apache CouchDB Documentation

Stars: ✭ 128 (-0.78%)

Mutual labels: big-data

Pretzel

JavaScript full-stack framework for Big Data visualisation and analysis

Stars: ✭ 26 (-79.84%)

Mutual labels: big-data

Attic Predictionio Template Recommender

PredictionIO Recommendation Engine Template (Scala-based parallelized engine)

Stars: ✭ 78 (-39.53%)

Mutual labels: big-data

Bandar Log

Monitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.

Stars: ✭ 19 (-85.27%)

Mutual labels: big-data

Attic Predictionio Sdk Java

PredictionIO Java SDK

Stars: ✭ 107 (-17.05%)

Mutual labels: big-data

Sqoop

Mirror of Apache Sqoop

Stars: ✭ 817 (+533.33%)

Mutual labels: big-data

Cookbook

The Data Engineering Cookbook

Stars: ✭ 9,829 (+7519.38%)

Mutual labels: big-data

Titanoboa

Titanoboa makes complex workflows easy. It is a low-code workflow orchestration platform for JVM - distributed, highly scalable and fault tolerant.

Stars: ✭ 787 (+510.08%)

Mutual labels: big-data

Sigmf

The Signal Metadata Format Specification

Stars: ✭ 120 (-6.98%)

Mutual labels: big-data

Storm

Mirror of Apache Storm

Stars: ✭ 6,297 (+4781.4%)

Mutual labels: big-data

Bookkeeper

Apache Bookkeeper

Stars: ✭ 1,178 (+813.18%)

Mutual labels: big-data

Cython

The most widely used Python to C compiler

Stars: ✭ 6,588 (+5006.98%)

Mutual labels: big-data

Mysql perf analyzer

MySQL performance monitoring and analysis.

Stars: ✭ 1,423 (+1003.1%)

Mutual labels: big-data

Samza

Mirror of Apache Samza

Stars: ✭ 676 (+424.03%)

Mutual labels: big-data

Big Data Engineering Coursera Yandex

Big Data for Data Engineers Coursera Specialization from Yandex

Stars: ✭ 71 (-44.96%)

Mutual labels: big-data

Sdc

Intel® Scalable Dataframe Compiler for Pandas*

Stars: ✭ 623 (+382.95%)

Mutual labels: big-data

Griffon Vm

Griffon Data Science Virtual Machine

Stars: ✭ 128 (-0.78%)

Mutual labels: big-data

H2o 3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

Stars: ✭ 5,656 (+4284.5%)

Mutual labels: big-data

Countly Sdk Cordova

Countly Product Analytics SDK for Cordova, Icenium and Phonegap

Stars: ✭ 69 (-46.51%)

Mutual labels: big-data

Zeppelin

Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.

Stars: ✭ 5,513 (+4173.64%)

Mutual labels: big-data

Vizuka

Explore high-dimensional datasets and how your algo handles specific regions.