All Projects → Hama → Similar Projects or Alternatives

369 Open source projects that are alternatives of or similar to Hama

Amazon S3 Find And Forget
Amazon S3 Find and Forget is a solution to handle data erasure requests from data lakes stored on Amazon S3, for example, pursuant to the European General Data Protection Regulation (GDPR)
Stars: ✭ 115 (-10.85%)
Mutual labels:  big-data
Docker Spark Cluster
A Spark cluster setup running on Docker containers
Stars: ✭ 57 (-55.81%)
Mutual labels:  big-data
Logisland
Scalable stream processing platform for advanced realtime analytics on top of Kafka and Spark. LogIsland also supports MQTT and Kafka Streams (Flink being in the roadmap). The platform does complex event processing and is suitable for time series analysis. A large set of valuable ready to use processors, data sources and sinks are available.
Stars: ✭ 97 (-24.81%)
Mutual labels:  big-data
Lifion Kinesis
A native Node.js producer and consumer library for Amazon Kinesis Data Streams
Stars: ✭ 54 (-58.14%)
Mutual labels:  big-data
Mobydq
🐳 Tool to automate data quality checks on data pipelines
Stars: ✭ 123 (-4.65%)
Mutual labels:  big-data
Oodt
Mirror of Apache OODT
Stars: ✭ 52 (-59.69%)
Mutual labels:  big-data
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+937.21%)
Mutual labels:  big-data
Trck
Query engine for TrailDB
Stars: ✭ 48 (-62.79%)
Mutual labels:  big-data
Just Dashboard
📊 📋 Dashboards using YAML or JSON files
Stars: ✭ 1,511 (+1071.32%)
Mutual labels:  big-data
Moosefs
MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)
Stars: ✭ 1,025 (+694.57%)
Mutual labels:  big-data
Reef
Mirror of Apache REEF
Stars: ✭ 92 (-28.68%)
Mutual labels:  big-data
Attaca
Robust, distributed version control for large files.
Stars: ✭ 41 (-68.22%)
Mutual labels:  big-data
Azuredatalake
Samples and Docs for Azure Data Lake Store and Analytics
Stars: ✭ 128 (-0.78%)
Mutual labels:  big-data
Analysispreservation.cern.ch
Source code for the CERN Analysis Preservation portal
Stars: ✭ 37 (-71.32%)
Mutual labels:  big-data
Bitcoin Value Predictor
[NOT MAINTAINED] Predicting Bit coin price using Time series analysis and sentiment analysis of tweets on bitcoin
Stars: ✭ 91 (-29.46%)
Mutual labels:  big-data
Metrics
Measure behavior of Java applications
Stars: ✭ 35 (-72.87%)
Mutual labels:  big-data
Ambari
Mirror of Apache Ambari
Stars: ✭ 1,576 (+1121.71%)
Mutual labels:  big-data
Skymap
High-throughput gene to knowledge mapping through massive integration of public sequencing data.
Stars: ✭ 29 (-77.52%)
Mutual labels:  big-data
Parquet Mr
Apache Parquet
Stars: ✭ 1,278 (+890.7%)
Mutual labels:  big-data
Awesome Scalability
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
Stars: ✭ 36,688 (+28340.31%)
Mutual labels:  big-data
Report
自动化配置报表平台。演示地址http://58.87.112.247/report 账号 visitor密码123456
Stars: ✭ 123 (-4.65%)
Mutual labels:  big-data
K8s Ingress Claim
An admission control policy that safeguards against accidental duplicate claiming of Hosts/Domains.
Stars: ✭ 14 (-89.15%)
Mutual labels:  big-data
Panoptes
A Global Scale Network Telemetry Ecosystem
Stars: ✭ 80 (-37.98%)
Mutual labels:  big-data
Dremio Oss
Dremio - the missing link in modern data
Stars: ✭ 862 (+568.22%)
Mutual labels:  big-data
Bigdataclass
Two-day workshop that covers how to use R to interact databases and Spark
Stars: ✭ 110 (-14.73%)
Mutual labels:  big-data
Accumulo
Apache Accumulo
Stars: ✭ 857 (+564.34%)
Mutual labels:  big-data
Iotdb
Apache IoTDB
Stars: ✭ 1,221 (+846.51%)
Mutual labels:  big-data
Dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+562.02%)
Mutual labels:  big-data
Couchdb Documentation
Apache CouchDB Documentation
Stars: ✭ 128 (-0.78%)
Mutual labels:  big-data
Pretzel
Javascript full-stack framework for Big Data visualisation and analysis
Stars: ✭ 26 (-79.84%)
Mutual labels:  big-data
Attic Predictionio Template Recommender
PredictionIO Recommendation Engine Template (Scala-based parallelized engine)
Stars: ✭ 78 (-39.53%)
Mutual labels:  big-data
Bandar Log
Monitoring tool to measure flow throughput of data sources and processing components that are part of Data Ingestion and ETL pipelines.
Stars: ✭ 19 (-85.27%)
Mutual labels:  big-data
Attic Predictionio Sdk Java
PredictionIO Java SDK
Stars: ✭ 107 (-17.05%)
Mutual labels:  big-data
Sqoop
Mirror of Apache Sqoop
Stars: ✭ 817 (+533.33%)
Mutual labels:  big-data
Cookbook
The Data Engineering Cookbook
Stars: ✭ 9,829 (+7519.38%)
Mutual labels:  big-data
Titanoboa
Titanoboa makes complex workflows easy. It is a low-code workflow orchestration platform for JVM - distributed, highly scalable and fault tolerant.
Stars: ✭ 787 (+510.08%)
Mutual labels:  big-data
Sigmf
The Signal Metadata Format Specification
Stars: ✭ 120 (-6.98%)
Mutual labels:  big-data
Storm
Mirror of Apache Storm
Stars: ✭ 6,297 (+4781.4%)
Mutual labels:  big-data
Bookkeeper
Apache Bookkeeper
Stars: ✭ 1,178 (+813.18%)
Mutual labels:  big-data
Cython
The most widely used Python to C compiler
Stars: ✭ 6,588 (+5006.98%)
Mutual labels:  big-data
Mysql perf analyzer
MySQL performance monitoring and analysis.
Stars: ✭ 1,423 (+1003.1%)
Mutual labels:  big-data
Samza
Mirror of Apache Samza
Stars: ✭ 676 (+424.03%)
Mutual labels:  big-data
Big Data Engineering Coursera Yandex
Big Data for Data Engineers Coursera Specialization from Yandex
Stars: ✭ 71 (-44.96%)
Mutual labels:  big-data
Sdc
Intel® Scalable Dataframe Compiler for Pandas*
Stars: ✭ 623 (+382.95%)
Mutual labels:  big-data
Griffon Vm
Griffon Data Science Virtual Machine
Stars: ✭ 128 (-0.78%)
Mutual labels:  big-data
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+4284.5%)
Mutual labels:  big-data
Countly Sdk Cordova
Countly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-46.51%)
Mutual labels:  big-data
Zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Stars: ✭ 5,513 (+4173.64%)
Mutual labels:  big-data
Vizuka
Explore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (-22.48%)
Mutual labels:  big-data
Scanner
Efficient video analysis at scale
Stars: ✭ 569 (+341.09%)
Mutual labels:  big-data
Hazelcast Cpp Client
Hazelcast IMDG C++ Client
Stars: ✭ 67 (-48.06%)
Mutual labels:  big-data
Nipype
Workflows and interfaces for neuroimaging packages
Stars: ✭ 557 (+331.78%)
Mutual labels:  big-data
Drill
Apache Drill is a distributed MPP query layer for self describing data
Stars: ✭ 1,619 (+1155.04%)
Mutual labels:  big-data
Rsparkling
RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)
Stars: ✭ 65 (-49.61%)
Mutual labels:  big-data
Gaffer
A large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (+1172.87%)
Mutual labels:  big-data
Tajo
Mirror of Apache Tajo
Stars: ✭ 128 (-0.78%)
Mutual labels:  big-data
Richdem
High-performance Terrain and Hydrology Analysis
Stars: ✭ 127 (-1.55%)
Mutual labels:  big-data
Cmak
CMAK is a tool for managing Apache Kafka clusters
Stars: ✭ 10,544 (+8073.64%)
Mutual labels:  big-data
Bigdata Notes
大数据入门指南 ⭐
Stars: ✭ 10,991 (+8420.16%)
Mutual labels:  big-data
Warp
Convert and analyze large data sets at light speed, on Mac and iOS.
Stars: ✭ 62 (-51.94%)
Mutual labels:  big-data
61-120 of 369 similar projects