apache / Hama

Licence: apache-2.0
Mirror of Apache Hama

Programming Languages

java

Apache Hama

Apache Hama is a framework for Big Data analytics that uses the Bulk Synchronous Parallel (BSP) computing model. It was established in 2012 as a Top-Level Project of The Apache Software Foundation.
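
To make the BSP model concrete, here is a minimal sketch in plain Java. It does not use the Hama API: the class name BspSketch, the method globalSum, and the thread-per-peer setup are all hypothetical and chosen only for illustration. Each peer performs local work, sends messages, waits at a barrier, and then reads the messages in the next superstep.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.CyclicBarrier;

// Illustrative only: simulates BSP peers with threads; this is NOT Hama's API.
public class BspSketch {

    // Every peer contributes one local value; after one barrier,
    // every peer knows the global sum.
    public static long[] globalSum(long[] localValues) {
        int n = localValues.length;
        // One inbox per peer. Messages sent in a superstep are read in the next one.
        List<ConcurrentLinkedQueue<Long>> inboxes = new ArrayList<>();
        for (int i = 0; i < n; i++) {
            inboxes.add(new ConcurrentLinkedQueue<>());
        }
        CyclicBarrier barrier = new CyclicBarrier(n);
        long[] results = new long[n];
        Thread[] peers = new Thread[n];

        for (int p = 0; p < n; p++) {
            final int id = p;
            peers[p] = new Thread(() -> {
                try {
                    // Superstep 1: local computation, then communication.
                    for (ConcurrentLinkedQueue<Long> inbox : inboxes) {
                        inbox.add(localValues[id]);
                    }
                    // Barrier synchronization: no peer continues until all have sent.
                    barrier.await();
                    // Superstep 2: consume the messages delivered by the barrier.
                    long sum = 0;
                    for (long value : inboxes.get(id)) {
                        sum += value;
                    }
                    results[id] = sum;
                } catch (Exception e) {
                    throw new IllegalStateException(e);
                }
            });
            peers[p].start();
        }
        try {
            for (Thread peer : peers) {
                peer.join();
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return results;
    }

    public static void main(String[] args) {
        // Four peers holding 1, 2, 3 and 4 each end up with the sum 10.
        System.out.println(Arrays.toString(globalSum(new long[] {1, 2, 3, 4})));
    }
}
```

The barrier is what separates one superstep from the next: messages become visible to their recipients only after every peer has finished sending.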

In addition to a pure BSP programming model, it provides a SQL-like query interface (Apache MRQL) and vertex-centric and neuron-centric programming models inspired by Google's Pregel and DistBelief (Apache Horn). For the latest information about Hama, please visit our website at https://hama.apache.org/ and our wiki at https://wiki.apache.org/hama/

Getting Started

Please refer to the Installation Guide in the online documentation for an overview of how to get started with Hama.

Run Examples

Hama provides an examples package that lets you quickly run examples on your Hama cluster. To run one of them, use % $HAMA_HOME/bin/hama jar hama-examples-x.x.x.jar. For example:

Download the Iris dataset, then run K-Means using:

% $HAMA_HOME/bin/hama jar hama-examples-x.x.x.jar kmeans /tmp/kmeans.txt /tmp/result 10 3
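
For readers unfamiliar with K-Means, the following single-machine sketch shows the basic algorithm (Lloyd's iteration) in plain Java. It is not Hama's distributed implementation: the class KMeansSketch, the method cluster, and the choice of the first k points as initial centers are hypothetical simplifications. The parameters k and maxIterations are named on the assumption that the trailing 3 and 10 in the command above are a cluster count and an iteration limit, which the text does not state.

```java
// Illustrative only: single-machine K-Means, not Hama's BSP-based example.
public class KMeansSketch {

    // Returns k cluster centers after at most maxIterations rounds.
    // Assumes points.length >= k and that all points share one dimension.
    public static double[][] cluster(double[][] points, int k, int maxIterations) {
        int dim = points[0].length;
        // Simplification: seed the centers with the first k points.
        double[][] centers = new double[k][];
        for (int c = 0; c < k; c++) {
            centers[c] = points[c].clone();
        }
        int[] assignment = new int[points.length];
        java.util.Arrays.fill(assignment, -1);

        for (int iter = 0; iter < maxIterations; iter++) {
            boolean changed = false;
            // Assignment step: attach each point to its nearest center.
            for (int i = 0; i < points.length; i++) {
                int best = 0;
                double bestDist = Double.MAX_VALUE;
                for (int c = 0; c < k; c++) {
                    double dist = 0;
                    for (int j = 0; j < dim; j++) {
                        double diff = points[i][j] - centers[c][j];
                        dist += diff * diff;
                    }
                    if (dist < bestDist) {
                        bestDist = dist;
                        best = c;
                    }
                }
                if (assignment[i] != best) {
                    assignment[i] = best;
                    changed = true;
                }
            }
            if (!changed) {
                break; // converged: no point switched clusters
            }
            // Update step: move each center to the mean of its points.
            double[][] sums = new double[k][dim];
            int[] counts = new int[k];
            for (int i = 0; i < points.length; i++) {
                counts[assignment[i]]++;
                for (int j = 0; j < dim; j++) {
                    sums[assignment[i]][j] += points[i][j];
                }
            }
            for (int c = 0; c < k; c++) {
                if (counts[c] > 0) {
                    for (int j = 0; j < dim; j++) {
                        centers[c][j] = sums[c][j] / counts[c];
                    }
                }
            }
        }
        return centers;
    }
}
```

In a BSP setting, the assignment step would typically run locally on each peer's share of the data, with partial sums exchanged between supersteps; the sketch above omits that distribution entirely.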

Getting Involved

Hama is an open source volunteer project under the Apache Software Foundation. We encourage you to learn about the project and contribute your expertise.
