Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → jehiah → gomrjob

jehiah / gomrjob

Licence: other

gomrjob - a Go Framework for Hadoop Map Reduce Jobs

Programming Languages

31211 projects - #10 most used programming language

Labels

hadoop mapreduce mrjob dataproc

Projects that are alternatives of or similar to gomrjob

learning-hadoop-and-spark

Companion to Learning Hadoop and Learning Spark courses on Linked In Learning

Stars: ✭ 146 (+274.36%)

Mutual labels: hadoop, mapreduce, dataproc

Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.

Stars: ✭ 286 (+633.33%)

Mutual labels: hadoop, mapreduce

A collection of tutorials on Hadoop, MapReduce, Spark, Docker

Stars: ✭ 34 (-12.82%)

Mutual labels: hadoop, mapreduce

💎🔥大数据学习笔记

Stars: ✭ 488 (+1151.28%)

Mutual labels: hadoop, mapreduce

大数据生态圈学习

Stars: ✭ 18 (-53.85%)

Mutual labels: hadoop, mapreduce

网站点击流离线日志分析

Stars: ✭ 14 (-64.1%)

Mutual labels: hadoop, mapreduce

Data Science Ipython Notebooks

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

Stars: ✭ 22,048 (+56433.33%)

Mutual labels: hadoop, mapreduce

Cascading is a feature rich API for defining and executing complex and fault tolerant data processing flows locally or on a cluster. See https://github.com/Cascading/cascading for the release repository.

Stars: ✭ 318 (+715.38%)

Mutual labels: hadoop, mapreduce

Data Algorithms Book

MapReduce, Spark, Java, and Scala for Data Algorithms Book

Stars: ✭ 949 (+2333.33%)

Mutual labels: hadoop, mapreduce

A light-weight distributed stream computing framework for Golang

Stars: ✭ 67 (+71.79%)

Mutual labels: hadoop, mapreduce

个人学习知识库涉及到数据仓库建模、实时计算、大数据、Java、算法等。

Stars: ✭ 92 (+135.9%)

Mutual labels: hadoop, mapreduce

Data-pipeline-project

Data pipeline project

Stars: ✭ 18 (-53.85%)

Mutual labels: hadoop, mapreduce

Avro Hadoop Starter

Example MapReduce jobs in Java, Hive, Pig, and Hadoop Streaming that work on Avro data.

Stars: ✭ 110 (+182.05%)

Mutual labels: hadoop, mapreduce

GooglePlay-Web-Crawler

Mapreduce project by Hadoop, Nutch, AWS EMR, Pig, Tez, Hive

Stars: ✭ 18 (-53.85%)

Mutual labels: hadoop, mapreduce

大数据学习笔记，学习路线，技术案例整理。

Stars: ✭ 37 (-5.13%)

Mutual labels: hadoop, mapreduce

Bigdata Interview

🎯 🌟[大数据面试题]分享自己在网络上收集的大数据相关的面试题以及自己的答案总结.目前包含Hadoop/Hive/Spark/Flink/Hbase/Kafka/Zookeeper框架的面试题知识总结

Stars: ✭ 857 (+2097.44%)

Mutual labels: hadoop, mapreduce

大数据入门指南 ⭐

Stars: ✭ 10,991 (+28082.05%)

Mutual labels: hadoop, mapreduce

Asakusa Framework

Stars: ✭ 114 (+192.31%)

Mutual labels: hadoop, mapreduce

RDMA accelerated, high-performance, scalable and efficient ShuffleManager plugin for Apache Spark

Stars: ✭ 215 (+451.28%)

Mutual labels: hadoop

kafka-connect-fs

Kafka Connect FileSystem Connector

Stars: ✭ 107 (+174.36%)

Mutual labels: hadoop

View All Similar Projects ➔

GoMRJob

A Go framework for running Map Reduce Jobs on Hadoop.

http://godoc.org/github.com/jehiah/gomrjob

Supported Configurations

Hadoop with HDFS via hadoop CLI
Google Cloud Dataproc with Google Storage

About

This framework has been in production use at Bitly since 2013, but it's light on examples.

See the example for more context.

Heavily inspired by Yelp/mrjob

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 39

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (1) 🔗