All Projects → tomwhite → Hadoop Book

tomwhite / Hadoop Book

Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White

Programming Languages

Makefile
30231 projects
java
68154 projects - #9 most used programming language
shell
77523 projects
scala
5932 projects
perl
6916 projects
python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Hadoop Book

Pragmaticai
[Book-2019] Pragmatic AI: An Introduction to Cloud-based Machine Learning
Stars: ✭ 79 (-97.62%)
Mutual labels:  makefile, book
Awesome Learning
实践源码库:https://github.com/jast90/bigdata 。 微信搜索Jast关注公众号,获取最新技术分享😯。
Stars: ✭ 197 (-94.06%)
Mutual labels:  hadoop, book
Istio Handbook
Istio Handbook - Istio Service Mesh Advanced Practical(Istio 服务网格进阶实战) - https://jimmysong.io/istio-handbook
Stars: ✭ 374 (-88.72%)
Mutual labels:  makefile, book
Learn machine learning
Road to Machine Learning
Stars: ✭ 81 (-97.56%)
Mutual labels:  makefile, hadoop
Yii2 Cookbook
Yii 2.0 Community Cookbook
Stars: ✭ 1,397 (-57.88%)
Mutual labels:  makefile, book
Pcl Cn
Practical Common Lisp (中英文对照版) - for Chinese only
Stars: ✭ 155 (-95.33%)
Mutual labels:  makefile, book
Book Open Source Tips
Open Source book on Open Source Tips
Stars: ✭ 152 (-95.42%)
Mutual labels:  makefile, book
Rust Async
Demystifying Asynchronous Rust
Stars: ✭ 184 (-94.45%)
Mutual labels:  makefile, book
Read Weekly
Think Outside The Box And Monkey Reading / 每周一书
Stars: ✭ 300 (-90.96%)
Mutual labels:  book
Little Javascript Book
Early draft for The Little JavaScript Book
Stars: ✭ 305 (-90.8%)
Mutual labels:  book
Elasticluster
Create clusters of VMs on the cloud and configure them with Ansible.
Stars: ✭ 298 (-91.02%)
Mutual labels:  hadoop
Kaobook
A LaTeX class for books, reports or theses based on https://github.com/kenohori/thesis and https://github.com/Tufte-LaTeX/tufte-latex.
Stars: ✭ 300 (-90.96%)
Mutual labels:  book
Archlinux Docker
Docker Base Image for Arch Linux (read-only mirror)
Stars: ✭ 304 (-90.84%)
Mutual labels:  makefile
Tbd
Source for TrunkBasedDevelopment.com
Stars: ✭ 299 (-90.99%)
Mutual labels:  book
Boost Cookbook
Online examples from "Boost C++ Application Development Cookbook":
Stars: ✭ 306 (-90.77%)
Mutual labels:  book
Trkdb
Stars: ✭ 299 (-90.99%)
Mutual labels:  makefile
Zero To Production
Code to follow along the "Zero To Production" book on API development in Rust.
Stars: ✭ 289 (-91.29%)
Mutual labels:  book
Task
A task runner / simpler Make alternative written in Go
Stars: ✭ 4,282 (+29.09%)
Mutual labels:  makefile
Spark Gotchas
Spark Gotchas. A subjective compilation of the Apache Spark tips and tricks
Stars: ✭ 308 (-90.71%)
Mutual labels:  book
Tucl
The first-ever paper on the Unix shell written by Ken Thompson in 1976 scanned, transcribed, and redistributed with permission
Stars: ✭ 303 (-90.87%)
Mutual labels:  makefile

Hadoop Book Example Code

This repository contains the example code for Hadoop: The Definitive Guide, Fourth Edition by Tom White (O'Reilly, 2014).

Code for the First, Second, and Third Editions is also available.

Note that the chapter names and numbering has changed between editions, see Chapter Numbers By Edition.

Building and Running

To build the code, you will first need to have installed Maven and Java. Then type

% mvn package -DskipTests

This will do a full build and create example JAR files in the top-level directory (e.g. hadoop-examples.jar).

To run the examples from a particular chapter, first install the component needed for the chapter (e.g. Hadoop, Pig, Hive, etc), then run the command lines shown in the chapter.

Sample datasets are provided in the input directory, but the full weather dataset is not contained there due to size restrictions. You can find information about how to obtain the full weather dataset on the book's website at [http://www.hadoopbook.com/] (http://www.hadoopbook.com/).

Hadoop Component Versions

This edition of the book works with Hadoop 2. It has not been tested extensively with Hadoop 1, although most of it should work.

For the precise versions of each component that the code has been tested with, see book/pom.xml.

Copyright

Copyright (C) 2014 Tom White

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].