All Projects → cwensel → Cascading.memcached

cwensel / Cascading.memcached

Memecached/Membase/ElasticSearch integration for Cascading

Programming Languages

java
68154 projects - #9 most used programming language

FYI, this is a work in progress (wip) branch, and only works with Cascading wip-1.2.

Welcome

This is the Cascading.Memcached module.

It provides support for writing data to a memcached protocol compatible cluster when bound to a Cascading data processing flow.

Current tested clusters are Memcached, MemBase, and ElasticSearch.

Cascading is a feature rich API for defining and executing complex, scale-free, and fault tolerant data processing workflows on a Hadoop cluster. It can be found at the following location:

http://www.cascading.org/

Building

This release requires at least Cascading 1.2.x. Hadoop 0.19.x or later, and a cluster that supports the memcached text or binary protocols.

To build a jar,

ant -Dcascading.home=... -Dhadoop.home=... jar

To test (requires a localhost memcached cluster),

ant -Dcascading.home=... -Dhadoop.home=... test

where "..." is the install path of each of the dependencies.

Using

The cascading-memcached.jar file should be added to the "lib" directory of your Hadoop application jar file along with all Cascading dependencies.

Notes:

Currently ElasticSearch has a few memcached protocol related bugs that should be available in a future public release. Even with these fixes, only the text protocol is supported.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].