cwensel / Cascading.memcached
Programming Languages
FYI, this is a work in progress (wip) branch, and only works with Cascading wip-1.2.
Welcome
This is the Cascading.Memcached module.
It provides support for writing data to a memcached protocol compatible cluster when bound to a Cascading data processing flow.
Current tested clusters are Memcached, MemBase, and ElasticSearch.
Cascading is a feature rich API for defining and executing complex, scale-free, and fault tolerant data processing workflows on a Hadoop cluster. It can be found at the following location:
Building
This release requires at least Cascading 1.2.x. Hadoop 0.19.x or later, and a cluster that supports the memcached text or binary protocols.
To build a jar,
ant -Dcascading.home=... -Dhadoop.home=... jar
To test (requires a localhost memcached cluster),
ant -Dcascading.home=... -Dhadoop.home=... test
where "..." is the install path of each of the dependencies.
Using
The cascading-memcached.jar file should be added to the "lib" directory of your Hadoop application jar file along with all Cascading dependencies.
Notes:
Currently ElasticSearch has a few memcached protocol related bugs that should be available in a future public release. Even with these fixes, only the text protocol is supported.