Monthly Archives: September 2008

Goodbye MapReduce, Hello Cascading

We have been doing a lot of batch processing with Hadoop MapReduce lately, and we quickly realized how painful it can be to write MapReduce jobs by hand. Some parts of our workflow require up to TEN MapReduce jobs to execute in sequence, which means a lot of hand-coordination of intermediate data and execution order. Additionally, [...]
Posted in Cascading, Hadoop, MapReduce | 17 Comments