Tag Archives: Cascading

Pseudo-Combiners in Cascading

In order to get maximum performance from MapReduce, you need to minimize the amount of data that you have to transfer around the network.  If nearly your entire input must be transferred from your mappers to your reducers, then you’ll be putting a great deal of stress on your disks and network.  One thing that [...]

Posted in Cascading | Tagged | Leave a comment

Accelerate your test suite with Cascading 1.1

One big downside of using Cascading for our applications has been the runtime of our regression test suite. We test with quantities of data nowhere near our regular production volume, but we still end up running lots of jobs. In our experience, this ends up making our tests take a long time (in the tens [...]

Posted in Cascading | Also tagged | 1 Comment
  • Rapleaf Is Hiring!

    We are looking for engineers who want to solve challenging problems.

    We have great people, do great work, and have great perks.

    Know someone who might be interested? Refer a friend and get $5,000 for successful hires.

    See our current openings at
    www.rapleaf.com/careers