Monthly Archives: June 2010

Fully async Thrift client in Java

Thrift has had an asynchronous server implementation for Java for quite some time, but users have been asking for a way to have an asynchronous client since the very beginning. The motivation behind this style of client is usually performance. Imagine you take a bunch of time and make a highly optimized web application that makes [...]
Posted in Thrift | Tagged | 4 Comments

Pseudo-Combiners in Cascading

In order to get maximum performance from MapReduce, you need to minimize the amount of data that you have to transfer around the network.  If nearly your entire input must be transferred from your mappers to your reducers, then you’ll be putting a great deal of stress on your disks and network.  One thing that [...]
Posted in Cascading | Tagged | Leave a comment