Hadoop Open Sourcing Google

Business Week has an article on Hadoop vs. Google this week. Not much meat here if you're familiar with the space. I found this interesting, though:

In early November, for example, the tech team at The New York Times (NYT) rented computing power on Amazon’s (AMZN) cloud and used Hadoop to convert 11 million archived articles, dating back to 1851, to digital and searchable documents. They turned around in a single day a job that otherwise would have taken months.

This is one of the advantages of EC2 and a good example of what you can do with the platform. If the compute work takes a couple of orders of magnitude longer than transferring the data into EC2, moving the job there is a clear win.
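To make that rule of thumb concrete, here is a rough back-of-the-envelope sketch. The function names, the 100x threshold (my reading of "a couple orders of magnitude"), and all the numbers in the usage example are assumptions for illustration, not figures from the NYT job.

```python
def estimate_transfer_hours(data_gb, uplink_mbps):
    """Estimate hours needed to upload data_gb gigabytes over an uplink_mbps link.

    Uses decimal units (1 GB = 8000 megabits) and ignores protocol overhead.
    """
    if uplink_mbps <= 0:
        raise ValueError("uplink_mbps must be positive")
    seconds = data_gb * 8000 / uplink_mbps
    return seconds / 3600


def is_clear_win(transfer_hours, compute_hours, min_ratio=100.0):
    """Return True when compute time exceeds transfer time by at least min_ratio.

    min_ratio=100 is an assumed stand-in for "a couple orders of magnitude".
    """
    if transfer_hours <= 0:
        raise ValueError("transfer_hours must be positive")
    return compute_hours / transfer_hours >= min_ratio


if __name__ == "__main__":
    # Hypothetical job: 450 GB of input, a 100 Mbps uplink,
    # and roughly two months of single-machine compute.
    upload = estimate_transfer_hours(data_gb=450, uplink_mbps=100)
    compute = 60 * 24  # hours
    print(f"upload: {upload:.1f} h, compute: {compute} h")
    print("clear win" if is_clear_win(upload, compute) else "not obviously worth it")
```

The compute figure here is the time the job would take on the hardware you already have; the point is only that the upload cost should be small change next to it.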
