Slides from Spinn3r Architecture Talk at 2008 MySQL Users Conference
Here’s a copy of the slides from the talk I just gave about the architecture of Spinn3r at the 2008 MySQL Users Conference:
We present the backend architecture behind Spinn3r – our scalable web and blog crawler.
Most existing work in scaling MySQL has been around high read throughput environments similar to web applications. In contrast, at Spinn3r we needed to complete thousands of write transactions per second in order to index the blogosphere at full speed.
We have achieved this through our ground up development of a fault tolerant distributed database and compute infrastructure all built on top of cheap commodity hardware.