The document describes how Sphinx, an open source full-text search engine, was used to optimize searching and reporting on a large dataset of over 160 million cross-links. The data was partitioned across 8 servers each with 4 Sphinx instances and 2 indexes. Queries were run in parallel across the instances to return results faster than could be achieved with a single database, with average query times under 0.125 seconds and 95% of queries returning under 0.352 seconds. The document outlines the partitioning, indexing, and querying approach used to optimize performance for the dataset.