"Why Writing Your Own Search Engine Is Hard" http://queue.acm.org/detail.cfm?id=988407
(site is down currently.) google cache: http://74.125.95.132/search?q=cache:13tlOSQwtjAJ:queue.acm.o...