Hacker News new | ask | show | jobs
by kfichter 1405 days ago
Presumably OCR on the Google Books scans then compare to some known text from a different source (Gutenberg or something)