Hacker News new | ask | show | jobs
by MrVandemar 757 days ago
> The fact is that I think that there is not much written word, to actually train a sensible model on. A lot of books don't have OCRed scans, or a digital version.

https://books.google.com/