Hacker News new | ask | show | jobs
by jcmeyrignac 897 days ago
And what if Z-Library or Anna's Archive (respectively 11 and 25 millions of books) were used as sources? Do you remember Recaptcha, which displayed words extracted from scanned material? Maybe the content of these books was extracted with Recaptcha?
1 comments

Not that they are not aware: https://annas-archive.org/llm