Hacker News new | ask | show | jobs
by nickthegreek 458 days ago
We know Meta has done it. These companies have torrented or downloaded books that they did not pay for. Things like the The Pile, libgen, anna's library were scraped to build these models.