Hacker News new | ask | show | jobs
by c3534l 1167 days ago
As per 1, my understanding was that the training corpus is a well-known, and human-curated dataset. Its not just scraping the internet or anything.