by piannucci 1126 days ago
Speculating: perhaps the training data was labeled using top-of-file and top-of-repo copyright notices.