Hacker News new | ask | show | jobs
by denton-scratch 1867 days ago
Is it repeating itself because the corpus is too small? 10,000 papers seems like rather a small corpus. How large a training corpus would normally be used in GPT-2 work?

[I know nothing - I'm pretty ignorant about practical ML]