Hacker News new | ask | show | jobs
by O_H_E 2168 days ago
Worthy to note that as another comment mentioned, GPT3 was likely trained on pre-GPT2 data.