Hacker News
by kneel 637 days ago
Few-shot learning performance scales with model size. Afaik they don't see a plateau yet, and the race is on to ingest more data and come up with better tuning techniques.

https://splab.sdu.edu.cn/GPT3.pdf