Hacker News
by kneel 637 days ago
Few-shot learning performance scales with model size. AFAIK they don't see a plateau yet, and the race is on to ingest more data and come up with better tuning techniques.
https://splab.sdu.edu.cn/GPT3.pdf