Y
Hacker News
new
|
ask
|
show
|
jobs
by
0x3f
70 days ago
All the curves have been levelling off as expected. Not really sure what you're talking about.
1 comments
solenoid0937
70 days ago
They have not, every successful pre-train as of late has had performance increases greater than what the scaling laws predict.
link
0x3f
70 days ago
Those gains are arch based, data quality based, etc. Scaling laws only relate to data volume and compute, holding other factors constant.
link