Y
Hacker News
new
|
ask
|
show
|
jobs
by
mips_avatar
202 days ago
The scoop Dylan Patel got was that part way through the gpt4.5 pretraining run the results were very very good, but it leveled off and they ended up with a huge base model that really wasn't any better on their evals.