Y
Hacker News
new
|
ask
|
show
|
jobs
by
poorman
307 days ago
As we saw with GPT-5 the RL technique of training doesn't scale forever
2 comments
energy123
307 days ago
Unless GPT-5 is 30% cheaper to run than o3. Then it's scaling brilliantly given the small gap between release dates. People are really drawing too many conclusions from too little information.
link
oezi
307 days ago
I meant scaling the base training before RL.
link