Hacker News new | ask | show | jobs
by poorman 307 days ago
As we saw with GPT-5 the RL technique of training doesn't scale forever
2 comments

Unless GPT-5 is 30% cheaper to run than o3. Then it's scaling brilliantly given the small gap between release dates. People are really drawing too many conclusions from too little information.
I meant scaling the base training before RL.