| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by poorman 355 days ago
	As we saw with GPT-5 the RL technique of training doesn't scale forever

2 comments

energy123 354 days ago

Unless GPT-5 is 30% cheaper to run than o3. Then it's scaling brilliantly given the small gap between release dates. People are really drawing too many conclusions from too little information.

link

oezi 355 days ago

I meant scaling the base training before RL.

link