| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by mips_avatar 202 days ago
	The scoop Dylan Patel got was that part way through the gpt4.5 pretraining run the results were very very good, but it leveled off and they ended up with a huge base model that really wasn't any better on their evals.