HN Mirror



	by nialv7 547 days ago
	synthetic data is fine if you can ground the model somehow. that's why o1/o3's improvements are mostly in reasoning, maths, etc.: in those domains you can easily tell whether the data is wrong.

1 comment

That makes a lot of sense.

Binary success criteria leave very little room for bias.
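
To make the idea concrete, here is a minimal sketch of filtering synthetic data with a binary check. Everything in it is an assumption for illustration: the toy addition task, the 30% error rate, and the hypothetical helpers `make_candidate`, `verify`, and `filter_verified`. It is not a description of how o1/o3 were actually trained.

```python
import random


def make_candidate(rng):
    """Produce one synthetic addition sample; the answer is sometimes wrong."""
    a, b = rng.randint(0, 99), rng.randint(0, 99)
    answer = a + b
    if rng.random() < 0.3:  # assumed error rate of a noisy generator
        answer += rng.choice([-2, -1, 1, 2])  # offset is never zero
    return {"a": a, "b": b, "answer": answer}


def verify(sample):
    """Binary check: the answer is exactly right, or the sample is rejected."""
    return sample["a"] + sample["b"] == sample["answer"]


def filter_verified(samples):
    """Keep only the samples that pass the check."""
    return [s for s in samples if verify(s)]


rng = random.Random(0)  # fixed seed so the run is repeatable
candidates = [make_candidate(rng) for _ in range(1000)]
kept = filter_verified(candidates)
print(f"kept {len(kept)} of {len(candidates)} candidates")
```

The check has no notion of "mostly right", so there is nothing for a grader's taste to leak into. That only works because the task has a cheap exact verifier, which is the part that is hard to reproduce for open-ended text.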