Hacker News new | ask | show | jobs
by nialv7 547 days ago
synthetic data is fine if you can ground the model somehow. that's why the o1/o3's improvements are mostly in reasoning, maths, etc., because you can easily tell if the data is wrong or not.
1 comments

That makes a lot of sense.

Binary success criteria has very little room for bias.