Hacker News
by
nialv7
547 days ago
Synthetic data is fine if you can ground the model somehow. That's why o1/o3's improvements are mostly in reasoning, maths, etc.: in those domains you can easily tell whether the data is wrong or not.
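To make the "easily tell if the data is wrong" point concrete, here is a minimal sketch of filtering synthetic samples with a pass/fail check. Everything in it is an assumption for illustration: the `Sample` type, the `is_correct` and `filter_verified` helpers, and the toy addition task are hypothetical, and this is not a description of how o1/o3 were actually trained.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Sample:
    """A hypothetical synthetic training example for a toy addition task."""
    a: int
    b: int
    claimed_sum: int  # answer proposed by some (hypothetical) generator model


def is_correct(sample: Sample) -> bool:
    """Binary check: the claimed answer either matches the true sum or it does not."""
    return sample.a + sample.b == sample.claimed_sum


def filter_verified(samples: list[Sample]) -> list[Sample]:
    """Keep only the samples that pass the verifier."""
    return [s for s in samples if is_correct(s)]


# Three candidate samples; the second one has a wrong answer (10 + 7 is 17).
candidates = [Sample(2, 3, 5), Sample(10, 7, 18), Sample(-4, 4, 0)]
verified = filter_verified(candidates)
print(len(verified))
```

The check needs no human judgment, which is what makes it cheap to apply to large amounts of generated data. Domains without such a verifier (e.g. open-ended writing) have no equivalent of `is_correct`.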
dartos
547 days ago
That makes a lot of sense.
Binary success criteria leave very little room for bias.