Y
Hacker News
new
|
ask
|
show
|
jobs
by
johnfn
500 days ago
I think this is fairly easily debunked by o1, which is basically just 4o in a thinking for loop, and performs better on difficult tasks. Not a LOT better, mind you, but better enough to be measurable.