Y
Hacker News
new
|
ask
|
show
|
jobs
by
simianwords
85 days ago
It worked for you because the paper does the experiment without allowing the model to use any reasoning tokens - something that is grossly misleading.