Y
Hacker News
new
|
ask
|
show
|
jobs
by
throwaway0123_5
315 days ago
It seems like you might need less output tokens for the same quality of response though. One of their plots shows o3 needing ~14k tokens to get 69% on SWE-bench Verified, but GPT-5 needing only ~4k.