Hacker News new | ask | show | jobs
by throwaway0123_5 315 days ago
It seems like you might need less output tokens for the same quality of response though. One of their plots shows o3 needing ~14k tokens to get 69% on SWE-bench Verified, but GPT-5 needing only ~4k.