Hacker News new | ask | show | jobs
by lkois 3 hours ago
Since you cannot imagine how they'd perform, isn't this the perfect opportunity to test your assumption?
1 comments

yeah and lets not forget codex and glm have subscriptions too, with even more usage per dollar
but they also burn more tokens per task, so in the end, Claude comes out as the more efficient one, despite giving you less tokens.
You've got it backwards. Opus is the token/money burning one https://deepswe.datacurve.ai/

Gpt 5.5 uses a third of the opus 4.8 tokens for the same task and scores higher. Glm 5.2 was worse in quality but used half the tokens - 5.3 is not tested yet but will be higher.