
by user43928 2 hours ago

Opus 4.8 High effort seems adequate for me currently, at API pricing, with a $200/month budget.

This is at work where I don't work on greenfield or parallelize feature development.

I cannot see the agent burning through $50 for one moderately sized TypeScript cleanup in my setup. This sounds like something that can be improved on OP's side.

There have been rumors about a potential Sonnet 5 model release in the near future, which hopefully tilts the cost/benefit ratio further in our favor.

2 comments

KronisLV 2 hours ago

> I cannot see the agent burning through $50 for one moderately sized TypeScript cleanup in my setup.

Here's my usage, from the ccusage tool (slightly shortened for readability):

  ┌──────────┬───────────────┬────────────┬─────────────┬─────────────┬───────────────┬────────────────┬────────────────┬─────────────┐
  │ Month    │ Agent         │ Models     │       Input │      Output │  Cache Create │     Cache Read │   Total Tokens │  Cost (USD) │
  ├──────────┼───────────────┼────────────┼─────────────┼─────────────┼───────────────┼────────────────┼────────────────┼─────────────┤
  │ 2026-06  │ - Claude      │ - opus-4-8 │  13,635,792 │  32,562,574 │   177,985,265 │  5,265,814,971 │  5,489,998,602 │    $4665.09 │
  └──────────┴───────────────┴────────────┴─────────────┴─────────────┴───────────────┴────────────────┴────────────────┴─────────────┘

Now obviously that is all with the Max 5x subscription, other agents and models excluded.

So per day that'd be around 155 USD (including weekends), which doesn't seem that far off, as long as the example cleanup takes up around 1/3 of one's daily work (or needs a lot of review/test iterations, or needs to review a lot of the existing code etc.).
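
As a quick sanity check of that arithmetic, here is a minimal Python sketch. It assumes a 30-day June and reuses the $50 cleanup figure from the quoted comment; the variable names are made up for illustration.

```python
# Figures copied from the ccusage table above.
MONTHLY_COST_USD = 4665.09
DAYS_IN_JUNE = 30  # 2026-06, weekends included

daily_cost = MONTHLY_COST_USD / DAYS_IN_JUNE

# The disputed cleanup price, expressed as a fraction of an average day.
CLEANUP_COST_USD = 50.0
cleanup_share = CLEANUP_COST_USD / daily_cost

print(f"daily cost: ${daily_cost:.2f}")
print(f"cleanup share of a day: {cleanup_share:.0%}")
```

Under those assumptions a $50 task comes out at a bit under a third of an average day's spend.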


user43928 1 hour ago

Interestingly it seems 80% of the cost is in the cached tokens.

I do not know whether that is typical, or indicative of conversations with too many turns.

Not that I would worry about this on a subscription plan, but at work where we are billed at API rates, I try to move to new conversations as often as possible.
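
The roughly 80% figure can be reproduced from the table, but only under assumed per-token rates. The sketch below uses hypothetical USD rates per million tokens (5.00 input, 25.00 output, 6.25 cache write, 0.50 cache read); none of these are stated in the thread, and they give a total close to, but not exactly, the reported $4665.09, so the real rates likely differ somewhat.

```python
# Token counts copied from the ccusage table above.
tokens = {
    "input": 13_635_792,
    "output": 32_562_574,
    "cache_create": 177_985_265,
    "cache_read": 5_265_814_971,
}

# ASSUMED USD rates per million tokens; hypothetical, not from the thread.
assumed_rates = {
    "input": 5.00,
    "output": 25.00,
    "cache_create": 6.25,
    "cache_read": 0.50,
}


def cost_by_kind(tokens, rates):
    """Return the USD cost of each token category."""
    return {kind: tokens[kind] / 1_000_000 * rates[kind] for kind in tokens}


costs = cost_by_kind(tokens, assumed_rates)
total_cost = sum(costs.values())

# Share of the bill that comes from cache writes plus cache reads.
cache_cost_share = (costs["cache_create"] + costs["cache_read"]) / total_cost

# Share of the raw token count that is cache traffic.
total_tokens = sum(tokens.values())
cache_token_share = (tokens["cache_create"] + tokens["cache_read"]) / total_tokens

print(f"cache share of cost:   {cache_cost_share:.1%}")
print(f"cache share of tokens: {cache_token_share:.1%}")
```

With these assumed rates, cache traffic is about 81% of the cost while making up over 99% of the tokens, because cache reads are priced far below fresh input.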


KronisLV 1 hour ago

For agentic development upwards of 90% is pretty normal!

For example, that's what you get if you make Claude Code explore a codebase, write a plan based on it and your requirements, do a few iterations of further specifying and altering the plan, and afterwards let it work for, let's say, 2-4 hours.

Sub-agents and dynamic workflows do alter the numbers a bit, but not to a crazy degree in the long run.


xienze 2 hours ago

> I cannot see the agent burning through $50 for one moderately sized TypeScript cleanup in my setup.

I have absolutely seen stuff like this happen. Think about it: when you point Claude at a bunch of files, it has to suck them all up (tens of thousands of tokens), spend some proportional number of tokens doing stuff, and spit them back out (tens of thousands of tokens) for each pass in the "cleanup" loop. I had a similar situation occur a few months ago: a very small "add Javadoc to these dozens of classes" scenario. Sonnet rapidly rate limited my $20 plan, so I switched to extra usage. A very small (IMO) number of changes later, I had spent like $7 in tokens.

The main problem is you really have no idea ahead of time just how many tokens a given task is going to take. I suggest you try spending a day running your Opus 4.8 High effort on API pricing to see just how much your $200 subscription is being subsidized before you confidently state that $50 for some TS cleanup task isn't possible.


user43928 1 hour ago

I've spent a week doing just that. As I said: at API pricing, $200/month currently seems adequate for 2-4 weeks of usage for me at work.

$50 would be 10M input tokens, not tens of thousands.


xienze 1 hour ago

> I said at API pricing, $200/month

Well, I saw $200/month and thought you were talking about a Max plan, sorry. But I will say, unless you're using that top-end model extremely judiciously, $200 for 2-4 weeks of work is similarly hard to believe (see the other poster breaking down their usage). What are you typically doing? Must be pretty hardcore stuff if you need to use the baddest available model. How many interactions per day? Care to share your token usage stats?

> $50 would be 10M input tokens, not tens of thousands.

Two things. One, input tokens are but one component, and the cheapest. Output tokens include the tens of thousands being spit out for file changes AND the thinking/crunching that you don't see, and that's the most expensive part. Two, remember that this is per iteration; not everything is one-shot (especially with tasks like "fix this large part of my codebase").
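
To make the per-iteration point concrete, here is a rough sketch. The 50,000-token figures and the $25 per million output rate are hypothetical placeholders, and the $5 per million input rate is only what the quoted "$50 would be 10M input tokens" implies. It also ignores caching and context re-reads, so treat it as an illustration rather than a bill.

```python
def cost_per_pass(input_tokens, output_tokens, input_rate, output_rate):
    """USD cost of one pass; rates are USD per million tokens."""
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


# HYPOTHETICAL per-pass volumes: "tens of thousands" in, and the same out.
pass_cost = cost_per_pass(
    input_tokens=50_000,
    output_tokens=50_000,
    input_rate=5.0,    # implied by "$50 would be 10M input tokens"
    output_rate=25.0,  # assumed, not stated in the thread
)

# How many such passes it would take to reach the disputed $50.
passes_for_50_usd = 50.0 / pass_cost

print(f"cost per pass: ${pass_cost:.2f}")
print(f"passes to reach $50: {passes_for_50_usd:.1f}")
```

Under these placeholder numbers, output dominates each pass, and a few dozen passes would be enough to reach $50.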


user43928 1 hour ago

I don't have stats for what I use at work. This week I have been working on React frontend, also with TypeScript.

It is not my experience that you need to do 'hardcore stuff' to require the use of a large model. The difference in productivity between babysitting Sonnet and trying to get the result into a good shape compared to using Opus 4.8 seems large to me.

At home, unfortunately I only have the stats from the official apps rather than granular ones, and it looks like the Claude Desktop app is buggy: it was showing 17M tokens total in the last 30 days, but even just clicking on a conversation in my side bar increased the counter to 19M. It's clearly not working.

Codex shows up to 900M tokens total/week.
