Hacker News new | ask | show | jobs
by nate 17 days ago
It's funny the convos I now have with Sonnet that I wasn't having with Opus. I feel like most of us here are starting to be told to draw down some of our 1M Opus xtrahigh thinking tokens :)

Is anyone using a local router to deal with that? Something thats like "don't even bother with sonnet for this task, just go with Opus". I wonder if Haiku could even do that math and recommend the model you should be in?

2 comments

my task workflow uses something like opus to evaluate the roadmap, sonnet to divide the tickets by complexity, and then dispatch them to the relevant models - I use haiku or openai's spark models (spark is FAST! and DUMB!) for the simplest, and ascending in complexity. I find mid tier sonnet and gpt5 are pretty competitive, and reserve opus for truly "rearchitect the app from scratch" style tasks.

But all that might be somewhat obsolete, the latest update for claude code looks like it uses workflows with various models, so they might already be optimizing that.

The version that probably works better is triaging in advance what's definitely not Opus territory: summaries, documentation, test generation.