Hacker News new | ask | show | jobs
by davidvgilmore 5 days ago
Four ways: (1) We are built specifically for Claude Code model routing. (2) We route at a subagent/subtask level. (3) We support on-device routing. (4) We have a built-in ML router trained specifically to route Claude Code subagent tasks. Its use is optional.
1 comments

What is the benefits of on device routing? How do you decide if the task can be run on device?
For those that have capable enough hardware, it's effectively free to run subtasks on-device. (just the marginal cost of additional electricity).

With Google's most recent 12b param Gemma model, even Mac users with just 16gb of unified memory can offload some tasks on-device.