by ACCount37
9 days ago
"Focusing on a domain" has a hard ceiling. A model's capability is a function of model size, and you can only push a small overspecialized "idiot savant" model so far before its crippling size starts to bite you. You can make a model like Composer 2.5. But Mythos 5 will beat it on capability, both at coding and at everything else. And the world is always hungry for more capabilities. If you're running high on agentic AI and low on human oversight, paying x2 for going from 5% faults to 2% faults is a good deal. |
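To put rough numbers on the "2x for 5% to 2% faults" trade-off, here is a small break-even sketch. Only the fault rates and the 2x price ratio come from the comment above; the unit price of 1.0 and the idea of a fixed cleanup cost per fault are assumptions made purely for illustration.

```python
def expected_cost(price: float, fault_rate: float, fault_cost: float) -> float:
    """Expected cost of one task: the price paid plus the average fault cleanup cost."""
    return price + fault_rate * fault_cost


def break_even_fault_cost(cheap_price: float, cheap_rate: float,
                          pricey_price: float, pricey_rate: float) -> float:
    """Per-fault cost above which the pricier, more reliable model is cheaper overall.

    Solves cheap_price + cheap_rate * F == pricey_price + pricey_rate * F for F.
    """
    return (pricey_price - cheap_price) / (cheap_rate - pricey_rate)


# Hypothetical unit price of 1.0 for the cheaper model, 2.0 for the pricier one.
threshold = break_even_fault_cost(1.0, 0.05, 2.0, 0.02)
print(f"2x is worth it once a fault costs more than {threshold:.1f}x the cheap price")
```

Under these assumptions, the more expensive model wins as soon as one fault costs more than roughly 33 times the cheaper model's per-task price, which is easy to hit when nobody is reviewing the agent's output.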
I think the path forward will have agents that use models individually specialized for their tasks (some might use a bigger model, some smaller ones), plus orchestrators that are good at knowing when to use which agent type.
I've played around with this in my own tiny coding agents, for TTRPG NPCs, and even in a small experiment where LLMs controlled a MUD client as an NPC that played the game with you (only 5 rooms in the experiment).
Basically, break the tasks down into chunks so you don't have to use generalist models for everything and can choose the right model for the job.
I'm also running all of this locally, where a generalist foundation model doesn't work and heavily quantized models don't perform well on every task, so if you have an unlimited token budget, my solution is probably overkill.
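As a rough illustration of the orchestrator idea described above, here is a minimal sketch. Everything in it is hypothetical: the agent names, the model labels, and the keyword lists are made up, the keyword matcher stands in for whatever routing logic a real orchestrator would use (possibly a model of its own), and the stub handlers stand in for actual local inference calls.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, Tuple


@dataclass
class Agent:
    name: str                      # hypothetical agent type
    model: str                     # hypothetical model label, not a real model
    handle: Callable[[str], str]   # does the work for one task chunk


def make_stub(model: str) -> Callable[[str], str]:
    """Stand-in for a real inference call; it only tags the task with the model."""
    return lambda task: f"[{model}] {task}"


AGENTS: Dict[str, Agent] = {
    "code": Agent("coder", "bigger-code-model", make_stub("bigger-code-model")),
    "dialogue": Agent("npc", "small-chat-model", make_stub("small-chat-model")),
    "fallback": Agent("generalist", "mid-size-model", make_stub("mid-size-model")),
}

# Assumed routing table: task kind -> words that suggest it.
KEYWORDS: Dict[str, Tuple[str, ...]] = {
    "code": ("function", "bug", "refactor"),
    "dialogue": ("npc", "say", "greet"),
}


def classify(task: str) -> str:
    """Pick a task kind by keyword match; fall back when nothing matches."""
    words = set(re.findall(r"[a-z]+", task.lower()))
    for kind, keywords in KEYWORDS.items():
        if words.intersection(keywords):
            return kind
    return "fallback"


def route(task: str) -> Tuple[str, str]:
    """Send one task chunk to the agent chosen for it; return (agent name, output)."""
    agent = AGENTS[classify(task)]
    return agent.name, agent.handle(task)


for chunk in ("Fix the bug in this function",
              "What does the NPC say to the player?",
              "Summarize the room"):
    print(route(chunk))
```

The point of the split is that only the chunks that need the bigger model pay for it, while everything else runs on whatever small model is good enough for that job.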