by JAG_Ecalona
69 days ago
The sweet spot thing is the real insight here and nobody seems to be talking about it. Frontier models get hyped for their maximum task horizon, but that's also where they're 10-30x more expensive per hour than their optimal range. You're paying a massive premium for the hardest tasks and still failing half the time. Honestly the practical takeaway is pretty boring: just break your work into smaller chunks. Not because the models can't handle longer tasks, but because the economics at shorter task lengths are just way better. The labs are racing to push the horizon out; the smart move for anyone actually paying the bills is to stay near the sweet spot and orchestrate from there.
Generalist models have similar problems to generalist humans. The proverbial "Jack of all trades, master of none."
That said, I've made my career as a generalist :)