|
|
|
|
|
by Barathkanna
103 days ago
|
|
One underlooked source of variance is retries from formatting failures. In many agent systems the loops dominate the cost, not the raw token length. We ran into the same issue building agent workflows, which is why we started building https://oxlo.ai — experimenting with a flat subscription model where we absorb the token variance so builders don’t have to constantly model token risk. |
|