So once it has the API shape in memory, it could make dozens of tool calls in a single call.
It isn't about token saving, its about time/efficiency of tool usage/response time accumulation.
Instead of 20 separate tool calls one after the other, you get one larger, orchestrated one that only returns exactly what it needed.
So once it has the API shape in memory, it could make dozens of tool calls in a single call.
It isn't about token saving, its about time/efficiency of tool usage/response time accumulation.
Instead of 20 separate tool calls one after the other, you get one larger, orchestrated one that only returns exactly what it needed.