|
|
|
|
|
by sukinai
140 days ago
|
|
Thanks for sharing this. Real-time governance at the moment of consumption is exactly the pain point—most teams only notice the “token surprise” when finance forwards the bill. Curious: how do you handle streaming responses and tool calls in metering (e.g., partial outputs, retries, multi-step agent loops)? And what’s the typical latency overhead you see in production? |
|