Hacker News
by sukinai 140 days ago
Thanks for sharing this. Real-time governance at the moment of consumption is exactly the pain point—most teams only notice the “token surprise” when finance forwards the bill.

Curious: how does your metering handle streaming responses and tool calls (e.g., partial outputs, retries, multi-step agent loops)? And what’s the typical latency overhead you see in production?