
	by sukinai 140 days ago
	Thanks for sharing this. Real-time governance at the moment of consumption is exactly the pain point—most teams only notice the “token surprise” when finance forwards the bill. Curious: how do you handle streaming responses and tool calls in metering (e.g., partial outputs, retries, multi-step agent loops)? And what’s the typical latency overhead you see in production?