Ask HN: What do u use for agent/agentic evals?

Y	Hacker News new \| ask \| show \| jobs

	Ask HN: What do u use for agent/agentic evals?
	1 points by hhthrowaway1230 247 days ago
	Right now looking at MLFlow/Braintrust but find it hard to compare acrosss versions of agents, and a/b testing of agents, and mcp tools. Also obvious things like runaway agents (stuck in a loop), or token/spend optimalisation. What do you all use?