Trust at scale: Auto-evaluation for high-stakes LLM accuracy

Y	Hacker News new \| ask \| show \| jobs

	Trust at scale: Auto-evaluation for high-stakes LLM accuracy (blog.elicit.com)
	6 points by stuhlmueller 700 days ago