Hacker News new | ask | show | jobs
Trust at scale: Auto-evaluation for high-stakes LLM accuracy (blog.elicit.com)
6 points by stuhlmueller 700 days ago