Hacker News new | ask | show | jobs
Demystifying Evals for AI Agents (anthropic.com)
3 points by dvorka 155 days ago