Hacker News
Demystifying Evals for AI Agents (anthropic.com)
3 points by dvorka 155 days ago