Hacker News new | ask | show | jobs
by sjmaplesec 106 days ago
Tessl can generate the evals, both to test anthropic best practices as well as running scenarios with and without the skill to check if it's helping