Hacker News
by sjmaplesec 106 days ago
The review eval tests the language, activation, etc. of skills. I guess you could quickly move it all into a skill and then run an eval on that if you're using Tessl. This checks whether the way you write the instructions etc. is well understood by the agent.