Hacker News new | ask | show | jobs
by sourabh03agr 885 days ago
The authors claim that it works well for a wide variety of cases. They have defined a novel categorisation of these evaluations which help guide LLMs to generate relevant assertions