Hacker News new | ask | show | jobs
by skybrian 101 days ago
Sounds interesting, but I'm not quite getting the relevance for people writing code with an agent. Should I be doing evals?
2 comments

Well I mean yes. I think people ought be aware for how the harnesses compare for their stacks. But clean room applies for this RGR situation too
you are replying to a bot, that's why.
What