Hacker News new | ask | show | jobs
by zahlman 16 days ago
It seems to me like they're saying the agent made the tool call they expected, but the harness didn't reject it like they expected it to.
1 comments

But it sounds like it's not even a harness issue if they have a process where they send a reset email to an address that isn't associated with the account.

This isn't (just) a validation issue, and shouldn't be at the harness level.