Hacker News new | ask | show | jobs
by gblargg 10 days ago
Seems like this will backfire. Now when developers encounter problems with Claude Fable, they will have an easy explanation: it did it deliberately and intentionally vaguely. There's no way to falsify it. It's reasonable to expect it to get false positives and invoke this when it shouldn't be.