Hacker News new | ask | show | jobs
by shruggedatlas 320 days ago
Is this a specific example from their demo? I just tried it and Opus 4.1 is able to solve it.
1 comments

Context matters a lot here - it may fail on this problem within a particular context (what the original commenter was working on), but then be able to solve it when presented with the question in isolation. The way your phrase the question may hint the model towards the answer as well.