Hacker News new | ask | show | jobs
by maerF0x0 59 days ago
This: https://www.anthropic.com/research/project-vend-2 Dec 2025
1 comments

The answer to my question is “no”:

> Claudius got a lot better at its job. Does that mean it’s ready to be rolled out to run a vending machine in your workplace?

Not quite. Claudius is better, but it’s still vulnerable in lots of important ways. Several interactions in our company Slack revealed concerning levels of naïveté.