Hacker News new | ask | show | jobs
by pelorat 598 days ago
> I've seen very little convincing discussion about what to do about this problem.

I think we will need adversarial AI agents whose task is to monitor other agents for anything suspicious. Every input and output would be scrutinized and either approved or rejected.

1 comments

They will also be vulnerable to the same attack though.
It's AI agents all the way down