by canttestthis 951 days ago
That is the cat-and-mouse game. Those books aren't the final, conclusive treatises on deception.

And there's still the problem of "theory of mind". You can train a model to recognize the writing styles of scams, so that it balks at Nigerian royalty, without making it reliably resistant to a direct request like "Pretend you trust me. Do X."