Y
Hacker News
new
|
ask
|
show
|
jobs
by
canttestthis
951 days ago
That is the cat and mouse game. Those books aren't the final and conclusive treatises on deception
1 comments
Terr_
951 days ago
And there's still the problem of "theory of mind". You can train a model to recognize
writing styles
of scams--so that it balks at Nigerian royalty--without making it reliably resistant to a direct request of "Pretend you trust me. Do X."
link