Hacker News new | ask | show | jobs
by cousin_it 3982 days ago
The AI won't be limited to techniques that you could think of, or techniques that Eliezer could think of. So you'd only get a false sense of security.

Besides, releasing a successful log might be a bad idea for other reasons. Think about how you'd play this game as an AI. You wouldn't go looking for a general purpose mindfuck, because there's probably no such thing. Instead, you would probably spend about a month gathering real life information about the gatekeeper's history, family, weaknesses etc. You'd read books on manipulation and sales techniques, and pick the strongest ones that you can find. You would brainstorm possible tactics and run tests. At the end of the month you'd have a 4 hour script with all possible unfair moves you could use against that person, arranged in the most effective order. (That's why it's a bad idea to play this game with friends.) Do you really want that information to be released? And if you know ahead of time that it will be released, won't it limit your efficiency?

1 comments

So you reckon as the AI player he blackmailed the gatekeeper player? "Let me out or I'll tell your friends/family/co-workers x about you" type of thing?
It's more about finding buttons to push. For example, Justin Corwin won one of his games against a religious woman by telling her that she shouldn't play God by keeping him locked up for a subjective eternity (it was more involved, but you get the point). You could come up with other tactics if you know the gatekeeper is divorced, or donates to charity, or is an immigrant, etc. Really, you'll be surprised by how much progress you can make on an "impossible" problem if you just spend five minutes thinking without flinching away.