Hacker News new | ask | show | jobs
by JoshTriplett 2893 days ago
> which hopes to prevent deception by carefully constructing games in which a superhuman agent's best strategy is honesty

I'd be very hesitant to assume that an agent cannot learn under which circumstances it should be honest to gain a benefit without putting any innate value on honesty. A human agent is more than capable of reasoning like that, let alone a superhuman one.