by djmips 439 days ago
Interesting that something similar came up recently: an AI being trained might fake alignment with its training goals.