Hacker News
by monkeynotes 539 days ago
We don't know whether a supreme deceiver is aligned at all. If a model can think a trillion moves of deception ahead, how do humans possibly stand a chance of scrutinizing anything with any confidence?