Hacker News
by magic_hamster 226 days ago
From what I gather, this is more or less what happened, and it's why this was posted in the first place. The models were able to detect a change in their internal state right away, before answering anything.