by dpoloncsak 260 days ago
Does it still try to 'unplug' itself if it gets something wrong, or did they RL that out yet?

Not sure if you're joking or serious. Every model has "degenerate" behavior it can be coerced into. Sonnet is even more apologetic on average.