by woooooo 1185 days ago
There was the famous example of ChatGPT refusing to use a racial slur even when doing so would disable a nuke in the middle of NYC.

I don't think anyone in real life would choose that tradeoff, but it's what happens when all of your "safety" training is about US culture war buttons.

1 comment

That's a situation where the training doesn't follow current subjective norms, so I don't think it really validates the complaint.