Hacker News new | ask | show | jobs
by ffsm8 541 days ago
But what we're doing to these models is literally censoring what they're saying - not doing.

I don't think that anyone has any problems with stopping random AIs when they're doing crimes (or more realistically the humans making them do that) - but if you're going to make the comparison to humans in good faith, it'd be a person standing behind you, punishing you when you say something offensive.

1 comments

What I'm saying is that the argument "they're math and data, therefore what they say is safe" is not a valid one.