Hacker News new | ask | show | jobs
by CrazyStat 1394 days ago
>it ought to be able to learn what we consider to be harmful.

Who is "we"? Do "we" consider nude paintings to be harmful?

Is "we" Mike Pence? Roman Polanski? Woody Allen?

There is no coherent "we" and no consensus on what "we" consider harmful, so no AI can possibly learn that.

1 comments

Well it ought to be able to be trained for a number of scenarios and then on generation be told to generate based on certain cultural sensibilities. It's not going to be perfect but probably good enough?

Isn't this part of the AI alignment problem? To be able to understand what kinds of output is unacceptable for a certain audience? To be polite?

> Well it ought to be able to be trained for a number of scenarios and then on generation be told to generate based on certain cultural sensibilities. It's not going to be perfect but probably good enough?

Do we want the AI to generate based on Polanski's sensibilities, even if he's the only audience member? I suspect for most people the answer is no.

No