by CrazyStat
1394 days ago
>it ought to be able to learn what we consider to be harmful. Who is "we"? Do "we" consider nude paintings to be harmful? Is "we" Mike Pence? Roman Polanski? Woody Allen? There is no coherent "we" and no consensus on what "we" consider harmful, so no AI can possibly learn that.

Isn't this part of the AI alignment problem? To be able to understand what kinds of output are unacceptable for a certain audience? To be polite?