by yyyk
1140 days ago
>>>Alignment just means "getting AI to have any concept of 'the thing we told it to do'".
>>That's a requirement for AGI anyway,
>No, that's a requirement for AGI that does what humans want it to do, rather than having no conception of humans. Can you imagine an AGI which has a general conception of things but has no conception of humans? This is all but precluded by the current training methods. Alignment refers to values. Problem is that human values are far from practically universal and that certain human groups have.. interesting values.
Very easily. It might have some associations with "human" as a concept, just as it has some associations with "lamp" as a concept, but that doesn't mean it has any particular regard for either humans or lamps when taking actions.
> Problem is that human values are far from practically universal and that certain human groups have.. interesting values.
We currently have no ability to safely align AI with human values at all, let alone distinguish between different values. Meanwhile, we're building capabilities rapidly.
Making this about "who wins" is not interesting until we can guarantee the outcome is not "everyone loses".