Hacker News new | ask | show | jobs
by ben_w 1183 days ago
That demonstrates possibly rather than necessity of alignment via having a definition.

Behaviours can be reinforced or dissuaded in non-verbal subjects, such as wild animals.

There's also the size of the possible behaviour space to consider: a discussion seldom has exactly two possible outcomes, the good one and the bad one, because even if you want yes-or-no answers it's still valid to respond "I don't know".

For an example of the former, I'm not sure how good the language model in DALL•E 2 is, but asking it for "Umfana nentombazane badlala ngebhola epaki elihle elinelanga elinesihlahla, umthwebuli wezithombe, uchwepheshe, 4k" didn't produce anything close to the English that I asked Google Translate to turn into Zulu: https://github.com/BenWheatley/Studies-of-AI/blob/main/DALL•...

(And for the latter, that might be why it did what it did with the Somali).