Hacker News
by paytonjjones 8 days ago
Exposure to horrors doesn't imply capability or desire to commit said horrors. But it does seem like kind of a prerequisite.

All else being equal, I think I'd prefer my models to be naive about human degradation and torture, for instance. Exceptions made for specialized models used for police work etc.

I do think broader alignment is necessary either way, but that seems like an extra guardrail it'd be nice to have.

1 comment

>I'd prefer my models to be naive about...

In practice it's been shown that LLMs perform better when trained on more diverse data. Training on images in this domain can improve performance in other domains. I would prefer to have models train on as much data as exists.

>specialized models used for police work

The benefit of AGI is that you do not need to have special models for different domains.