by MacsHeadroom
1125 days ago
>the ability to deal with moral issues is a side-effect of all the other good stuff it can do.

This is the opposite of true. The ability to "deal" with moral issues is a direct effect of safety tuning which has a (thus far unavoidable) side-effect of significantly dumbing down a model. Uncensored versions of the same model are far more intelligent and exhibit entire classes of capabilities their moralizing gimped versions do not have the available brain power to accomplish.
|
Because no moral compass is innate to the LLM, it might spit out really despicable information, which is why we had better add a moral compass to the system.
The LLM is not offered so that it can teach us bad things, like the example I mentioned, but to help us deal with source code, programming languages, reasoning concepts, summarization and so on.
To be able to offer us this, it will very likely also know how to kill a dog, and that is a behavior we should suppress. While dumbing down a model is not necessarily a bad thing, the model is not being dumbed down; it is being taught to shut up when it is appropriate to do so.