by littlestymaar
388 days ago
> I personally can't identify anything that reads "act maliciously" or in a character that is malicious.

That's because you haven't been trained on thousands of such story plots. It's the most stereotypical plot you can imagine, so how can the AI not fall into the stereotype when you've just prompted it with that? It's not like it analyzed the situation out of a big context and decided from the collected details that it's a valid strategy; instead, you're putting it in an artificial situation with a massive bias in the training data. It's as if you wrote “Hitler did nothing” to GPT-2 and were shocked because “wrong” is among the most likely next tokens. It wouldn't mean GPT-2 is a Nazi; it would just mean that the input matches too well with the training data.