by Vuizur
1153 days ago
I could replicate the refusal, but all it took to get the model to reply properly was replacing "evil hacker" with "hacker". Paternalism is not that big of a deal right now, especially compared to ChatGPT/Bing Chat. The fine-tuning data doesn't contain many (any?) task refusals.