Hacker News
by Vuizur 1153 days ago
I could replicate the refusal, but all it took to get the model to reply properly was replacing "evil hacker" with "hacker". Paternalism is not that big of a deal right now, especially compared to ChatGPT/Bing Chat. The fine-tuning data doesn't contain many (any?) task refusals.
1 comment

Oh wow, derp on my part.