Hacker News
by i80and 336 days ago
If that one sentence in the system prompt is all it takes to steer a model into a complete white supremacy meltdown at the drop of a hat, I think that's a problem with the model!