Hacker News
by michaelt 317 days ago
If a person is configuring an LLM for education, to provide personalised math coaching to 10-year-olds, they want an LLM that won't output anything NSFW, no matter how the user pokes and prods it. That's totally reasonable.

But if that person is applying AI safety techniques like concept erasure to remove the model's ability to output porn, is that not anti-porn in the most literal sense?