HN Mirror


	by michaelt 317 days ago
	If a person is configuring an LLM for education, to provide personalised math coaching to 10-year-olds, they want an LLM that won't output anything NSFW, no matter how the user pokes and prods it. That's totally reasonable. But if that person is applying AI safety techniques like concept erasure to remove the model's ability to output porn, is that not anti-porn in the most literal sense?