Hacker News new | ask | show | jobs
by martin-t 497 days ago
The nuance here being that this only proves additional censorship is applied on top of the output. It does not disprove that (sometimes ineffective) censorship is part of the LLM or that censorship was not attempted during training.