by AbrahamParangi 1023 days ago
Here you go: https://arxiv.org/pdf/2212.08073.pdf

The relevant part is the graph on the third page showing the helpfulness/harmlessness trade-off curves.

Also, I don’t believe I said that “censorship is logically inconsistent in every moral framework”. I think you’re combining two of my statements: that some people believe in some censorship, and that logically inconsistent HR blather can only be reproduced by models too stupid to realize it’s blather or too manipulative to tell the truth.