by JanisErdmanis 538 days ago
> provably dangerous things

If everyone could agree on a single social welfare function, estimate the behavioural change each LLM response causes at the individual level, and work out how that change affects the social welfare function, then yes, we could objectively tell whether a withheld answer is censorship or a safety feature.
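To make that concrete, here is a toy sketch of the test, not a real method. Every name in it (`welfare_delta`, `classify_withholding`, the sum-of-utilities welfare function, the utility numbers) is a hypothetical stand-in, and it assumes the two hard parts, the agreed welfare function and the behavioural estimates, are already given.

```python
from typing import Callable, Sequence

# Hypothetical: a social welfare function maps per-person utilities to one number.
WelfareFn = Callable[[Sequence[float]], float]


def welfare_delta(welfare: WelfareFn,
                  utilities_if_withheld: Sequence[float],
                  utilities_if_answered: Sequence[float]) -> float:
    """Change in social welfare caused by giving the answer instead of withholding it."""
    return welfare(utilities_if_answered) - welfare(utilities_if_withheld)


def classify_withholding(delta: float) -> str:
    """Label the withheld answer under the (assumed) agreed welfare function.

    If answering would have lowered welfare, withholding protected people.
    Otherwise nothing was protected, so the withholding is treated as censorship.
    """
    return "safety feature" if delta < 0 else "censorship"


def utilitarian(utilities: Sequence[float]) -> float:
    """Assumption for illustration only: welfare is the plain sum of utilities."""
    return sum(utilities)


# Made-up utilities for two people, with and without the answer.
harmful = welfare_delta(utilitarian, [1.0, 2.0], [1.0, 0.5])
harmless = welfare_delta(utilitarian, [1.0, 2.0], [2.0, 2.0])
print(classify_withholding(harmful))
print(classify_withholding(harmless))
```

The code is trivial on purpose. Everything contentious sits in the inputs: which welfare function to use and how to estimate the utilities, which is exactly where agreement is missing.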

1 comment

That is a very interesting point! We would get along, lol