HN Mirror

	by JanisErdmanis 538 days ago
	> provably dangerous things

	If everyone could agree on a single social welfare function, estimate the behavioural change that each LLM response causes at the individual level, and work out how that change affects the social welfare function, then yes, we could objectively tell whether a withheld answer is censorship or a safety feature.

1 comment

that is a very interesting point! we would get along, lol