| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by circuit10 1149 days ago
	They use RLHF (reinforcement learning through human feedback) which means they can reward it when it does it and punish it when it doesn’t They’ve probably done it strongly enough that it can’t really not do it, maybe on purpose to prevent misuse