| HN Mirror

	by HarHarVeryFunny 681 days ago
	RLHF really isn't the problem as far as surpassing human capability goes. Language models trained to mimic human responses are fundamentally not going to do anything other than mimic human responses, regardless of how you fine-tune them toward the specific kinds of human responses you do or don't like. If you want to exceed human intelligence, then design architectures for intelligence, not for copying humans!