


	by AgentME 730 days ago
	One piece of evidence in this direction is that RLHF was originally developed as an alignment technique and then turned out to be a breakthrough that also made LLMs better and more useful. Alignment and capabilities work aren't necessarily at odds with each other.