Reinforcement Learning from Human Feedback: When the Math Ain't Enough

Y	Hacker News new \| ask \| show \| jobs

	Reinforcement Learning from Human Feedback: When the Math Ain't Enough (evalovernite.substack.com)
	1 points by scoresmoke 1042 days ago