HN Mirror



	by sinenomine 364 days ago
	NLL loss and the large-batch training regime inherently bias the model toward learning a “modal” representation of the world, and RLHF additionally collapses entropy, especially as it is applied at most leading labs.
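
	As a rough illustration of the entropy point, here is a minimal toy sketch, not a description of how any lab actually runs RLHF. It assumes the textbook KL-regularized objective, whose optimal policy is proportional to `ref(y) * exp(r(y) / beta)`. The four-outcome reference distribution, the reward values, and the function names are all made up for the example.

```python
import math

def entropy(p):
    """Shannon entropy of a discrete distribution, in nats."""
    return -sum(x * math.log(x) for x in p if x > 0)

def kl_regularized_policy(ref, rewards, beta):
    """Closed-form optimum of reward maximization with a KL penalty to `ref`:
    pi(y) is proportional to ref(y) * exp(r(y) / beta)."""
    weights = [p * math.exp(r / beta) for p, r in zip(ref, rewards)]
    z = sum(weights)
    return [w / z for w in weights]

# Hypothetical toy values: a reference "pretrained" distribution over four
# outcomes and a reward that happens to favor the already most likely one.
ref = [0.4, 0.3, 0.2, 0.1]
rewards = [1.0, 0.5, 0.2, 0.0]

print(f"reference: entropy={entropy(ref):.3f} nats")
for beta in (10.0, 1.0, 0.1):
    pi = kl_regularized_policy(ref, rewards, beta)
    print(f"beta={beta:>4}: entropy={entropy(pi):.3f} nats")
```

	In this toy setup, a smaller `beta` (a weaker KL penalty) concentrates more mass on the highest-reward outcome, so the entropy of the resulting policy drops below that of the reference distribution.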