


	by astrange 1204 days ago
	That's how the text decoder works, but the model gets to define "most likely", and an RLHF model uses that freedom to make the text decoder produce useful answers instead.
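
	A toy sketch of that point, not how any real model is implemented: the decoder loop below is identical in both runs, and only the scoring function changes. The names `greedy_decode`, `base_model` and `tuned_model`, the tokens, and all the probabilities are made up for illustration.

```python
from typing import Callable, Dict, List

# A "model" here is just a function from context to next-token scores.
Scorer = Callable[[List[str]], Dict[str, float]]

EOS = "<eos>"


def greedy_decode(scorer: Scorer, prompt: List[str], max_tokens: int = 5) -> List[str]:
    """Repeatedly append whichever token the model scores highest."""
    tokens = list(prompt)
    for _ in range(max_tokens):
        scores = scorer(tokens)
        best = max(scores, key=scores.get)
        if best == EOS:
            break
        tokens.append(best)
    return tokens[len(prompt):]


# Hypothetical lookup tables keyed on the last token only.
BASE_TABLE = {
    "?": {"What": 0.6, "4": 0.3, EOS: 0.1},
    "What": {"is": 0.9, EOS: 0.1},
    "is": {"3+3": 0.8, EOS: 0.2},
    "3+3": {"?": 0.9, EOS: 0.1},
}
TUNED_TABLE = {
    "?": {"4": 0.8, "What": 0.1, EOS: 0.1},
    "4": {EOS: 1.0},
}


def base_model(context: List[str]) -> Dict[str, float]:
    # Stand-in for a pretrained model: a list of questions is a likely continuation.
    return BASE_TABLE.get(context[-1], {EOS: 1.0})


def tuned_model(context: List[str]) -> Dict[str, float]:
    # Stand-in for an RLHF-tuned model: the answer is now the "most likely" token.
    return TUNED_TABLE.get(context[-1], {EOS: 1.0})


prompt = ["What", "is", "2+2", "?"]
print(greedy_decode(base_model, prompt, max_tokens=4))
print(greedy_decode(tuned_model, prompt, max_tokens=4))
```

	The first call keeps writing more questions, the second answers and stops, even though the decoding code never changed.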