| HN Mirror


	by jdub 27 days ago
	Reinforcement learning perturbs the model's weights so that the token prediction process (inference) tends towards the desired result.
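	To make that concrete, here is a toy sketch rather than how any real LLM is trained. It assumes a three-token vocabulary, a "model" that is nothing but one logit per token, a made-up reward of 1 for the token "yes", and a plain REINFORCE update. The names `reinforce`, `VOCAB`, and the reward function are all hypothetical, chosen only for illustration.

```python
import math
import random


def softmax(logits):
    """Turn raw logits into a probability distribution over tokens."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reinforce(vocab, reward_fn, steps=200, lr=0.5, seed=0):
    """Toy REINFORCE loop: sample a token, score it, nudge the logits.

    The "model" here is just one logit per token (an assumption made for
    illustration); a real language model has far more parameters.
    """
    rng = random.Random(seed)
    logits = [0.0] * len(vocab)
    for _ in range(steps):
        probs = softmax(logits)
        # Inference: sample one token from the current distribution.
        i = rng.choices(range(len(vocab)), weights=probs)[0]
        r = reward_fn(vocab[i])
        # The gradient of log p(i) w.r.t. the logits is onehot(i) - probs,
        # so a rewarded sample pushes its own logit up and the others down.
        for j in range(len(logits)):
            grad = (1.0 if j == i else 0.0) - probs[j]
            logits[j] += lr * r * grad
    return softmax(logits)


VOCAB = ["yes", "no", "maybe"]  # hypothetical vocabulary
before = softmax([0.0] * len(VOCAB))
after = reinforce(VOCAB, lambda tok: 1.0 if tok == "yes" else 0.0)

print("before:", dict(zip(VOCAB, before)))
print("after: ", dict(zip(VOCAB, after)))
```

	The sampling step never changes. Only the parameters move, and that is enough to shift what sampling tends to produce towards the rewarded token.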