


	by slashdave 28 days ago
	RL is more than facts. Synthetic feedback is an obvious approach. Does the model suggest code that compiles and performs well?
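
	To make the idea concrete, here is a minimal sketch of what such a synthetic reward could look like. Everything in it is an assumption for illustration: the name `synthetic_reward`, the 0.5/0.5 split between correctness and speed, and the time budget are hypothetical and not taken from any particular RL pipeline.

```python
import time


def synthetic_reward(source, func_name, test_cases, time_budget_s=0.5):
    """Hypothetical reward in [0, 1] for a model-generated Python snippet.

    0.0 if the code does not compile, crashes, or returns a wrong answer;
    otherwise 0.5 for correctness plus up to 0.5 for running quickly.
    """
    # Step 1: does it compile at all?
    try:
        code = compile(source, "<candidate>", "exec")
    except SyntaxError:
        return 0.0

    # Step 2: does it run and give the expected answers?
    # NOTE: exec() on untrusted model output is unsafe; a real setup
    # would run this inside a sandbox. This is only a sketch.
    namespace = {}
    try:
        exec(code, namespace)
        func = namespace[func_name]
        start = time.perf_counter()
        for args, expected in test_cases:
            if func(*args) != expected:
                return 0.0
        elapsed = time.perf_counter() - start
    except Exception:
        return 0.0

    # Step 3: does it perform well? Linear bonus that reaches zero
    # once the run uses up the whole (assumed) time budget.
    speed_bonus = max(0.0, 1.0 - elapsed / time_budget_s)
    return 0.5 + 0.5 * speed_bonus


if __name__ == "__main__":
    cases = [((1, 2), 3), ((0, 0), 0)]
    candidate = "def add(a, b):\n    return a + b\n"
    print(synthetic_reward(candidate, "add", cases))
```

	The point is that none of this needs a human label: the compiler, the test cases, and the clock supply the feedback signal.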