| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by rsfern 381 days ago
	“Kill the [model] for trying” kind of sounds like using reinforcement learning to get models to behave a certain way