| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by AndrewKemendo 79 days ago
	Training RL policies on edge cases by using humans to collect and instrument previously closed data systems.