| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by PartiallyTyped 1584 days ago
	If the reward function is sufficiently dense, then likely yes. RL uses function approximators to solve problems, so in theory the perturbations of the chaotic system shouldn't matter as much.