Hacker News new | ask | show | jobs
by PartiallyTyped 1584 days ago
If the reward function is sufficiently dense, then likely yes. RL uses function approximators to solve problems, so in theory the perturbations of the chaotic system shouldn't matter as much.