| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by PartiallyTyped 1110 days ago
	til. For whatever reason I totally imagined it was some RL based method trained on sims. In my defense, RL is used for control problems as well, but this is so cool! Thank you for sharing.

1 comments

4gotunameagain 1110 days ago

no serious, safety critical system uses RL (except tesla "autopilot" and we see how that went). Control theory algorithms can be validated to work within the desired envelope and produce a valid solution.

The big advantage of convexifying the problem, is that when it is convex you have a guarantee it can be solved in fixed time, a major requirement for real time systems

link

PartiallyTyped 1110 days ago

I wasn't thinking of DeepRL, but more on the more classical side of things with approximators other than neural NNs; but what you describe makes sense.

link

bo1024 1110 days ago

On that side, reinforcement learning bleeds over into control theory, so you're partly right.

link