Hacker News new | ask | show | jobs
by hntrader 1900 days ago
Looks like they used a variational encoder-decoder net to reduce the dimensionality of the action space.

Not manual and it sounds like a good idea.

1 comments

Author here, was pretty surprised to see this on HN when browsing over my coffee this morning. Your interpretation is correct, you use an encoder-decoder model to figure out what the dimensions of the task best for learning are.

The drawback is you can only learn tasks which are relatively similar (any time you restrict what motions are possible to improve learning, you obviously restrict what tasks are possible). The benefit is that you can learn tasks which do fall within the learned motion ranges a lot more quickly.

The best analogy within 'classical' control is task space control, where you do control in cartesian dimensions rather than the joint positions. But this has its own drawbacks in that you have to define these controllers manually, and Cartesian space is not sufficiently expressive / appropriate for many tasks.