Hacker News
by xenonite 2240 days ago
Given the fully trained RL model, would it be possible to infer and explain the optimal movement technique in terms of a simple rule set?