by MichaelRazum 297 days ago
The sample-efficiency claim sounds a bit strange, since they did not include the state-of-the-art sample-efficient algorithms, like Dreamer or TD-MPC. Also, PPO is known to be compute efficient, not sample efficient.