by MichaelRazum 297 days ago
The claim about sample efficiency sounds a bit strange, since they did not include state-of-the-art sample-efficient algorithms like Dreamer or TD-MPC. Also, PPO is known to be sample-inefficient; it is only compute-efficient.