Hacker News
by
ramanvarma
237 days ago
Do you have benchmarks on tasks with sparse rewards or partial observability? I feel like that's where most "train any agent" claims tend to break down.
PaulRobinson
237 days ago
It doesn't replace core algorithms. It plumbs things together. It means you don't have to write the framework to connect things, but your algos are still going to have the same problems they had before.