| HN Mirror



	by ramanvarma 237 days ago
	Do you have benchmarks on tasks with sparse rewards or partial observability? I feel like that's where most "train any agent" claims tend to break down.

1 comment

PaulRobinson 237 days ago

It doesn't replace core algorithms. It plumbs things together. You don't have to write the framework that connects things, but your algos are still going to have the same problems they had before.
