|
|
|
|
|
by SpaceManNabs
442 days ago
|
|
> Yeah it's surprising that it works from such sparse rewards. I think imagining a lot of scenarios in parallel using the world model does some of the heavy lifting here. This is such gold. Thanks for sharing. Immediately added to my notes. |
|