|
|
|
|
|
by mcbuilder
336 days ago
|
|
This article stands as complete hype. They just seem to offer an idea of "replication training" which is just some vague agentic distributed RL. Multi-agent distributed reinforcement learning algorithms have been in the actual literature for a while. I suggest studying what DeepMind is doing for current state of the art in agentic distributed RL. |
|
The vague part is whether this will generalize to other non software domains.