Hacker News new | ask | show | jobs
by aoeusnth1 442 days ago
As far as you know, AI labs are doing E2E RL training with running code in the loop to advance the model's capability to act as an agent (for cursor et al).