|
|
|
|
|
by YmiYugy
2 days ago
|
|
Makes me wonder, as people grow to trust the AI more and more, not reading the code and barely skimming the implementation plans and simply rerolling if something doesn't work, will the value of these chats erode?
Thinking back 1-1.5 years I was closely monitoring what these agents did and steering them quite aggressively. These days not so much.
Where will RL signals come from when it approaches humans capabilities ever closer?
How well does self play work for coding work?
What about multistep tasks where it isn't just about being good at a single task, but evolving a codebase over time in the face of changing requirements? |
|