|
|
|
|
|
by charcircuit
113 days ago
|
|
It can learn. When my agents makes mistake they update their memories and will avoid making the same mistakes in the future. >Reinforcement learning, on the other hand, can do that, on a human timescale. But you can't make money quickly from it. Tools like Claude Code and Codex have used RL to train the model how to use the harness and make a ton of money. |
|
That kind of capability is not going to lead to AGI, not even close.