by CamperBob2
11 days ago
And that's all we do, and it's all we need, and it's probably all there is. The discovery that reinforcement learning allows next-token prediction to extrapolate beyond its pretrained data set is harder to explain than the discovery of fire or the wheel or electricity, but it's up there on that level.