by csomar
125 days ago
> If anything, they predict words based on a heuristic ensemble of what word is most likely to come next in similar sentences and what word is most likely to give a final higher reward.

So... "finding the most likely next word based on what they've seen on the internet"?

[1] https://arxiv.org/pdf/2509.19249