|
|
|
|
|
by andy12_
120 days ago
|
|
Unless the LLM is a base model or just a finetuned base model, it definitely doesn't predict words just based on how likely they are in similar sentences it was trained on. Reinforcement learning is a thing and all models nowadays are extensively trained with it. If anything, they predict words based on a heuristic ensemble of what word is most likely to come next in similar sentences and what word is most likely to give a final higher reward. |
|
So... "finding the most likely next word based on what they've seen on the internet"?