|
|
|
|
|
by tikhonj
3 hours ago
|
|
The point is that it's the same process with—much—better priors. This seems like a reasonable view to me. It's surprising just how much better priors matter and how we can develop those priors by training on a bunch of text. But it also explains, or at least hints at an explanation, for why LLM capabilities are so jagged, and in such inhuman ways. |
|
Except it’s not at all the same process. The fact that LLM are non deterministic is not the same as churning out random garbage.