|
|
|
|
|
by Isamu
38 days ago
|
|
Oh, you mean somewhere it is tracking the statistical likelihood of the output. Yeah I buy that, although I think it just tends towards the most likely output given the context that it is dragging along. I mean it wouldn’t deliberately choose something really statistically unlikely, that’s like a non sequitur. |
|
But it then throws that distribution away / consumes it in the next token calculation. So it's not really tracking it per se.