|
|
|
|
|
by sharemywin
1214 days ago
|
|
I see 2 options: 1. we don't know what they(coding layer between bing and GPT) look up and store as a prompt aka working memory. 2. it can do the equivalent of receiving it's own prompt silently. I seen with code it outputs the step for the code then writes the code. so there's some kind of plan and execute going on. maybe it can do that in model some how |
|
The simple answer is that the internal state that picks the next token is stable over iterations so that the model can follow a consistent plan over multiple token outputs. Then as the plan "unfolds" in the output tokens, these tokens help stabilize the plan further, thus creating consistency over long generations.