by mlsu
827 days ago
We don't have a model for "Where is the conversation going," we have a model for "What's the next token," which only implicitly models "Where is the conversation going." The difference is significant here, because the type of planning I've described requires direct manipulation of that implicit model. It's the same reason these LLMs are not "agents": you can only manipulate their world model through the interface of tokens.
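To make the point concrete, here is a minimal sketch, not how any real LLM is implemented. The bigram table, `next_token_distribution`, and `generate` are all hypothetical stand-ins I made up for illustration. The thing to notice is that nothing in the loop represents "where the conversation is going"; the only lever the caller has is which tokens go in.

```python
from typing import Dict, List

# Hypothetical toy "model": a fixed bigram table standing in for an LLM.
BIGRAMS: Dict[str, Dict[str, float]] = {
    "<s>": {"hello": 0.6, "goodbye": 0.4},
    "hello": {"world": 0.9, "</s>": 0.1},
    "goodbye": {"world": 0.2, "</s>": 0.8},
    "world": {"</s>": 1.0},
}


def next_token_distribution(tokens: List[str]) -> Dict[str, float]:
    """The model's only interface: tokens in, next-token probabilities out."""
    return BIGRAMS[tokens[-1]]


def generate(prompt: List[str], max_steps: int = 10) -> List[str]:
    """Greedy decoding: repeatedly append the most likely next token."""
    tokens = list(prompt)
    for _ in range(max_steps):
        dist = next_token_distribution(tokens)
        best = max(dist, key=lambda tok: dist[tok])
        tokens.append(best)
        if best == "</s>":
            break
    return tokens


# No "destination" variable exists to set directly.
# The only way to change where the text ends up is to change the tokens.
default_path = generate(["<s>"])
steered_path = generate(["<s>", "goodbye"])
```

Here `steered_path` differs from `default_path` only because an extra token was appended to the prompt. Any "plan" lives implicitly in the probabilities and can't be reached except through that token interface.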