|
|
|
|
|
by loudmax
1049 days ago
|
|
My understanding of ghost attention is that the interface will inject periodic reminders into the conversation. These are seen by the model but hidden from the user. They do use up some of the tokens available in the context window. |
|
But during inference, there's no trick. The system message remains once at the top.