by omneity
990 days ago
Does an LLM need to loop back to re-read its input, even in a regular (read: non-sliding) context window? Maybe I'm misunderstanding, but doesn't the hidden state solve the "lookup" problem in this case? The LLM has to ingest your entire input before answering anyway, so whether your instruction comes at the front or at the end should have little impact beyond how attention is distributed.
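
To make the question concrete, here's a toy sketch of what I have in mind. It assumes a standard decoder-only causal mask, not any particular model, and the names (`causal_attention_weights`, the 4-token prompt, the all-zero scores) are made up for illustration:

```python
import math

def causal_attention_weights(scores):
    """Row-wise softmax over a causally masked score matrix.

    scores[i][j] is the raw score of query position i against key position j.
    Position i may only attend to positions j <= i.
    """
    n = len(scores)
    weights = []
    for i in range(n):
        visible = scores[i][: i + 1]      # keys at or before position i
        peak = max(visible)               # subtract the max for numerical stability
        exps = [math.exp(s - peak) for s in visible]
        total = sum(exps)
        # Masked (future) positions get zero weight.
        row = [e / total for e in exps] + [0.0] * (n - i - 1)
        weights.append(row)
    return weights

# Hypothetical 4-token prompt with uniform raw scores (an assumption for the demo).
n_tokens = 4
scores = [[0.0] * n_tokens for _ in range(n_tokens)]
weights = causal_attention_weights(scores)

last_row = weights[-1]   # the final position spreads weight over every token
first_row = weights[0]   # the first position can only see itself
```

Under that assumption the final position can attend to every earlier token directly, so nothing has to be "re-read". The asymmetry runs the other way: earlier tokens are encoded without seeing anything that comes after them, which might be one place where instruction order still matters.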