|
|
|
|
|
by xmprt
378 days ago
|
|
On the other hand, I'm skeptical if that has any impact because these models have thinking tokens where they can put all those comments and attention shouldn't care about how close the tokens are as long as they're within the context window. |
|