|
|
|
|
|
by famouswaffles
850 days ago
|
|
>What's actually happening is that by asking a model to generate more tokens, it increases the amount of information it has learnt to be present in its context block. I'm not saying this isn't part of it but even if it's just dummy tokens without any new information, it works. https://arxiv.org/abs/2310.02226 |
|