by t55
509 days ago
Thanks for reading! In most contexts (including this one), sequence length covers both the initial input (prompt) tokens and the output tokens the model generates. It is the total number of tokens the model has processed so far.
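
To make that concrete, here is a minimal Python sketch. The token IDs and the `seq_length` helper are made up for illustration and are not taken from any particular library; real token counts depend on the tokenizer in use.

```python
# Hypothetical token IDs; real values depend on the tokenizer.
prompt_tokens = [101, 2023, 2003, 1037, 3231]  # 5 input (prompt) tokens
generated_tokens = []                          # output tokens produced so far


def seq_length(prompt, generated):
    """Total tokens processed so far: prompt tokens plus generated tokens."""
    return len(prompt) + len(generated)


# Record the sequence length before generation and after each new token.
lengths = [seq_length(prompt_tokens, generated_tokens)]
for new_token in [7592, 2088, 102]:  # pretend the model emits three tokens
    generated_tokens.append(new_token)
    lengths.append(seq_length(prompt_tokens, generated_tokens))

print(lengths)  # [5, 6, 7, 8]
```

The length starts at the prompt size and grows by one with every generated token.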
The template will be hugely helpful for a non-programmer like me.