Y
Hacker News
new
|
ask
|
show
|
jobs
by
foxhop
638 days ago
The llama 3.0, 3.1, & 3.2 all use the TikToken tokenizer which is the open source openai tokenizer.
1 comments
littlestymaar
638 days ago
GP is talking about context windows, not the number of token used by the tokenizer.
link
sva_
637 days ago
Somewhat confusingly, it appears the tokenizer vocabulary as well as the context length are both 128k tokens!
link
littlestymaar
637 days ago
Yup, that's why I wanted to clarify things.
link