Y
Hacker News
new
|
ask
|
show
|
jobs
by
worldsayshi
1189 days ago
2048 words?
3 comments
wongarsu
1189 days ago
Tokens. Short or common words tend to be one token, while less common words are composed of multiple tokens. For GPT OpenAI gives the rule of thumb that on average you need four tokens to encode three words, and LLaMA should be similar
link
worldsayshi
1189 days ago
Well that's for sure bigger than my context size.
link
doctoboggan
1189 days ago
2048 "tokens", where one token is roughly equivalent to ¾ of a word
link
teaearlgraycold
1189 days ago
Tokens
link