Hacker News new | ask | show | jobs
by worldsayshi 1189 days ago
2048 words?
3 comments

Tokens. Short or common words tend to be one token, while less common words are composed of multiple tokens. For GPT OpenAI gives the rule of thumb that on average you need four tokens to encode three words, and LLaMA should be similar
Well that's for sure bigger than my context size.
2048 "tokens", where one token is roughly equivalent to ¾ of a word
Tokens