Hacker News new | ask | show | jobs
by winchester6788 2122 days ago
Length of given text after encoding it with their BPE. In general, you can expect it to be 1.2x-2x the len(words) in your text.