Hacker News new | ask | show | jobs
by antirez 531 days ago
Oh, that makes sense! So they use the probability of the next token itself. Thanks for clarifying. Also clever trick about the multiple potential tokens to represent the same text.