Hacker News new | ask | show | jobs
by Ambix 1103 days ago
For those who interested, there some new researches in the field [0]. It usually possible to create more compact token representation from given text, but my guess the greedy "optimal" tokenizer might harm the performance of the model?

[0] https://www.reddit.com/r/LocalLLaMA/comments/140gcn7/new_tok...