Hacker News
by axeldunkel 61 days ago
The better the tokenizer maps text to the model's internal representation, the better the model understands what you are saying, or coding! But 4.7 is much more verbose in my experience, and that probably drives up cost and limit usage a lot.