Y
Hacker News
new
|
ask
|
show
|
jobs
by
nisegami
326 days ago
How consistent is the tokenization across different model families? It always served as a mental hangup for me when comparing LLM inference pricing.
1 comments
alexellman
326 days ago
They all tokenize a little differently so they are not exactly 1-1. However I plan on addressing this by having each model complete a test task and getting the actual price from each api + token count to make a real 1-1 comparison.
link
esafak
326 days ago
And please timestamp the benchmarks, and rerun them periodically, so vendors can't quietly cost optimize the model when no-one's looking.
link
nisegami
326 days ago
Ah, that's a great idea and would be a welcome addition to the site.
link