| HN Mirror



	by great_psy 60 days ago
	Is there any stated reason from Anthropic for why they changed the tokenizer? Is there a quality increase from this change, or is it a money grab?

4 comments

Aurornis 60 days ago

The tokenizer is an important part of overall model training and performance. It’s only one piece of the overall cost per request. If a tokenizer that produces more tokens also leads to a model that gets to the correct answer more quickly and requires fewer re-prompts because it didn’t give the right answer, the overall cost can still be lower.

Comparisons are still ongoing but I have already seen some that suggest that Opus 4.7 might on average arrive at the answer with fewer tokens spent, even with the additional tokenizer overhead.

So, no, not a money grab.
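
The cost argument above can be made concrete with a little arithmetic. This is a minimal sketch, not a measurement: the function name `expected_cost`, the token counts, the price, and the success rates are all hypothetical, and it assumes each retry is independent, so the expected number of attempts is 1 / success rate.

```python
def expected_cost(tokens_per_attempt, price_per_token, success_rate):
    """Expected spend until the first correct answer.

    Assumes independent retries, so the number of attempts follows a
    geometric distribution with mean 1 / success_rate.
    """
    expected_attempts = 1.0 / success_rate
    return tokens_per_attempt * price_per_token * expected_attempts


# Hypothetical numbers for illustration only, not data about any real model.
# "old": fewer tokens per attempt, but the answer is right only half the time.
old = expected_cost(tokens_per_attempt=1000, price_per_token=1e-5, success_rate=0.5)
# "new": 30% more tokens per attempt, but right 80% of the time.
new = expected_cost(tokens_per_attempt=1300, price_per_token=1e-5, success_rate=0.8)

print(f"old: ${old:.5f}  new: ${new:.5f}")
```

Under these made-up numbers the tokenizer that emits 30% more tokens still comes out cheaper per solved task, which is the scenario the comment describes. Whether real usage looks like this depends on the actual success rates.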

ChadNauseam 60 days ago

How would it be a money grab? If the new tokenizer requires more tokens to encode the same information, it costs them more money for inference. The point of charging per token is that the cost is proportional to the number of tokens. That's my understanding, anyway.

abrookewood 60 days ago

Because everyone burns through their limits much faster, forcing them to upgrade to higher limits or new tiers.

Jtarii 60 days ago

I think someone would much sooner switch to a competitor than up their tier.

dandaka 60 days ago

If a model provider believes they have a better model, it can be a viable bet. But many (me included) started experimenting with other providers because of enshittification from Anthropic (price + uptime), only to find that Codex is not much worse in quality and gives significantly more output per $.

simianwords 60 days ago

They could just increase the token cost, no? There’s little need for cute conspiracies like these.

sumeno 60 days ago

They would have to tell people if they did that.

svnt 60 days ago

There are no conspiracies where a corporation has profit incentive. There is perhaps a question of planning and initial intentionality, but the metrics and motivation to continue are clear enough.

msp26 60 days ago

Not necessarily, with speculative decoding. Whitespace would be trivial to predict, and they would pretty much keep using the same amount of compute as before.

I don't think that's their primary motive for doing this but it is a side effect.
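
The speculative decoding point can be illustrated with the standard expected-acceptance formula. This is a rough sketch under simplifying assumptions: a fixed, independent per-token acceptance rate, and hypothetical values for that rate and the draft length. It says nothing about how any provider actually serves its models.

```python
def tokens_per_target_pass(acceptance_rate, draft_len):
    """Expected tokens emitted per forward pass of the large model.

    Assumes each drafted token is accepted independently with probability
    `acceptance_rate`, and that every pass yields at least one token.
    Closed form: (1 - a**(g + 1)) / (1 - a) for a < 1.
    """
    if acceptance_rate >= 1.0:
        return float(draft_len + 1)
    return (1 - acceptance_rate ** (draft_len + 1)) / (1 - acceptance_rate)


# Hypothetical acceptance rates: hard-to-predict text vs. easy text
# such as runs of whitespace.
hard = tokens_per_target_pass(acceptance_rate=0.5, draft_len=4)
easy = tokens_per_target_pass(acceptance_rate=0.9, draft_len=4)

print(f"hard text: {hard:.4f} tokens/pass, easy text: {easy:.4f} tokens/pass")
```

The more predictable the extra tokens are, the more of them each large-model pass covers, so billed token count can grow faster than compute does.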

Symmetry 60 days ago

If they wanted, they could always just double the $/token. They don't seem to be able to keep up with their current demand, and raising prices is what companies normally do in that circumstance if they're looking to money grab. No need for the bank-shot approach.

nl 60 days ago

It's a better model in my usage. I have benchmarks.
