| HN Mirror



	by bachittle 309 days ago
	OpenAI definitely tarnished the name of GPT-5 by allowing these issues to occur. It's clearly a smaller model optimized for cost and speed. Compare it to GPT-4.5, which didn't have these errors but was "too expensive for them". This is why Anthropic's naming system of Haiku, Sonnet, and Opus to represent size is really nice. It prevents this confusion.

5 comments

NoahZuniga 309 days ago

> This is why Anthropic's naming system of Haiku, Sonnet, and Opus to represent size is really nice. It prevents this confusion.

In contrast to GPT-5, GPT-5 mini and GPT-5 nano?

prophesi 309 days ago

I think it's a valid complaint that the naming scheme for the various GPT-4 models was very confusing. GPT-5 just launched, and doesn't (yet?) have a GPT-5 Turbo or GPT-o5 mini to muddy the waters.

tempodox 309 days ago

In marketing, confusion is a feature, not a bug.

Taek 309 days ago

The problem is that GPT-5 is a smaller model than its predecessors.

csallen 309 days ago

But there's nothing in Claude's naming scheme stopping Claude 5 from being smaller than its predecessors.

hnlmorg 309 days ago

Yeah, one of the main reasons I switched my tooling to default to Anthropic models, despite using OpenAI for months beforehand, is that I often switch between model sizes depending on the complexity of the prompt versus how quickly I want the result.

I would frequently spend time going back to OpenAI’s site to remind myself of their different models. There’s no consistency there whatsoever. But with Anthropic it was easy.

If I have to spend 5 minutes picking a model then I might as well do the task myself. So Claude became a natural solution for me.

andrewla 309 days ago

> OpenAI definitely tarnished the name of GPT-5 by allowing these issues to occur

For a certain class of customer maybe that is true.

But the reality is that the fact that this occurs is very encouraging -- they are not micro-optimizing to solve cosmetic problems that serve no functional purpose. They are instead letting these phenomena serve as external benchmarks of a sort to evaluate how well the LLM can work on tasks that are outside of its training data, and outside of what one would expect the capabilities to be.

radicality 309 days ago

Oh wow, I stare at those model names every day, and I only just now after reading your comment realized what “haiku”, “sonnet”, and “opus” imply about the models! Seems super obvious in retrospect but never thought about it!

rootnod3 309 days ago

I mean yeah, but to many non-native speakers, sonnet and opus don't immediately convey size or complexity of the models.

csallen 309 days ago

I'm a well-educated native English speaker and "haiku", "sonnet", and "opus" don't immediately make me think of their size differences.

rootnod3 309 days ago

Exactly. That doesn't mean OpenAI's naming is better or worse. None of them convey anything out of the gate.

4.large, 4.medium, 4.fast, 4.reasoning, etc., or something similar would probably be better.

hnlmorg 309 days ago

OpenAI easily has worse naming.

Anthropic model names might not immediately conjure up their size and performance, but the name is at least internally consistent. Once you know what Anthropic call “medium”, you know what it is for all model releases.

Whereas OpenAI’s naming convention, if you can even call it a “convention”, feels absolutely random even to those in the industry.

I do like your proposed naming convention though. It doesn’t sound “cool” so I can’t see any product managers approving it within the AI tech firms. But it’s definitely the best naming convention for models I’ve seen suggested for a while.

hnlmorg 309 days ago

I agree it’s not perfect. But it’s just 3 terms those non-English speakers need to learn. Which is a lot easier than having to remember every OpenAI model name and how it compares to every other one.

rootnod3 309 days ago

Sure. I wasn't arguing that OpenAI's naming is better. It is way worse. But Anthropic also doesn't have a sure-fire naming scheme there either.

hnlmorg 309 days ago

But it’s still better. Which is the point myself and the GP are making.

It might not be perfect, but it’s still a hell of a lot better.

rootnod3 309 days ago

So, 3 arcane barely used words in daily conversation are better than OpenAI's 4, 4o, 5, etc?

hnlmorg 309 days ago

Yes because 5 is smaller than 4, and 4o isn’t even a number.

Also, some ChatGPT models include “gpt” in the name. Others do not.

I cannot guess what model string I need to pass. Whereas with Anthropic I can. And if I have to look it up each time on OpenAI’s website, then it’s clearly garbage.

Also, the “arcane barely used” part of your post is entirely subjective. I get that you want to argue Anthropic’s naming is poor to support your point about OpenAI, but you’re exaggerating there.

NegativeLatency 309 days ago

what's so wrong with: small, medium, and large?

hnlmorg 309 days ago

What makes you think that I think there’s anything wrong with s/m/l?

iLoveOncall 309 days ago

I think non-native speakers have the ability to remember that one word equals big and another equals medium.

If anything, it's a lot less confusing than the awful naming convention from OpenAI up until 5.

rootnod3 309 days ago

How about just calling it 4.large, 4.medium, etc.? Is it that difficult?

Sure, an opus is supposed to be large, but a sonnet is not restricted in size but rather a style of poem. So sonnet and opus mean nothing when compared to each other.
