Hacker News new | ask | show | jobs
by minimaxir 58 days ago
It's funny that 128B is now considered Medium. I remember back in the day when 355M parameters was considered medium with GPT-2.
1 comments

And GPT-2 1.5B was considered too dangerous to release.

They were perhaps right.

considered that by OpenAI for marketing purposes that is

But yes, perhaps it would have been better for all of us if they haven't.

In lockstep over the past month, a subset of people, un-labelable, unprompted, share this train of thought:

- Mythos wasn't released widely.

- But Anthropic shared info on it and said it was dangerous.

- Anthropic is a company.

- Companies like money.

- Therefore Mythos is marketing hype.

- Remember GPT-2? That also wasn't released. They said it was dangerous.

- But, GPT-3, GPT-4, GPT-5, etc. were released.

- Therefore GPT-2 being dangerous was marketing hype.

I've seen the idea that GPT-2 not being released was marketing hype at least 6 times since Mythos was shared.

It's Not Even Wrong, in the Pauli sense: they weren't selling anything! They weren't raising funding! What were they marketing!?

And there's a lot more elided from history, ex. they didn't have an API yet.

GPT-3 was released, a year or two later, and did have an API. But, no one used it, it wasn't good enough yet. And they did treat it as dangerous, it was wildly over-the-top manually monitored for anything resembling not-intended-use. I got permanently suspended for using the word "twink"

> I've seen the idea that GPT-2 not being released was marketing hype at least 6 times since Mythos was shared.

That's not what I am saying.

It's not that GPT-2 not being released was marketing hype, it's that OpenAI themselves claiming it's too dangerous to release specifically, implying it's close to AGI, (or something like that), was marketing hype.

That may sound more defensible to you, but its even more detached from reality. I feel very old right now because I actually read the thing at the time, but setting that aside, do you really think anyone thought or said GPT-2 was AGI?

I don't think you do.

I only mention reading it because that would clear it up, and you seem interested, and your parenthetical indicates A) you're aware you're claiming something a bit silly and B) you don't know what was actually said.

> do you really think anyone thought or said GPT-2 was AGI?

> I don't think you do.

I don't think they did. I think the marketing around it was such as to imply to the general public that it's not that far from AGI/let their imagination run wild, because it's useful marketing.

Same way as Apple's marketing around the iPad was that it's the 'Super. (full stop) (space) Computer. (full stop)' They never say it's any kind of a 'supercomputer'. But they know how that is going to be interpreted by many. It's intentional marketing.

Same with OpenAI.

I do think you're playing obtuse at this point, if you don't get that.