| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Philpax 946 days ago
	Is it me or do they not mention the size of the model at all? Pretty hard to compare it with other models when we don't know what weight class it's in...

3 comments

machdiamonds 946 days ago

Emad (Stability AI) thinks it's a 300B model https://twitter.com/EMostaque/status/1727373950685200674

link

thawab 946 days ago

Why would the deepmind cofounder go for 1 model and not a MOE architecture like gpt4? 1.3 billion dollars is not enough to get you to gpt4?

link

csjh 946 days ago

I disagree, IMO for any model that isn't open source model size is just an implementation detail. If someone released a 10 trillion parameter model that's better than GPT4 it isn't somehow inferior because it has more parameters.

link

moonsu 946 days ago

Why? You’re not going to be running it on your own hardware. As an end user all that matters are the results.

link