Hacker News new | ask | show | jobs
by Philpax 946 days ago
Is it me or do they not mention the size of the model at all? Pretty hard to compare it with other models when we don't know what weight class it's in...
3 comments

Emad (Stability AI) thinks it's a 300B model https://twitter.com/EMostaque/status/1727373950685200674
Why would the deepmind cofounder go for 1 model and not a MOE architecture like gpt4? 1.3 billion dollars is not enough to get you to gpt4?
I disagree, IMO for any model that isn't open source model size is just an implementation detail. If someone released a 10 trillion parameter model that's better than GPT4 it isn't somehow inferior because it has more parameters.
Why? You’re not going to be running it on your own hardware. As an end user all that matters are the results.