Is it me or do they not mention the size of the model at all? Pretty hard to compare it with other models when we don't know what weight class it's in...
I disagree, IMO for any model that isn't open source model size is just an implementation detail. If someone released a 10 trillion parameter model that's better than GPT4 it isn't somehow inferior because it has more parameters.