by gnfedhjmm2 731 days ago
The whole “parameters” thing is just a marketing term though. If they wanted to say they use a babillion parameters, how would that change anything? The term is subjective and open to interpretation.
1 comment

The parameter count is the literal size of the model. How is it a marketing term? Is saying that a computer has 16GB of RAM just a marketing term?
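To make the RAM analogy concrete, here is a rough sketch of how a parameter count maps to the size of the weights. The function name and the bytes-per-parameter table are my own illustration, not from any library, and it only counts the weights themselves (no activations, KV cache, or optimizer state).

```python
# Bytes per parameter for a few common storage formats (illustrative table).
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weight_footprint_gb(n_params: int, dtype: str = "fp16") -> float:
    """Approximate size of the weights alone, in GB (10**9 bytes)."""
    return n_params * BYTES_PER_PARAM[dtype] / 1e9


# A 70B-parameter model stored in fp16 needs about 140 GB for its weights.
print(weight_footprint_gb(70_000_000_000, "fp16"))  # 140.0
```

So the number is about as concrete as a RAM figure: change the count or the precision and the footprint changes with it.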
Well, the statement that GPT-4 is 1.8T parameters is a little misleading, since it's really an 8 x 220B MoE (according to the rumors at least), and a mixture-of-experts model only runs a subset of its experts for each token.
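A quick sketch of the arithmetic behind that, taking the rumored 8 x 220B figure at face value. The number of experts active per token (2 here) is purely a hypothetical value for illustration, and the sketch ignores any parameters shared across experts, so treat the output as a ballpark only.

```python
def moe_param_counts(n_experts: int, params_per_expert: int, active_experts: int):
    """Return (total, active-per-token) parameter counts for a simple MoE.

    Simplification: every parameter is assumed to live inside an expert;
    shared layers are ignored.
    """
    total = n_experts * params_per_expert
    active = active_experts * params_per_expert
    return total, active


# 8 experts of 220B each (rumored); active_experts=2 is a hypothetical choice.
total, active = moe_param_counts(8, 220_000_000_000, 2)
print(total / 1e12)  # 1.76 (trillion, i.e. the "1.8T" headline number)
print(active / 1e9)  # 440.0 (billion used per token under this assumption)
```

The headline total and the amount of the model actually exercised per token are different numbers, which is why quoting only the former can mislead.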

Also, the size of the model isn't the only factor that determines performance: Llama 3 70B outperforms Llama 2 70B even though they are the same size.