by smokel
648 days ago
Does anyone know why the sizes of these models are typically expressed as a number of weights (i.e., 1.5B and 9B in this case), without mentioning the size of the weights in bytes? For practical reasons, I often want to know how much GPU RAM is required to run these models locally. The raw weight count seems to express only some kind of relative power, which I doubt is relevant to most users. Edit: reformulated to sound like a genuine question instead of a complaint.
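
For a rough idea of the conversion being asked about, here is a back-of-the-envelope sketch. It assumes the weights dominate memory use and that each weight is stored in one of a few common formats; the format table and the `weight_size_gb` helper are illustrative, not taken from any particular library. Actual GPU RAM use will be higher, since activations, the KV cache, and framework overhead are not counted.

```python
# Assumed bytes per weight for a few common storage formats (illustrative).
BYTES_PER_WEIGHT = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_size_gb(num_weights: float, dtype: str) -> float:
    """Return the size of the weights alone, in GB (10**9 bytes)."""
    return num_weights * BYTES_PER_WEIGHT[dtype] / 1e9


# The two model sizes mentioned in the question.
for name, num_weights in [("1.5B", 1.5e9), ("9B", 9e9)]:
    sizes = ", ".join(
        f"{dtype}: {weight_size_gb(num_weights, dtype):.1f} GB"
        for dtype in BYTES_PER_WEIGHT
    )
    print(f"{name} -> {sizes}")
```

This may be part of the answer: the same weight count maps to very different byte sizes depending on the precision or quantization chosen, so the count is the one number that stays fixed.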