|
|
|
|
|
by omneity
938 days ago
|
|
GPT-4 is unlikely to be 1.7T params. This is a number floating around in the internet with no justification. The largest US open model is Google’s Switch-C which is 1.6T and only because it is a Mixture of experts model, i.e. it is constituted of many small models working together. |
|