| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Toolbox1337 938 days ago
	Not bad for a start by the US government. A bit more than half as powerful as the 1.7 trillion parameter GPT4 model.

1 comments

danpalmer 938 days ago

A bit more than half the size, it remains to be seen how powerful it is. There's clearly a non-linear relationship to model size, and it's also clear that it's hard to assess the power of these models anyway.

link

omneity 938 days ago

GPT-4 is unlikely to be 1.7T params. This is a number floating around in the internet with no justification.

The largest US open model is Google’s Switch-C which is 1.6T and only because it is a Mixture of experts model, i.e. it is constituted of many small models working together.

link

Toolbox1337 938 days ago

Ah, an important distinction to make. Good point.

link