Y
Hacker News
new
|
ask
|
show
|
jobs
by
bigeagle
335 days ago
I believe so.
Grok-1 is 341B, DeepSeek-v3 is 671B, and recent new open weights models are around 70B~300B.