by bigeagle 335 days ago
I believe so.

Grok-1 is 314B, DeepSeek-V3 is 671B, and recent open-weight models are around 70B to 300B.