Hacker News new | ask | show | jobs
by IceMetalPunk 1390 days ago
So the number of parameters metric here is the sum across all their models, right? There isn't yet a single model with 10 trillion parameters... right?
1 comments

Yes, it is the sum of all models. But M6-10T (MoE) of Alibaba is a 10 trillion one.