Hacker News new | ask | show | jobs
by fishingboy 1392 days ago
Yes, it is the sum of all models. But M6-10T (MoE) of Alibaba is a 10 trillion one.