Hacker News new | ask | show | jobs
by JLO64 56 days ago
35B (35 billion) is the number of parameters this model has. Its a Mixture of Experts model (MoE) so A3B means that 3B parameters are Active at any moment.
1 comments

~I see. What’s the 6?~

Nevermind, the other reply clears it