Y
Hacker News
new
|
ask
|
show
|
jobs
by
nivvis
311 days ago
for posterity, since shown that is it actually MoE
> 21B parameters with 3.6B active parameters