by coder543
166 days ago
I meant large MoE models are more socially accepted now. They were not when Llama 4 launched, and I believe that worked against the Llama 4 models. The Llama 4 models are MoE models, in case you are unaware, since your comment seemed to imply they were dense models.