by plagiarist
80 days ago
Is there a size cutoff where you would say diminishing returns really kick in? My experience doesn't disagree, at least. I've been using Qwen for coding locally a bit. It is much better than I thought it would be, but it still falls short of the frontier models in some obvious ways.
No idea yet. But also it's obvious that making LLMs without MoE is stupid.