| HN Mirror



	by pocksuppet 135 days ago
	There's one at the top of Hacker News right now, Qwen3-Coder-Next: https://news.ycombinator.com/item?id=46872706

1 comment

int_19h 135 days ago

An 80B MoE model with 3B active params per token is not a competent model regardless of what their cherry-picked benchmarks say. This reminds me of back when every other llama-7b finetune was claiming to be "GPT-4 quality".
