Hacker News
by
tempaccount420
79 days ago
Benchmarks are not useful for deciding the "size class". Bigger size means more knowledge. Also, Qwen 3.5 27B is a dense model, so all 27B parameters are active. StepFun 3.5 Flash has 11B active parameters.
lostmsu
79 days ago
> Bigger size means more knowledge.
Qwen 3.5 27B beats StepFun 3.5 Flash on GPQA Diamond too, so probably no.