Y
Hacker News
new
|
ask
|
show
|
jobs
by
anana_
102 days ago
I've had even better results using the dense 27B model -- less looping and churning on problems
1 comments
androiddrew
102 days ago
Which dense model are you referring to? The dense model isn’t finetuned for code instruction according to the model card.
link
anana_
102 days ago
https://huggingface.co/Qwen/Qwen3.5-27B
I wasn't aware of that, which page mentions that?
link
zerebos
102 days ago
Yeah the page you linked even shows the benchmarks in coding for this model, so I'd be curious where that claim comes from
link