Hacker News new | ask | show | jobs
by anana_ 102 days ago
I've had even better results using the dense 27B model -- less looping and churning on problems
1 comments

Which dense model are you referring to? The dense model isn’t finetuned for code instruction according to the model card.
https://huggingface.co/Qwen/Qwen3.5-27B

I wasn't aware of that, which page mentions that?

Yeah the page you linked even shows the benchmarks in coding for this model, so I'd be curious where that claim comes from