| HN Mirror



	by anana_ 102 days ago
	I've had even better results using the dense 27B model -- less looping and churning on problems

1 comment

Which dense model are you referring to? The dense model isn’t finetuned for code instruction according to the model card.

I wasn't aware of that. Which page mentions it?

Yeah, the page you linked even shows coding benchmarks for this model, so I'd be curious where that claim comes from.