| HN Mirror

	by impossiblefork 593 days ago
	It's a much smaller model though. I think the point is more the demonstration that such a small model can have such good performance than any actual usefulness.

1 comment

Gemma2 9B has significantly better prompt adherence than Llama 3.1 8B in my experience.

I've just assumed it's down to how it was trained, but I'm no expert.