Hacker News new | ask | show | jobs
by impossiblefork 546 days ago
It's a much smaller model though.

I think the point is more the demonstration that such a small model can have such good performance than any actual usefulness.

1 comments

Gemma2 9B has significantly better prompt adherence than Llama 3.1 8B in my experience.

I've just assumed it's down to how it was trained, but no expert.