Hacker News new | ask | show | jobs
by Me1000 558 days ago
The 32B parameter model size seems like the sweet spot right now, imho. It's large enough to be very useful (Qwen 2.5 32B and the Coder variant our outstanding models), and they run on consumer hardware much more easily than the 70B models.

I hope Llama 4 reintroduces that mid sized model size.

1 comments

qwen2.5 looks like magic compared to llama3.2.