Hacker News new | ask | show | jobs
by wild_egg 340 days ago
I think the issue there is those smaller versions of those models. I regularly use Gemma3 and Qwen3 for programming without issue but in the 27b-32b range. Going smaller than that generally yields garbage.
1 comments

I've tried 24-32b sizes as well and besides being even slower they were also unreliable.