Hacker News
by
wild_egg
340 days ago
I think the issue there is the smaller versions of those models. I regularly use Gemma3 and Qwen3 for programming without issue, but only in the 27b-32b range. Going smaller than that generally yields garbage.
cess11
339 days ago
I've tried 24-32b sizes as well, and besides being even slower, they were also unreliable.