Hacker News new | ask | show | jobs
by iamnotagenius 506 days ago
would be great to have dynamic quants of V3-non-R1 version, as for some tasks it is good enough. Also would be very interesting to see degradation with dynamic quants on small/medium size MoEs, such as older Deepseek models, Mixtrals, IBM tiny Granite MoE. Would be fun if Granite 1b MoE will still be functioning at 1.58bit.
1 comments

Oh yes multiple people have asked me about this - I'll see what I can do :)