Hacker News new | ask | show | jobs
by jszymborski 5 days ago
I think the Qwen 0.6B is so cool. It is super fast and as illustrated here it has a clear niche, esp. when fine-tuned.

I'm also interested in it as a student for distillation.