|
|
|
|
|
by msgodel
353 days ago
|
|
Wow. Close to a Qwen3 distill with 75% the size. That's great! I've been using the smollm base models for my own finetunes just because they're so high quality, it looks like I might be using them to drive local agents/code completion in the near future too. Their RL algorithm looks interesting. I'm still using OpenAI's algorithm for my stuff, I've been meaning to check on the SoTA since I know my code is pretty outdated (It's crazy how fast that happens with this stuff.) |
|