Hacker News new | ask | show | jobs
by rafaquintanilha 498 days ago
You know you are running an extremely nerfed version of the model, right?
1 comments

I did update my comment, but said that I am using the distilled version, so yes?
Even the full model scores below Claude on livebench so a distilled version will likely be even worse.
Based on the leaderboard R1 is significantly better than Claude? https://livebench.ai/#/
Not at coding.