Hacker News new | ask | show | jobs
by leovander 499 days ago
I did update my comment, but said that I am using the distilled version, so yes?
1 comments

Even the full model scores below Claude on livebench so a distilled version will likely be even worse.
Based on the leaderboard R1 is significantly better than Claude? https://livebench.ai/#/
Not at coding.