Hacker News
by
rafaquintanilha
498 days ago
You know you are running an extremely nerfed version of the model, right?
leovander
498 days ago
I did update my comment, but I had already said that I am using the distilled version, so yes?
cbg0
498 days ago
Even the full model scores below Claude on LiveBench, so a distilled version will likely be even worse.
rsanek
498 days ago
Based on the leaderboard, R1 is significantly better than Claude?
https://livebench.ai/#/
cbg0
498 days ago
Not at coding.