Y
Hacker News
new
|
ask
|
show
|
jobs
by
leovander
499 days ago
I did update my comment, but said that I am using the distilled version, so yes?
1 comments
cbg0
499 days ago
Even the full model scores below Claude on livebench so a distilled version will likely be even worse.
link
rsanek
499 days ago
Based on the leaderboard R1 is significantly better than Claude?
https://livebench.ai/#/
link
cbg0
499 days ago
Not at coding.
link