| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by rafaquintanilha 544 days ago
	You know you are running an extremely nerfed version of the model, right?

1 comments

I did update my comment, but said that I am using the distilled version, so yes?

Even the full model scores below Claude on livebench so a distilled version will likely be even worse.

Based on the leaderboard R1 is significantly better than Claude? https://livebench.ai/#/

Not at coding.