Hacker News
by slewis 502 days ago
It would be really useful to see these run on some of the same evals that the original R1 and DeepSeek's distills were evaluated on.