Hacker News new | ask | show | jobs
by codecheers 49 days ago
With-skill vs without-skill evals are useful, but what about comparing skills against each other? Is there an emerging standard for saying one Skill is better than another, beyond custom pass/fail evals?