Hacker News
by codecheers 49 days ago
With-skill vs without-skill evals are useful, but what about comparing skills against each other? Is there an emerging standard for saying one Skill is better than another, beyond custom pass/fail evals?