Hacker News
by kovezd 273 days ago
By the composition of evals, plus secondary metrics like parameter size and token cost.
Not perfect, but useful.
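A rough sketch of what that could look like, in case it helps. Everything here is an assumption for illustration: the model names, eval names, scores, field names (`params_b`, `cost_per_mtok`), and the equal weighting are all made up, and treating the secondary metrics as tie-breakers is just one possible reading.

```python
def composite_score(evals, weights=None):
    """Weighted mean of eval scores (assumed to be normalized to [0, 1])."""
    if weights is None:
        # Assumption: equal weights unless told otherwise.
        weights = {name: 1.0 for name in evals}
    total = sum(weights[name] for name in evals)
    return sum(evals[name] * weights[name] for name in evals) / total


def rank(models):
    """Order by composite score (higher first), then by secondary metrics:
    fewer parameters first, then lower token cost."""
    return sorted(
        models,
        key=lambda m: (
            -composite_score(m["evals"]),
            m["params_b"],
            m["cost_per_mtok"],
        ),
    )


# Hypothetical data, not real benchmark results.
models = [
    {"name": "model-a", "evals": {"eval_1": 0.75, "eval_2": 0.5},
     "params_b": 70, "cost_per_mtok": 3.0},
    {"name": "model-b", "evals": {"eval_1": 0.5, "eval_2": 0.75},
     "params_b": 8, "cost_per_mtok": 0.5},
    {"name": "model-c", "evals": {"eval_1": 0.5, "eval_2": 0.5},
     "params_b": 3, "cost_per_mtok": 0.1},
]

for m in rank(models):
    print(m["name"], composite_score(m["evals"]))
```

Here model-a and model-b tie on the composite, so the smaller, cheaper one ranks first. Same caveat as above: not perfect, but useful.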