Hacker News new | ask | show | jobs
by kovezd 273 days ago
By the composition of evals. Plus secondary metrics like parameter size, and token cost.

Not perfect, but useful.