by skysniper
77 days ago
Both are already shown on the battle detail page. Time is shown in the Scores table. Token counts are shown in the Cost details at the bottom of the Scores. (I thought most people just want to see cost in USD, so I put the token details at the bottom.)
|
https://i.imgur.com/wFVSpS5.png
and quality vs cost
https://i.imgur.com/fqM4edw.png
But I just noticed that my plot is meaningless because it conflates model quality with provider uptime.
Claude Haiku shows a higher average quality than Claude Opus, which does not make sense. The explanation is that network errors were counted as a quality score of 0, and there were _a lot_ of network errors.
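
To illustrate the effect, here is a minimal sketch with made-up numbers. The model names, the scores, and the `summarize` helper are all hypothetical and are not taken from the plot. It assumes a failed request is recorded as `None`, and compares averaging with errors counted as 0 against averaging only the runs that actually returned an answer, with the error rate reported separately.

```python
from statistics import mean

# Hypothetical runs: (model, quality score), with None meaning the request failed.
# These values are invented for illustration only.
runs = [
    ("strong_model", 9.0), ("strong_model", None),
    ("strong_model", None), ("strong_model", 9.0),
    ("small_model", 7.0), ("small_model", 7.0),
    ("small_model", 7.0), ("small_model", None),
]

def summarize(runs):
    """Return per-model averages computed two ways, plus the error rate."""
    by_model = {}
    for model, score in runs:
        by_model.setdefault(model, []).append(score)

    summary = {}
    for model, scores in by_model.items():
        ok = [s for s in scores if s is not None]
        summary[model] = {
            # Errors counted as quality 0: mixes quality with uptime.
            "mean_errors_as_zero": mean(0.0 if s is None else s for s in scores),
            # Successful runs only: quality on its own.
            "mean_successful_only": mean(ok) if ok else None,
            # Uptime reported as a separate number.
            "error_rate": (len(scores) - len(ok)) / len(scores),
        }
    return summary

for model, stats in summarize(runs).items():
    print(model, stats)
```

With these invented numbers the ranking flips depending on how errors are treated, which is the same kind of inversion as in the plot. Reporting error rate as its own column keeps the quality axis about quality.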