| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by wonderwhyer 124 days ago

The debate around local vs API vs subscriptions feels mostly anecdotal. I tried building a tool that compares them using “quality-adjusted tokens per dollar.”

The idea:

Tokens per dollar

Weighted input/output pricing (75/25 assumption)

Benchmark-normalized quality (Arena, Aider, SWE-bench)

Early results surprised me (local often loses economically unless privacy is heavily valued).

I’m mostly looking for critique of the methodology:

Is quality-adjusted tokens per dollar even the right metric?

Is normalizing ELO to % defensible?

What benchmarks am I missing?