Hacker News new | ask | show | jobs
by Salgat 41 days ago
Local models are much less energy efficient right?
1 comments

It's a good question, although I think hard to quantify.

If you are simply measuring Watt Cost per Token, you are missing the mark drastically. You have to measure quality output per Watt.

It sounds reasonably difficult to benchmark this, maybe I'm wrong though.