by numlocked
326 days ago
(I work at OpenRouter) We solved this problem by working with providers to implement a prices-and-models API that we scrape, which is how we keep our marketplace up to date. It's been a journey; a year ago it was all happening through conversations in shared Slack channels! The pricing landscape has become more complex as providers have introduced things like different per-token prices depending on prompt length, caching, etc. I do believe the right lens on this is the price per token by endpoint, not by model; there are fast/slow versions, thinking/non-thinking versions, etc. of the same model that can sometimes differ in price as well. The point of this comment is not to self-promote, but we have put a huge amount of work into figuring all of this out, and it is all publicly available on OpenRouter (admittedly not in such a compact, pricing-focused format, though!)
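To make the "price per endpoint, not per model" point concrete, here is a minimal Python sketch. Everything in it is an assumption for illustration: the names `PriceTier`, `Endpoint`, and `request_cost`, the model and variant labels, and all prices are made up, and none of it reflects OpenRouter's actual API or schema. It only shows how prompt-length tiers and cached-token rates can hang off an endpoint rather than a model.

```python
from dataclasses import dataclass
from decimal import Decimal
from typing import Optional, Tuple


@dataclass(frozen=True)
class PriceTier:
    # Hypothetical tier: applies to prompts up to max_prompt_tokens (None = no limit).
    max_prompt_tokens: Optional[int]
    input_per_mtok: Decimal         # USD per 1M uncached input tokens
    cached_input_per_mtok: Decimal  # USD per 1M cached input tokens
    output_per_mtok: Decimal        # USD per 1M output tokens


@dataclass(frozen=True)
class Endpoint:
    # One model can be served by several endpoints (fast/slow, thinking/non-thinking).
    model: str
    variant: str
    tiers: Tuple[PriceTier, ...]    # ordered by ascending max_prompt_tokens

    def tier_for(self, prompt_tokens: int) -> PriceTier:
        # Pick the first tier whose prompt-length bound covers this request.
        for tier in self.tiers:
            if tier.max_prompt_tokens is None or prompt_tokens <= tier.max_prompt_tokens:
                return tier
        raise ValueError("no pricing tier covers this prompt length")


def request_cost(endpoint: Endpoint, prompt_tokens: int,
                 cached_tokens: int, output_tokens: int) -> Decimal:
    """Cost in USD of one request, given how many prompt tokens were cache hits."""
    if not 0 <= cached_tokens <= prompt_tokens:
        raise ValueError("cached_tokens must be between 0 and prompt_tokens")
    tier = endpoint.tier_for(prompt_tokens)
    uncached_tokens = prompt_tokens - cached_tokens
    total = (uncached_tokens * tier.input_per_mtok
             + cached_tokens * tier.cached_input_per_mtok
             + output_tokens * tier.output_per_mtok)
    return total / Decimal(1_000_000)


# Made-up prices: the same model exposed through two differently priced endpoints.
standard = Endpoint("example-model", "standard", (
    PriceTier(200_000, Decimal("1.00"), Decimal("0.25"), Decimal("4.00")),
    PriceTier(None, Decimal("2.00"), Decimal("0.50"), Decimal("6.00")),
))
fast = Endpoint("example-model", "fast", (
    PriceTier(None, Decimal("2.00"), Decimal("0.50"), Decimal("8.00")),
))

standard_cost = request_cost(standard, 100_000, 40_000, 1_000)
fast_cost = request_cost(fast, 100_000, 40_000, 1_000)
long_prompt_cost = request_cost(standard, 300_000, 0, 0)
```

With these invented numbers, the identical request costs twice as much on the "fast" endpoint as on the "standard" one, and the long prompt falls into the higher-priced tier. A single price keyed by model name could not represent either difference.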
https://github.com/tekacs/llm-pricing