|
|
|
|
|
by ACCount37
82 days ago
|
|
Check the token prices for open weight LLMs at various independent inference providers. That gives you a very good estimate of "how much can you serve the tokens of a model of the size N for while making a profit". Now, keep in mind: Kimi K2.5 is 1T MoE. Today's frontier LLMs are in the 1T to 5T range, also MoE. Make an estimate. Compare that estimate with the actual frontier lab prices. |
|
In the current volatile environment, the API prices are more of a baseline where we can assume it can't be much cheaper to operate these models.