Hacker News
by 6bb32646d83d 740 days ago
GPU companies that sell inference on open source models (like DeepInfra or Together AI) are doing so at extremely competitive prices, which makes me think the big players' current API pricing is profitable.

(For example, DeepInfra has WizardLM-2-8x22B at $0.65/1M output tokens, compared to $6/1M output tokens for 8x22B by Mistral. And of course Mistral has some more expensive, closed source models that perform better.)
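
To put a number on that gap, here is a rough sketch of the arithmetic in Python. The two prices are the ones quoted above; the 10M-token monthly volume and the function names are made up for illustration. It only compares list prices, so it says nothing directly about either provider's actual serving cost or margin.

```python
# Prices per 1M output tokens, as quoted in the comment above.
DEEPINFRA_WIZARDLM_USD_PER_M = 0.65
MISTRAL_8X22B_USD_PER_M = 6.00

# Hypothetical workload, chosen only for illustration.
MONTHLY_OUTPUT_TOKENS = 10_000_000


def output_cost(tokens: int, usd_per_million: float) -> float:
    """Cost in USD of generating `tokens` output tokens at a per-1M price."""
    return tokens / 1_000_000 * usd_per_million


def price_ratio(expensive: float, cheap: float) -> float:
    """How many times more expensive one list price is than another."""
    return expensive / cheap


ratio = price_ratio(MISTRAL_8X22B_USD_PER_M, DEEPINFRA_WIZARDLM_USD_PER_M)
cheap_bill = output_cost(MONTHLY_OUTPUT_TOKENS, DEEPINFRA_WIZARDLM_USD_PER_M)
pricey_bill = output_cost(MONTHLY_OUTPUT_TOKENS, MISTRAL_8X22B_USD_PER_M)

print(f"List price ratio: {ratio:.1f}x")
print(f"10M output tokens: ${cheap_bill:.2f} vs ${pricey_bill:.2f}")
```

So the quoted Mistral price is roughly nine times the DeepInfra one for a model of the same 8x22B size class. If the cheaper provider is not selling below cost, that gap is the room the pricier list price leaves for margin.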