Hacker News new | ask | show | jobs
by iLoveOncall 33 days ago
Why would smaller and worse models not be 80% cheaper?

If I can run those models on my consumer hardware, I'd better believe they are 80% cheaper than the models that need 1 TB of RAM.

2 comments

For the very hardest reasoning tasks GPT-5 and Opus are still ahead, no argument there. But what we see in practice is customers dropping in an open-source model and getting very similar results on 80-90% of real-world use cases — with significant cost savings and end-to-end UK data residency (which matters a lot for our enterprise and institutional customers). And on the consumer-hardware point: these are Blackwell GPUs in a UK datacentre, in a token factory architecture.
This is very disingenuous, I've been deploying local models to enterprise across a variety of use cases and the optimisation overhead and prompt engineering required to get good performance is huge. Let alone comparative perf to frontier models.
Comment from poster says they are offering Deepseek v4-Pro. Cannot find any details on website.
click "Models and Pricing" in the left menu https://relax.ai/docs/getting-started/pricing
Its written

Input Price: £1.17 Output Price: £2.33

So, slightly cheaper than Fireworks AI