| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by iLoveOncall 33 days ago
	Why would smaller and worse models not be 80% cheaper? If I can run those models on my consumer hardware, I'd better believe they are 80% cheaper than the models that need 1 TB of RAM.

2 comments

benjamintnorris 33 days ago

For the very hardest reasoning tasks GPT-5 and Opus are still ahead, no argument there. But what we see in practice is customers dropping in an open-source model and getting very similar results on 80-90% of real-world use cases — with significant cost savings and end-to-end UK data residency (which matters a lot for our enterprise and institutional customers). And on the consumer-hardware point: these are Blackwell GPUs in a UK datacentre, in a token factory architecture.

link

l_c_m 33 days ago

This is very disingenuous, I've been deploying local models to enterprise across a variety of use cases and the optimisation overhead and prompt engineering required to get good performance is huge. Let alone comparative perf to frontier models.

link

graemep 33 days ago

Comment from poster says they are offering Deepseek v4-Pro. Cannot find any details on website.

link

anentropic 33 days ago

click "Models and Pricing" in the left menu https://relax.ai/docs/getting-started/pricing

link

ttoinou 33 days ago

Its written

Input Price: £1.17 Output Price: £2.33

So, slightly cheaper than Fireworks AI

link