| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by nunodonato 64 days ago
	we have a big dependency on AI, both for developers (can survive without it, mostly habits) and internal workflows (very hard to go without it). So we decided to unplug from cloud AI, rent our own GPU and use an open model for both scenarios. We have been very happy with it so far, 60% cheaper and around 50% faster

2 comments

scottyah 64 days ago

Faster in what way? All the open models we have access to at work are very noticeably behind the frontier models to the point where it's usually faster to not use them at all.

link

shimman 63 days ago

Faster in which you probably don't have to make so many network requests.

link

nunodonato 62 days ago

No, its way way faster than Claude

link

htrp 64 days ago

why not an inbetween scenario like using a managed inference provider to host your own models?

link

nunodonato 62 days ago

what would be the advantage?

link