Hacker News new | ask | show | jobs
by nunodonato 64 days ago
we have a big dependency on AI, both for developers (can survive without it, mostly habits) and internal workflows (very hard to go without it). So we decided to unplug from cloud AI, rent our own GPU and use an open model for both scenarios. We have been very happy with it so far, 60% cheaper and around 50% faster
2 comments

Faster in what way? All the open models we have access to at work are very noticeably behind the frontier models to the point where it's usually faster to not use them at all.
Faster in which you probably don't have to make so many network requests.
No, its way way faster than Claude
why not an inbetween scenario like using a managed inference provider to host your own models?
what would be the advantage?