Hacker News
by IanCal 1161 days ago
Quality is one aspect, running them is another. If I've already got everything set up with them and they work efficiently, they could also offer open source models and let me pay for usage. Both bursty usage and low constant usage benefit from paying per token and having some large, shared infrastructure to use. I don't want to be running a bunch of h100s, I just want my requests processed.

If they're selling GPT-5 and also let me pay for LLaMA or whatever else is out, then I'll just use them unless pricing is wildly different.

1 comment

> I don't want to be running a bunch of h100s, I just want my requests processed.

Assuming you don't have sensitive data and that you never try anything outside the rules.

Those are really hosting issues and shouldn't be a problem for most companies.

I'd be absolutely shocked if they don't launch a version where it's run on more secure setups, particularly as they've got huge Microsoft backing.

You can run this in Azure, which I feel solves basically all of the hosting issues.