Hacker News new | ask | show | jobs
by a13n 14 days ago
Both OpenAI and Anthropic have billing caps… who doesn’t?
7 comments

Huh, so they do.

Anthropic: https://support.claude.com/en/articles/8977456-how-do-i-pay-... - you can pre-pay and get a hard cutoff.

OpenAI: https://community.openai.com/t/how-to-set-billing-limits-and... - last time I looked OpenAI had a soft but not hard limit, I guess they fixed that last year.

I remember bugging them both about this last year, I need to update my mental model!

I tried Alibaba Cloud. They have no caps. This was the reason to cancel my account there.

Deepseek has a prepaid model. (Pretty impressive, what fits into 10 Dollar)

Literally every credit card I own allows me to make a virtual card that is either single use or has a cap.
Does not matter, you still owe what you used for a service
Like which one? Most I know don’t have this feature
My business card is a Cap one spark business 2%.. get 2% cash on everything which is nice.
Who doesn't have hard billing caps for inference? Microsoft, Google and AWS my friend. And you know who uses Microsoft, Google and AWS? Almost all big corporations do use them instead of direct OAI or Anthropic API because all their contracts and infra are built around the big cloud providers.
There is a scheme to send gifts with a compromised anthropic key even if the limit is reached.
Based on experience, Google Cloud. No idea if that translates to Gemini usage billing.
Gemini added prepaid billing and spending caps a few weeks ago: https://twitter.com/OfficialLoganK/status/204451626215244231...
I cancelled my whole GCP account a month ago because I was too afraid of getting charged hundreds of thousands overnight like all peoples on Reddit
Google Vertex
> Long-running tasks like batch mode completions and agent sessions may incur overages beyond your project spend cap.

> Billing data processing times can be delayed in AI Studio, up to around 10 minutes. You may experience overages beyond your project cap if billing data hasn't processed before more charges are accrued.

https://ai.google.dev/gemini-api/docs/billing#project-spend-...

That's a soft cap, not a hard cap

I spent two hours the other day trying to figure out how to manage spend on gcp, i gave up and used openrouter and cloudflare.
AI studio added it recently, Vertex not.
Microsoft