Hacker News new | ask | show | jobs
by jillesvangurp 28 days ago
It gets better; most of their incoming requests don't actually require a frontier model to handle. There's a huge potential for future optimization in this space. Anthropic, OpenAI, Google and a few other companies are going to be well positioned to scale in the few years. A 65$ billion round to finance operations over the next few years isn't that controversial if you look at the growth and profit potential.

I think token counts and GW are a gross over simplification here. Not all tokens are the same in the amount of GPU time they consume or the size of the GPUs they require or the amount of energy they consume. There's a huge optimization potential here once these companies get serious about consolidating the business they have and executing much more efficiently. Given enough time, these companies can heavily optimize their operations. Short term growth and not slamming the brakes on that is their primary concern.

1 comments

Where's the moat though? What prevents a race to the bottom with competing AI providers, everyone trying to undercut one another?
I'm also thinking the same.

I have been trying Claude Code with DeepSeek 4 apis, and the experience is barely different. In fact the margin of error is so small that harness and prompting account for the most impact in output quality.

But, here's the catch: I spend barely more than a handful of dollars per day of regular usage. In fact DS4 via api is cheaper than Claude 100$ subscription.

I really think that very soon many will start realizing that the alternatives are extremely close in performance but dramatically different in pricing.

Claude includes or at least promises ZDR in some situations, whereas DeepSeek is explicitly using output to train models. The subsidising might be done with your data.