|
|
|
|
|
by zozbot234
26 days ago
|
|
They just have to incrementally raise the price of inference tokens and limit subscriptions to curtail existing demand (with much of it likely moving to slower and cheaper local models). Which, come to think of it, is exactly what seems to be happening right now. > So they don't have investor money to burn (and when they do, they immediately burn it on new datacenters, which usually take years to build and aren't a certainty). If AI models can get smarter and more practically useful via some combination of increased scale and more fine-tuned post-training on specific workloads (which is compute-heavy, even more than the usual kind of pre-training) these new datacenters are a fantastic investment. |
|
They can get more efficient, but inference efficiency doesn't map linearly to cost efficiency. Firstly because software is a gas; if you give people more compute (for the same price), they immediately use it all up. But second, if you spend $50BN, you still have to make $50BN to break even. They could make inference cost $0.00000001, but that isn't going to cover their costs. That's what's driving their cost right now - they're trying to collect enough cash from people at the table to pay the bill, without the price scaring everyone out of the restaurant.
So they can't raise the price without scaring people off, and they can't lower the price and pay the bill.