Hacker News new | ask | show | jobs
by adjejmxbdjdn 5 days ago
Compute is also a rapidly depreciating asset.

I want to make a comparison with a car rental business and say that it would be like valuing Hertz entirely on the basis of the number of cars they own, as opposed to how many they rent out, but cars have a much longer depreciation period, if there are no customers they’re not costing you more money, unlike your computer which you are using for training and sucking up massive amounts of energy, and those cars do maintain decent value even after they’re of little use to the car rental company, unlike the compute here.

9 comments

> Compute is also a rapidly depreciating asset.

That's the default assumption but in the new GPU+Memory constrained age isn't true.

Time on 4 year old H100 servers costs more now than when they were new (!!)

> That's the default assumption but in the new GPU+Memory constrained age isn't true.

Is it an age or a temporary situation?

Memory is unlikely to drop in price before mid-2027 when new capacity starts to come online.

The GPU shortage looks to be even longer lived.

So, temporary situation then. That's a pretty short period with no paradigm shift, just a delay in capacity.
Temporary until its not.

It's the new normal, get used to it.

The MAG7 isn't pumping all their FCF + new debt issuance into DC's just for fun.

The world is seemingly moving into a era where compute is becoming expensive and scarce.

Only thing that can possibly change this is LLMs hitting a vertical unscalable wall.

More AI compute = more CPU, memory, storage needs.

Do you think we will recognize any walls? Or is there a point where the output might look different with respect to different paradigms / modalities we throw at it, but we won't be able to understand the quantitative differences as good/bad/scalable?
It’s gonna take a lot longer than mid 2027. 2029 earliest IMO. Hyperscaler spend is basically already spoken for the next 2 years.
Everything is a temporary situation on long enough timeframes, especially if it’s exponentially growing. Moore’s law which dictates that compute depreciates quickly has been slowing down a lot in the last few years, coupled with the explosion in demand we’ve found ourselves in a prolonged shortage situation. The bubble will pop, but if you predict when correctly, you will be a rich man.
It's very unclear to me.

The key question is on direction of LLMs. Right now, LLMs are taking over human jobs. If the cost of silicon+power < cost of human being doing the same work, what rational reason is there to employ a human being?

If this applies to SWEs, lawyers, business analysts, many research scientists, .... this situation could persist for a long, long time. While capital costs less than the inputs of labor (nominal food, housing, etc.), there is no need for labor.

The key question is about continued progress in models, and of the tooling around them:

- Plateau: Old silicon obsoletes in due course

- Rise quickly: Old silicon maintains value for a long time

What I don't understand is if nobody has jobs, who's paying the machines to do anything?

So okay cool you don't need people to design and build cars. Who's going to buy the cars and where exactly are they finding money?

But see also the "radiologists driving to work" meme for why I think tech in general is currently getting high off their own farts.

Rich people become the only consumers.
Yes, the plan seems to be anti human in the extreme. Why do you need the plebs if they can be entirely replaced by AI? But the question then becomes why does the AI (and before that their security detail in a post money world) need billionaires?
I think the Amish will mostly be fine. Maybe that's how the future looks like.
Long term, or short term?

Short term, money physically exists and gets spent, so if you wave a magic want of oversimplification and transition all labour to AI instantly, all the money currently in bank accounts and wallets gets spend on the same businesses it was already getting spent on, a lot of which gets spent on stuff from other businesses who have in this scenario also replaced all their labour with AI.

Eventually, perhaps quickly, all this money ends up in the hands of shareholders and landlords. There's a lot of both in the economy; famously retirement funds, but smaller-scale shareholders and landlords also exist. I wouldn't want to guess what the distribution looks like, probably highly variable between countries not just social classes (the definitions of which themselves can vary between countries).

Long term, money exists as a convenient fiction to help us organise transactions of goods and services: while it may be physically possible to eat gold and banknotes, you're not getting any real nutrients out of it when you do. So in a world where goods and services come from machines, the options are too broad to forecast: humanity could be relegated to the same role and economic stature as other primates (both in and out of zoos), or we could get universal UBI denominated in machine labour credits which lets each of us live better lives than the most extravagant billionaires live today.

I don't know. It just seems odd because money was used as an abstraction of labor and if labor disappears it seems like money has no fundamental value. If you can't pay people to do something (because machines are doing all the labor). Then people have no money and money has no value to people. Industrialization resulted in transition to service-based economy but this new wave of machines are being said to replace service work.

I'm just trying to understand if suppose you have fully robotic farms and fully automated slaughterhouses and fully automated McDonald's, who is McDonald's selling anything to and how do these people supposedly buying fully-mechanized burgers have jobs? Something just doesn't add up about this in my head about how this equation balances.

UBI ultimately seems like socialism with extra steps. Mostly is comes across as billionaires desperately begging for an alternative to being nationalized.

The overwhelming majority of the labor force remains service, manual labor, and other such stuff that LLMs will have no real effect on. So the economy will be fine, but I do agree with you from a different angle. The entire goal of LLMs seems self destructive. If they're successful then the endgame is completely removing the barriers to entry to producing software and other digital tech. But if we do reach that endgame then the value of tech is going to plummet because there will be absolutely no barriers to entry to compete, or even just individuals homebrewing up what they need on demand.

Like imagine there was something you could buy where you insert some lumber, give it some passable description of furniture, and it outputs it. And you paid $20/month for access to this. And this was all being bankrolled by the furniture industry? I mean, sure guys - it's much appreciated, but I don't think I've ever seen anybody so enthusiastic about digging their own grave. I think it's already obvious that the gazillion dollars of API calls isn't going to materialize - it seems the handful of companies that trialed that are already reversing course hard. And in the future where LLMs are successful, that'd be even more true.

Llms either reach the point where they can quickly design and build physical robots to take on that service industry or they stop exponential growth.

Both of those are devastating for their valuation. Stopping growth means open modes catch up in a year or so. Continuing means end of the current economy.

> what rational reason is there to employ a human being?

To maintain a functioning society and social contract?

Is wanting low unemployment in our society not rational?

It's ethnically rational, and morally right.

However.

It's not rational relative to the short-term incentives of a typical corporation or investment vehicle. PE, VC, fund managers aren't paid to give a fuck about the social contract. Literally not in their job description.

> Is wanting low unemployment in our society not rational?

Only conditionally on there being bad consequences for high unemployment.

I don't particularly trust politicians, but there's a whole host of hypothetical scenarios about futures where work is essentially optional. Unfortunately, they're all either in the sci-fi or religion sections of the book store:

Despite people occasionally investigating UBI, the efforts to research UBI seriously have the same problems that Marx had with literal Communism, in that there's an obvious difference between any partial transition as compared to a global transition, and we don't have a completely disconnected parallel world to be a petri dish for us to test the economic outcomes on.

Correct. Unfortunately, that's not how capitalism makes decisions.
Capitalism does not decide anything. Capitalism allows individuals to take decisions in a free market.

If you want to complain about selfishness then do it on selfish individuals, which by the way, are present in all types of economic systems.

Are current datacenter deployments structured in such a way that the memory can later be moved to newer GPU dies? Or is it all packaged together as on consumer graphics cards?

I assumed the latter and therefore that the memory is depreciating along with the GPU cores it's soldered onto PCBs with.

... or is it a different argument being made, perhaps that depreciation for GPUs has slowed because rising demand will keep them in service longer?

The argument is that all GPUs are currently appreciating (!!)

Google is still running 10 year old Tesla T4s at full capacity.

This is way beyond the expected lifetime.

Removing RAM chips off old cards is uneconomical, until it isn't. With things going the way they are, if you've got a card with soldered on RAM that could be transplanted to a newer card, I think you'll start seeing that happening.
It has already become economic. While not exactly the same, the NVIDIA 2080 11GB cards are notorious for being upcycled with extra RAM: https://www.reddit.com/r/nvidia/comments/146us12/nvidia_gefo...
Chinese recyclers already do this with laptops
> Time on 4 year old H100 servers costs more now than when they were new (!!)

There are several confounding factors.

We’ve seen massive inflation since then. So some growth in cost was expected.

More importantly, the current Tech industry almost always starts by selling things at a loss. The increased cost could simply be the industry choosing to not subsidize that particular service anymore.

But also, I don’t think that’s a realistic comparison. Rented out GPUs are likely not a similar use profile as compute used for training LLMs. The latter is likely closer to the cryptocurrency GPUs that are running at full tilt 24/7.

And those things physically burn out.

> Rented out GPUs are likely not a similar use profile as compute used for training LLMs. The latter is likely closer to the cryptocurrency GPUs that are running at full tilt 24/7.

This is untrue.

H100's are used for training (well were, but are now outdated because B100/B200s are much faster).

Most of the reason people rent H100s is for smaller training runs.

If you are doing inference you usually buy managed capacity at Baseten or something, and that is often priced differently (although it comes down to an extra margin on longer term H100 prices basically).

Inference utilization is often actually higher than training now because so much effort has been spent on optimizing that stack.

I also feel that the GPU/NPU value does not lose money as fast anymore.

What I am wondering though is how long can you run such a system at basically full load without interruption before it starts to just physically degrade.

If I have a H100 and I let it run for 4 years at full throttle does it still have the same theoretical value as it had at the start or are the chips just burning out.

I think I remember that back when the cards used for crypto mining were sold en masse on ebay the advice was to stay away from them because they are more likely to fail?

Quite the opposite, GPUs running at a stable rate degrade less than GPU that continuously hit highs and lows (like it would happen on a gaming rig).
Normal use means loading data into the GPU for each batch. The load is not even, though training might be worse than "production".
After digging around a bit I found an unverified claim from 2024 that GPUs in datacenters have a lifespan of 1-3 years

https://www.tomshardware.com/pc-components/gpus/datacenter-g...

Others say that moderate load means a lifespan of ~5 years

Not sure what that means but I would assume that a datacenter will start replacing a node once the error rate hits a certain threshold without really investigating why it failed, so the practical lifespan may be shorter than 5 years even if it would technically still be usable enough

https://en.wikipedia.org/wiki/Electromigration

Temperature is a big factor, as well as current density.

But there's also the # and magnitude of thermal cycles (which translate into mechanical stress, leading to metal-fatigue like effects on contact points etc), attack from chemicals in the air, cosmic radiation, ESD damage & more. Some may matter, some not.

That's why "new" > "used" in case of electronics. Especially since you don't know the (ab)use history of used parts.

> I also feel that the GPU/NPU value does not lose money as fast anymore.

That's because the rate of improvement in silicon manufacturing has been continually declining for a few decades, which has a compounding effect. Just compare the technological improvements in successive decades. 1976->1986->1996->2006->2016->2026.

That's why "in real terms" performance has only been very slowly improving if you compare apples to apples (and not e.g. apples to oranges by reducing precision, like nvidia tends to do, or by comparing chips with x W to an MCM with x*2 W and saying the latter is much faster). The "just halve the number of bits in each generation" strategy has also run out now, there's no more bits to halve.

Depreciating doesn't just mean it could depreciate in value relative to the performance of newer GPUs, but also that its lifespan is limited by reliability issues and failures.
That's just inflation (yeah, the global one) and demand at play.

Let's not mix up depreciation of real value vs USD price (which is arbitrary, plus government controlled)

it’s more like if you were to value Hertz as if they were a self-driving car company, only to find they’re a car rental company
Car rentals are a great comparison, but not for the reason you think. Cars depreciate value similarly to GPUs. The depriciation lifecycle timeframe is actually similar between hyperscaler GPUs and mainstream corporate car rental companies ike Hertz. They sell their cars after 2-3 years or 20-40k miles. There is a huge market for used cars. Hertz runs their used car sales out of their rental retail offices and a lot of overhead is shared. So take the difference in cost to buy new in bulk from the manufacturer from the retail sales price for a 2-3 year old car. As long as Hertz can make more money renting it out in that time, that's revenue positive.

Same with GPUs. There is also a huge market for used GPUs from 1-2 generations ago. The A100 is a six year old chip at this point and is still running strong, especially for inference. Like cars, chips can be refurbished and repaired. A hyperscaler or even mid level player here isn't going to hold onto chips for their entire usable lifespan.

It is depreciating, but demand has been very high.

There's a reason old 3090's went from $600 in 2022 o to over $1K in 2026.

My local inference rig now costs three times what I bought it for. If I'd gotten the max ram I could at the time I would have made $10k after selling the excess to my current spec.

How someone can look at an asset class thats appreciated an order of magnitude in the last two years and say it will depreciate in value when the tailwinds are even stronger now is beyond me.

Yes, toilet paper and N95s were expensive and hard to buy once, which is why I stockpiled a lifetime supply of them. Suckers!
“Graph go up to the right. Graph stop at edge of paper. Must go up forever!”
Fundamentals dictate hardware is a depreciating asset, they're not wrong. They're just ignoring the reality of the current market.
This was true when Moores law wasn't dead. Per watts performance has been flat since Ampere. There is a reason why undervolted 3090s are still used.
GPUs do have a life expectancy. They don’t run forever, especially at high temperatures and full utilization.
You undervolt them because the last 50% of power adss 10% of compute.
Performance goes way up if you use liquid nitrogen to cool the chips. Maybe finally someone's willing to pay for that.
I have been hearing that memory suppliers are _intentionally_ not scaling up new factories like crazy because they assume the demand won't be there on the long term and they don't want to have spare unused capacity. Probably because Samsung and SK have a near duopoly on it as well...

At some point the market will be saturated with supply and prices will come down for older gen hardware. It can take years though, but it happened to fiber cable and fiber doesn't even depreciate like chips.

Will it continue to appreciate to infinity? Maintain its value forever? Or will something else happen?

The same argument you’ve made would work for tulip bulbs, dotcom prices, or whatever. Prices go up until they don’t. Exponentials don’t last forever and the intrinsics of technology assets depreciate: things wear out and are also replaced with better things.

everything* is 3x more expensive in the same amount of time though. that's inflation mostly.

* except ram

> if there are no customers they’re not costing you more money, unlike your computer which you are using for training

So are you using the computers or not? I'd argue that if you're using them for training, then it's not wasted capacity. And if you're not using them, then you can turn them off, so you're not sucking up energy.

Compute is also a rapidly depreciating asset.

I don’t know but this dude at my son’s school has a 32GB RTX 5090 and it’s worth more than what he paid for; and he did the same trick with the RTX 4090 before that.

Until shortages are the rule, these assets are appreciating

"depreciating" is not being used in the right sense.

There is depreciation, which is taking the purchase price and dividing it across N number of years (typically 5). That's the D in EBITDA and is mostly used as a profitability calculation.

The depreciation of a GPU also gets mucked up in the current GPU financed market as well. DDTL loans. The people running the GPUs often don't even own the GPU, they lease it, so there is nothing for them to depreciate (D).

The analogy that a GPU is like a used car makes zero sense. There is no oil or tires to change on a GPU. They don't wear out in the same way that a rental car would. They are housed in climate controlled locations with clean power. They just don't fail the way that is portrayed in the press.

Useful life of a GPU is based on profitability. When does opex cost more than profitability?

Some companies, like mine, also have support contracts. Anything goes wrong with the GPU (or any part of the system), Dell comes and fixes it at no extra charge. We just migrate customers and workloads to hot spares while the parts are replaced.

As for compute going down in value... the 122TB of enterprise nvme and 2GB of ram in each server that I bought 2 years ago is now worth vastly more than I paid for it. I'm also renting my GPUs out for more money now due to supply being so tight and demand being so high.

Compute is about to come an appreciating asset in the near-term, and it some ways it already is.

The frontier labs are shifting from pricing grounded in the price of compute, to pricing grounded in the intelligence provided, or more specifically the economic value of that intelligence downstream.

The margins on that allow them to pay a hefty premium on compute and still come out ahead.

As they buy more compute at high prices, they're also pricing out competition from cheaper models. It's already become materially more difficult to get compute to run open weight models at competitive prices as a result of frontier labs in the last year.

There is zero evidence of this shift in pricing occurring. It’s still a dream which seems unlikely
News to me?

Opus 4.7 has all the signs of a smaller model distilled from a newer pretraining run... except a smaller price.

Flash 3.5 raised in price pretty meaningfully over Flash 3

GPT 5.4 got a small price bump over gpt-5.3-Codex/gpt-5.2, then gpt-5.5 doubled pricing over gpt-5.4

Even open weights isn't immune: Kimi K2.6 was originally priced higher despite openly being 2.5 + more post-training, same with GLM 5.1 vs 5

-

All while rental prices are spiking month over month, and NVIDIA Inception discounted prices for buying are higher than undiscounted prices for buying 6 months ago...

It feels like this is the line people are using to justify the expense of compute capex
I run a consumer AI product and the current reality of trying to get compute vs what it was 6-12 months ago is enough to justify it to anyone who has the money.

I think OpenClaw created a mania that was completely unfounded (Apple Silicon is worth dirt compared to literally anything from NVIDIA including consumer GPUs), but the prediction of compute becoming scarce was correct

The fact that you can sell or lease out something for more than you bought it for is justification in and of itself.
Not necessarily. The GPU leases Spacex has made are month to month, so they are taking on all of the risk. If demand goes down, they're the ones stuck with the assets.
That's a very big "if," which this what thread has been about.
Dream? It is a nightmare that computers aren't getting significantly more efficient anymore.
In the short term, compute becomes an appreciating asset.

In the medium term, everyone ramps up production. Huawei and other Chinese companies work really hard to develop in-house alternatives. At some point, the hype cycle will peak and less money will flow into datacentres (yes, this will happen. It always does. Even for technologies that change society. The bubble always bursts).

The question is not if this will happen. It will happen. It's just a question of when it happens and how big the magnitude of the cycle is.

no need for a car analogy.

the comment you replied to is word-by-word what people hyping canadian telecoms were saying before the dotcom crash!