| HN Mirror

	by adjejmxbdjdn 5 days ago
	Compute is also a rapidly depreciating asset. I want to make a comparison with a car rental business and say that it would be like valuing Hertz entirely on the basis of the number of cars they own, as opposed to how many they rent out. But the comparison is imperfect: cars have a much longer depreciation period; if there are no customers they’re not costing you more money, unlike your computer which you are using for training and sucking up massive amounts of energy; and those cars do maintain decent value even after they’re of little use to the car rental company, unlike the compute here.

9 comments

nl 5 days ago

> Compute is also a rapidly depreciating asset.

That's the default assumption, but in the new GPU+Memory constrained age it isn't true.

Time on 4 year old H100 servers costs more now than when they were new (!!)

RealityVoid 5 days ago

> That's the default assumption, but in the new GPU+Memory constrained age it isn't true.

Is it an age or a temporary situation?

nl 5 days ago

Memory is unlikely to drop in price before mid-2027 when new capacity starts to come online.

The GPU shortage looks to be even longer lived.

williamdclt 5 days ago

So, temporary situation then. That's a pretty short period with no paradigm shift, just a delay in capacity.

Seanambers 4 days ago

Temporary until it's not.

It's the new normal, get used to it.

The MAG7 isn't pumping all their FCF + new debt issuance into DCs just for fun.

The world is seemingly moving into an era where compute is becoming expensive and scarce.

Only thing that can possibly change this is LLMs hitting a vertical unscalable wall.

More AI compute = more CPU, memory, storage needs.

_1100 4 days ago

Do you think we will recognize any walls? Or is there a point where the output might look different with respect to different paradigms / modalities we throw at it, but we won't be able to understand the quantitative differences as good/bad/scalable?

__blockcipher__ 5 days ago

It’s gonna take a lot longer than mid 2027. 2029 earliest IMO. Hyperscaler spend is basically already spoken for the next 2 years.

baq 5 days ago

Everything is a temporary situation on long enough timeframes, especially if it’s exponentially growing. Moore’s law, which dictates that compute depreciates quickly, has been slowing down a lot in the last few years; coupled with the explosion in demand, we’ve found ourselves in a prolonged shortage situation. The bubble will pop, but if you predict when correctly, you will be a rich man.

frognumber 5 days ago

It's very unclear to me.

The key question is the direction of LLMs. Right now, LLMs are taking over human jobs. If the cost of silicon+power < cost of a human being doing the same work, what rational reason is there to employ a human being?

If this applies to SWEs, lawyers, business analysts, many research scientists, .... this situation could persist for a long, long time. While capital costs less than the inputs of labor (nominal food, housing, etc.), there is no need for labor.

The key question is about continued progress in models, and of the tooling around them:

- Plateau: Old silicon obsoletes in due course

- Rise quickly: Old silicon maintains value for a long time

fluidcruft 5 days ago

What I don't understand is if nobody has jobs, who's paying the machines to do anything?

So okay cool you don't need people to design and build cars. Who's going to buy the cars and where exactly are they finding money?

But see also the "radiologists driving to work" meme for why I think tech in general is currently getting high off their own farts.

saulpw 5 days ago

Rich people become the only consumers.

andy_ppp 5 days ago

Yes, the plan seems to be anti human in the extreme. Why do you need the plebs if they can be entirely replaced by AI? But the question then becomes why does the AI (and before that their security detail in a post money world) need billionaires?

laichzeit0 5 days ago

I think the Amish will mostly be fine. Maybe that's what the future looks like.

ben_w 5 days ago

Long term, or short term?

Short term, money physically exists and gets spent, so if you wave a magic wand of oversimplification and transition all labour to AI instantly, all the money currently in bank accounts and wallets gets spent on the same businesses it was already getting spent on, a lot of which gets spent on stuff from other businesses who have in this scenario also replaced all their labour with AI.

Eventually, perhaps quickly, all this money ends up in the hands of shareholders and landlords. There's a lot of both in the economy; famously retirement funds, but smaller-scale shareholders and landlords also exist. I wouldn't want to guess what the distribution looks like, probably highly variable between countries not just social classes (the definitions of which themselves can vary between countries).

Long term, money exists as a convenient fiction to help us organise transactions of goods and services: while it may be physically possible to eat gold and banknotes, you're not getting any real nutrients out of it when you do. So in a world where goods and services come from machines, the options are too broad to forecast: humanity could be relegated to the same role and economic stature as other primates (both in and out of zoos), or we could get UBI denominated in machine labour credits which lets each of us live better lives than the most extravagant billionaires live today.

fluidcruft 5 days ago

I don't know. It just seems odd because money was used as an abstraction of labor, and if labor disappears it seems like money has no fundamental value. If you can't pay people to do something (because machines are doing all the labor), then people have no money and money has no value to people. Industrialization resulted in a transition to a service-based economy, but this new wave of machines is said to replace service work.

I'm just trying to understand: suppose you have fully robotic farms and fully automated slaughterhouses and fully automated McDonald's. Who is McDonald's selling anything to, and how do the people supposedly buying fully-mechanized burgers have jobs? Something just doesn't add up in my head about how this equation balances.

UBI ultimately seems like socialism with extra steps. Mostly it comes across as billionaires desperately begging for an alternative to being nationalized.

somenameforme 5 days ago

The overwhelming majority of the labor force remains service, manual labor, and other such stuff that LLMs will have no real effect on. So the economy will be fine, but I do agree with you from a different angle. The entire goal of LLMs seems self destructive. If they're successful then the endgame is completely removing the barriers to entry to producing software and other digital tech. But if we do reach that endgame then the value of tech is going to plummet because there will be absolutely no barriers to entry to compete, or even just individuals homebrewing up what they need on demand.

Like imagine there was something you could buy where you insert some lumber, give it some passable description of furniture, and it outputs it. And you paid $20/month for access to this. And this was all being bankrolled by the furniture industry? I mean, sure guys - it's much appreciated, but I don't think I've ever seen anybody so enthusiastic about digging their own grave. I think it's already obvious that the gazillion dollars of API calls isn't going to materialize - it seems the handful of companies that trialed that are already reversing course hard. And in the future where LLMs are successful, that'd be even more true.

hdgvhicv 5 days ago

LLMs either reach the point where they can quickly design and build physical robots to take on that service industry or they stop exponential growth.

Both of those are devastating for their valuation. Stopping growth means open models catch up in a year or so. Continuing means the end of the current economy.

paulhebert 5 days ago

> what rational reason is there to employ a human being?

To maintain a functioning society and social contract?

Is wanting low unemployment in our society not rational?

Earw0rm 5 days ago

It's ethically rational, and morally right.

However.

It's not rational relative to the short-term incentives of a typical corporation or investment vehicle. PE, VC, fund managers aren't paid to give a fuck about the social contract. Literally not in their job description.

ben_w 5 days ago

> Is wanting low unemployment in our society not rational?

Only conditionally on there being bad consequences for high unemployment.

I don't particularly trust politicians, but there's a whole host of hypothetical scenarios about futures where work is essentially optional. Unfortunately, they're all either in the sci-fi or religion sections of the book store:

Despite people occasionally investigating UBI, the efforts to research UBI seriously have the same problems that Marx had with literal Communism, in that there's an obvious difference between any partial transition as compared to a global transition, and we don't have a completely disconnected parallel world to be a petri dish for us to test the economic outcomes on.

hackyhacky 5 days ago

Correct. Unfortunately, that's not how capitalism makes decisions.

tirant 5 days ago

Capitalism does not decide anything. Capitalism allows individuals to take decisions in a free market.

If you want to complain about selfishness, then direct it at selfish individuals, who, by the way, are present in all types of economic systems.

mikepurvis 5 days ago

Are current datacenter deployments structured in such a way that the memory can later be moved to newer GPU dies? Or is it all packaged together as on consumer graphics cards?

I assumed the latter and therefore that the memory is depreciating along with the GPU cores it's soldered onto PCBs with.

... or is it a different argument being made, perhaps that depreciation for GPUs has slowed because rising demand will keep them in service longer?

nl 5 days ago

The argument is that all GPUs are currently appreciating (!!)

Google is still running 10 year old Tesla T4s at full capacity.

This is way beyond the expected lifetime.

fragmede 5 days ago

Removing RAM chips off old cards is uneconomical, until it isn't. With things going the way they are, if you've got a card with soldered on RAM that could be transplanted to a newer card, I think you'll start seeing that happening.

fer 5 days ago

It has already become economical. While not exactly the same, the NVIDIA 2080 Ti 11GB cards are notorious for being upcycled with extra RAM: https://www.reddit.com/r/nvidia/comments/146us12/nvidia_gefo...

donaldjbiden 5 days ago

Chinese recyclers already do this with laptops

adjejmxbdjdn 5 days ago

> Time on 4 year old H100 servers costs more now than when they were new (!!)

There are several confounding factors.

We’ve seen massive inflation since then. So some growth in cost was expected.

More importantly, the current Tech industry almost always starts by selling things at a loss. The increased cost could simply be the industry choosing to not subsidize that particular service anymore.

But also, I don’t think that’s a realistic comparison. Rented out GPUs are likely not a similar use profile as compute used for training LLMs. The latter is likely closer to the cryptocurrency GPUs that are running at full tilt 24/7.

And those things physically burn out.

nl 5 days ago

> Rented out GPUs are likely not a similar use profile as compute used for training LLMs. The latter is likely closer to the cryptocurrency GPUs that are running at full tilt 24/7.

This is untrue.

H100s are used for training (well, were; they are now outdated because B100/B200s are much faster).

Most of the reason people rent H100s is for smaller training runs.

If you are doing inference you usually buy managed capacity at Baseten or something, and that is often priced differently (although it comes down to an extra margin on longer term H100 prices basically).

Inference utilization is often actually higher than training now because so much effort has been spent on optimizing that stack.

blensor 5 days ago

I also feel that the GPU/NPU value does not lose money as fast anymore.

What I am wondering though is how long can you run such a system at basically full load without interruption before it starts to just physically degrade.

If I have an H100 and I let it run for 4 years at full throttle, does it still have the same theoretical value as it had at the start, or are the chips just burning out?

I think I remember that back when the cards used for crypto mining were sold en masse on ebay the advice was to stay away from them because they are more likely to fail?

SkiFire13 5 days ago

Quite the opposite: GPUs running at a stable rate degrade less than GPUs that continuously hit highs and lows (as would happen on a gaming rig).

yobbo 5 days ago

Normal use means loading data into the GPU for each batch. The load is not even, though training might be worse than "production".

blensor 5 days ago

After digging around a bit I found an unverified claim from 2024 that GPUs in datacenters have a lifespan of 1-3 years

https://www.tomshardware.com/pc-components/gpus/datacenter-g...

Others say that moderate load means a lifespan of ~5 years

Not sure what that means but I would assume that a datacenter will start replacing a node once the error rate hits a certain threshold without really investigating why it failed, so the practical lifespan may be shorter than 5 years even if it would technically still be usable enough

RetroTechie 5 days ago

https://en.wikipedia.org/wiki/Electromigration

Temperature is a big factor, as well as current density.

But there's also the # and magnitude of thermal cycles (which translate into mechanical stress, leading to metal-fatigue like effects on contact points etc), attack from chemicals in the air, cosmic radiation, ESD damage & more. Some may matter, some not.

That's why "new" > "used" in case of electronics. Especially since you don't know the (ab)use history of used parts.

formerly_proven 5 days ago

> I also feel that the GPU/NPU value does not lose money as fast anymore.

That's because the rate of improvement in silicon manufacturing has been continually declining for a few decades, which has a compounding effect. Just compare the technological improvements in successive decades. 1976->1986->1996->2006->2016->2026.

That's why "in real terms" performance has only been very slowly improving if you compare apples to apples (and not e.g. apples to oranges by reducing precision, like nvidia tends to do, or by comparing chips with x W to an MCM with x*2 W and saying the latter is much faster). The "just halve the number of bits in each generation" strategy has also run out now, there's no more bits to halve.

almog 5 days ago

Depreciating doesn't just mean it could depreciate in value relative to the performance of newer GPUs, but also that its lifespan is limited by reliability issues and failures.

code51 5 days ago

That's just inflation (yeah, the global one) and demand at play.

Let's not mix up depreciation of real value vs USD price (which is arbitrary, plus government controlled)

scionaura 5 days ago

it’s more like if you were to value Hertz as if they were a self-driving car company, only to find they’re a car rental company

jubilanti 5 days ago

Car rentals are a great comparison, but not for the reason you think. Cars depreciate in value similarly to GPUs. The depreciation lifecycle timeframe is actually similar between hyperscaler GPUs and mainstream corporate car rental companies like Hertz. They sell their cars after 2-3 years or 20-40k miles. There is a huge market for used cars. Hertz runs their used car sales out of their rental retail offices and a lot of overhead is shared. So take the difference between the cost to buy new in bulk from the manufacturer and the retail sales price for a 2-3 year old car. As long as Hertz can make more money than that renting it out in that time, that's revenue positive.

Same with GPUs. There is also a huge market for used GPUs from 1-2 generations ago. The A100 is a six year old chip at this point and is still running strong, especially for inference. Like cars, chips can be refurbished and repaired. A hyperscaler or even mid level player here isn't going to hold onto chips for their entire usable lifespan.
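
To make the arithmetic above concrete, here is a minimal sketch of the "buy new, rent out, sell used" calculation. Every figure is hypothetical and chosen only for illustration; none of them are real Hertz or GPU prices, and `rental_margin` is a made-up helper name.

```python
def rental_margin(purchase_price: float, resale_price: float,
                  rental_revenue: float, operating_cost: float = 0.0) -> float:
    """Net result of buying an asset, renting it out, then selling it used.

    The asset only costs the owner the part of its value it lost while held
    (purchase price minus resale price), plus whatever it cost to operate.
    """
    net_depreciation = purchase_price - resale_price
    return rental_revenue - operating_cost - net_depreciation


# Hypothetical car: bought in bulk, sold retail after 2-3 years.
car = rental_margin(purchase_price=25_000, resale_price=18_000,
                    rental_revenue=30_000, operating_cost=15_000)

# Hypothetical GPU: loses more of its value, but also earns more while held.
gpu = rental_margin(purchase_price=30_000, resale_price=12_000,
                    rental_revenue=40_000, operating_cost=15_000)

print(f"car margin: {car:,.0f}")
print(f"gpu margin: {gpu:,.0f}")
```

The point of the sketch is that the resale price enters the same way for both asset types, so a strong used market shrinks the depreciation term regardless of whether the asset is a car or a chip.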

shdh 5 days ago

It is depreciating, but demand has been very high.

There's a reason old 3090s went from $600 in 2022 to over $1K in 2026.

noosphr 5 days ago

My local inference rig now costs three times what I bought it for. If I'd gotten the max ram I could at the time I would have made $10k after selling the excess to my current spec.

How someone can look at an asset class that's appreciated an order of magnitude in the last two years and say it will depreciate in value when the tailwinds are even stronger now is beyond me.

snypher 5 days ago

Yes, toilet paper and N95s were expensive and hard to buy once, which is why I stockpiled a lifetime supply of them. Suckers!

dpark 5 days ago

“Graph go up to the right. Graph stop at edge of paper. Must go up forever!”

shdh 5 days ago

Fundamentals dictate hardware is a depreciating asset, they're not wrong. They're just ignoring the reality of the current market.

noosphr 5 days ago

This was true when Moore's law wasn't dead. Performance per watt has been flat since Ampere. There is a reason why undervolted 3090s are still used.

codechicago277 5 days ago

GPUs do have a life expectancy. They don’t run forever, especially at high temperatures and full utilization.

noosphr 5 days ago

You undervolt them because the last 50% of power adds 10% of compute.
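
A rough sketch of what that rule of thumb implies for performance per watt. It assumes one particular reading of the claim: going from 50% to 100% of the power budget raises compute by 10% relative to the half-power level. The 50% and 10% figures come from the comment above; the function name and the interpretation are assumptions.

```python
def perf_per_watt_gain(power_fraction: float, compute_gain_from_rest: float) -> float:
    """Perf/watt at reduced power, relative to perf/watt at full power.

    power_fraction: share of the full power budget actually used (e.g. 0.5).
    compute_gain_from_rest: extra compute the remaining power would add,
        relative to compute at the reduced power level (e.g. 0.10).
    """
    if not 0.0 < power_fraction <= 1.0:
        raise ValueError("power_fraction must be in (0, 1]")
    if compute_gain_from_rest < 0.0:
        raise ValueError("compute_gain_from_rest must be non-negative")
    # Compute at reduced power as a fraction of full-power compute.
    compute_fraction = 1.0 / (1.0 + compute_gain_from_rest)
    return compute_fraction / power_fraction


gain = perf_per_watt_gain(power_fraction=0.5, compute_gain_from_rest=0.10)
print(f"perf/watt vs full power: {gain:.2f}x")
```

Under that reading, the undervolted card delivers roughly 91% of the compute for half the power, which is why it can stay worthwhile when energy is the binding cost.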

fragmede 5 days ago

Performance goes way up if you use liquid nitrogen to cool the chips. Maybe finally someone's willing to pay for that.

DanielHB 5 days ago

I have been hearing that memory suppliers are _intentionally_ not scaling up new factories like crazy because they assume the demand won't be there in the long term and they don't want to have spare unused capacity. Probably because Samsung and SK have a near duopoly on it as well...

At some point the market will be saturated with supply and prices will come down for older gen hardware. It can take years though, but it happened to fiber cable and fiber doesn't even depreciate like chips.

mlyle 5 days ago

Will it continue to appreciate to infinity? Maintain its value forever? Or will something else happen?

The same argument you’ve made would work for tulip bulbs, dotcom prices, or whatever. Prices go up until they don’t. Exponentials don’t last forever and the intrinsics of technology assets depreciate: things wear out and are also replaced with better things.

iririririr 5 days ago

everything* is 3x more expensive in the same amount of time though. that's inflation mostly.

* except ram

rolisz 5 days ago

> if there are no customers they’re not costing you more money, unlike your computer which you are using for training

So are you using the computers or not? I'd argue that if you're using them for training, then it's not wasted capacity. And if you're not using them, then you can turn them off, so you're not sucking up energy.

eecc 5 days ago

> Compute is also a rapidly depreciating asset.

I don’t know, but this dude at my son’s school has a 32GB RTX 5090 and it’s worth more than he paid for it; and he did the same trick with the RTX 4090 before that.

As long as shortages are the rule, these assets are appreciating.

latchkey 5 days ago

"depreciating" is not being used in the right sense.

There is depreciation, which is taking the purchase price and dividing it across N number of years (typically 5). That's the D in EBITDA and is mostly used as a profitability calculation.

The depreciation of a GPU also gets mucked up in the current debt-financed GPU market (DDTL loans). The people running the GPUs often don't even own them; they lease them, so there is nothing for them to depreciate (the D).

The analogy that a GPU is like a used car makes zero sense. There is no oil or tires to change on a GPU. They don't wear out in the same way that a rental car would. They are housed in climate controlled locations with clean power. They just don't fail the way that is portrayed in the press.

Useful life of a GPU is based on profitability. When does opex exceed the revenue it brings in?

Some companies, like mine, also have support contracts. Anything goes wrong with the GPU (or any part of the system), Dell comes and fixes it at no extra charge. We just migrate customers and workloads to hot spares while the parts are replaced.

As for compute going down in value... the 122TB of enterprise NVMe and 2GB of RAM in each server that I bought 2 years ago are now worth vastly more than I paid for them. I'm also renting my GPUs out for more money now due to supply being so tight and demand being so high.
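
The two notions above (accounting depreciation versus profitability-based useful life) can be sketched side by side. This is a minimal illustration, not anyone's actual books: the purchase price, the yearly revenue and opex series, and both function names are hypothetical.

```python
from typing import Sequence


def straight_line_book_value(purchase_price: float, years: int, age: float) -> float:
    """Book value after `age` years when the price is spread evenly over `years`."""
    annual_charge = purchase_price / years
    return max(purchase_price - annual_charge * age, 0.0)


def useful_life_years(yearly_revenue: Sequence[float],
                      yearly_opex: Sequence[float]) -> int:
    """Number of leading years in which revenue still covers opex."""
    life = 0
    for revenue, opex in zip(yearly_revenue, yearly_opex):
        if opex > revenue:
            break
        life += 1
    return life


# Hypothetical GPU: 30,000 purchase price depreciated over the typical 5 years.
print(straight_line_book_value(30_000, years=5, age=2))

# Hypothetical rental income that decays while opex stays flat.
revenue = [12_000, 10_000, 8_000, 5_000]
opex = [6_000, 6_000, 6_000, 6_000]
print(useful_life_years(revenue, opex))
```

The two answers need not agree: the book value reaches zero on a fixed schedule, while the profitability-based life depends entirely on what the market pays for the hardware in each year.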

BoorishBears 5 days ago

Compute is about to become an appreciating asset in the near term, and in some ways it already is.

The frontier labs are shifting from pricing grounded in the price of compute, to pricing grounded in the intelligence provided, or more specifically the economic value of that intelligence downstream.

The margins on that allow them to pay a hefty premium on compute and still come out ahead.

As they buy more compute at high prices, they're also pricing out competition from cheaper models. It's already become materially more difficult to get compute to run open weight models at competitive prices as a result of frontier labs in the last year.

enos_feedler 5 days ago

There is zero evidence of this shift in pricing occurring. It’s still a dream which seems unlikely

BoorishBears 5 days ago

News to me?

Opus 4.7 has all the signs of a smaller model distilled from a newer pretraining run... except a smaller price.

Flash 3.5 raised in price pretty meaningfully over Flash 3

GPT 5.4 got a small price bump over gpt-5.3-Codex/gpt-5.2, then gpt-5.5 doubled pricing over gpt-5.4

Even open weights isn't immune: Kimi K2.6 was originally priced higher despite openly being 2.5 + more post-training, same with GLM 5.1 vs 5

All while rental prices are spiking month over month, and NVIDIA Inception discounted prices for buying are higher than undiscounted prices for buying 6 months ago...

dylan604 5 days ago

It feels like this is the line people are using to justify the expense of compute capex

BoorishBears 5 days ago

I run a consumer AI product and the current reality of trying to get compute vs what it was 6-12 months ago is enough to justify it to anyone who has the money.

I think OpenClaw created a mania that was completely unfounded (Apple Silicon is worth dirt compared to literally anything from NVIDIA including consumer GPUs), but the prediction of compute becoming scarce was correct

otterley 5 days ago

The fact that you can sell or lease out something for more than you bought it for is justification in and of itself.

adgjlsfhk1 5 days ago

Not necessarily. The GPU leases SpaceX has made are month to month, so they are taking on all of the risk. If demand goes down, they're the ones stuck with the assets.

otterley 5 days ago

That's a very big "if," which is what this thread has been about.

koolala 5 days ago

Dream? It is a nightmare that computers aren't getting significantly more efficient anymore.

leoedin 5 days ago

In the short term, compute becomes an appreciating asset.

In the medium term, everyone ramps up production. Huawei and other Chinese companies work really hard to develop in-house alternatives. At some point, the hype cycle will peak and less money will flow into datacentres (yes, this will happen. It always does. Even for technologies that change society. The bubble always bursts).

The question is not if this will happen. It will happen. It's just a question of when it happens and how big the magnitude of the cycle is.

iririririr 5 days ago

no need for a car analogy.

the comment you replied to is word for word what people hyping canadian telecoms were saying before the dotcom crash!
