| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Tycho 490 days ago
	Yes. Why do get these replies on HN that seem to only consider the most shallow, surface details? It could well be that xAI wins the AI race by betting on hardware first and foremost - new ideas are quickly copied by everyone, but a compute edge is hard to match.

3 comments

HarHarVeryFunny 490 days ago

The compute edge belongs to those like Google (TPU) and Amazon/Anthropic (Trainium) building their own accelerators and not paying NVIDIAs 1000% cost markups. Microsoft just announced experimenting with Cerebras wafer scale chips for LLM inference which are also a cost savings.

Microsoft is in process of building optical links between existing datacenters to create meta-clusters, and I'd expect that others like Amazon and Meta may be doing the same.

Of course for Musk this is an irrational ego-driven pursuit, so he can throw as much money at it as he has available, but trying to sell AI when you're paying 10x the competition for FLOPs seems problematic, even you you are capable of building a competitive product.

link

Tycho 490 days ago

Timing matters. A long term strategy for superior hardware might bear fruit too late.

link

HarHarVeryFunny 490 days ago

I'm not sure about that - I expect AI is going to become a commodity market, so it doesn't matter how late you are if you've got a cheaper price.

In terms of who's got a lead on cheap (non-NVIDIA) hardware, I guess you have to give it to Google who are on their 6th generation TPU.

link

vardump 490 days ago

I wonder how Tesla's training computer Dojo is doing. Although I guess there's a reason for buying so much Nvidia hardware...

link

theckel 490 days ago

Curious where you saw the Microsoft/Cerebras experimentation noted online? That's very interesting.

link

HarHarVeryFunny 490 days ago

It was mentioned in Anthropic Jack Clark's "Import AI" newsletter.

https://jack-clark.net/2025/02/17/import-ai-400-distillation...

link

gmerc 490 days ago

DeepSeek just showed the compute edge is not that hard to match. They could have chosen to keep the gains proprietary but probably made good money playing the market instead, quants as they are.

https://centreforaileadership.org/resources/deepseeks_narrat...

If you’re using your compute capacity at 1.25% efficiency, you are not going to win because your iteration time is just going to be too long to stay competitive.

link

scarmig 490 days ago

Software and algorithmic improvements diffuse faster than hardware, even with attempts to keep them secret. Maybe a company doubles the efficiency, but in 3 months, it's leaked and everyone is using it. And then the compute edge becomes that much more durable.

link

mirekrusin 490 days ago

Optimisation efforts don’t negate investment in capacity but multiply output.

link

Tycho 490 days ago

Sorry, you missed the point - DeepSeek tried some new software ideas, they did not manage to secure the same computation capacity.

link

gmerc 490 days ago

They achieved the same results for 1.25% of the computation cost... If they actually had that computation capacity, it would be game over with the AGI race by the same logic.

link

acchow 490 days ago

> but a compute edge is hard to match.

xAI bought hardware off the open market. Their compute edge could dissappear in a month if Google or Amazon wanted to raise their compute by a whole xAI

link

Tycho 490 days ago

Not if there’s a hardware shortage.

link

acchow 490 days ago

Ok, 2 months.

Remember, the new B200 have 2.2x the performance of xAI’s current H100 “hardware edge”. So it only takes an order half the size.

Or you could order the old H100 instead and avoid the B200 shortage.

link