xAI Is Reportedly Using Just 11% of Its 550k Nvidia GPUs

	xAI Is Reportedly Using Just 11% of Its 550k Nvidia GPUs (wccftech.com)
	27 points by lossolo 50 days ago

8 comments

maxrev17 49 days ago

Why is everyone stating conjecture-based answers, like my 10-year-old kid, with absolutely no evidence to back them up?

brazukadev 49 days ago

The most childish reply to this thread is yours, max.

aggakake 50 days ago

Aren't xAI's datacenters powered by [currently very expensive] diesel?

Frannky 49 days ago

Grok is pretty bad. No wonder usage is low. I think they messed up when they removed the human annotation team and went in the direction of automation.

The bet can eventually pay off when they figure out how to train without human help and also generate useful models. Imagine is terrible too.

More competition is great for us users. I hope they recover. In the meantime, why not host OSS models like Google does?

nwah1 49 days ago

My understanding is that inference (running existing models) is around a quarter of the average compute budget for AI companies. Training new models takes up about three quarters.

As such, using only 11% of their GPUs indicates that they've elected not to do as much training as they are capable of.

brazukadev 49 days ago

If they "elected" to do that, with such a terrible model, they are the most incompetent AI lab ever.

indoordin0saur 46 days ago

I've actually found it to be very good, as good as the other big models. Which version did you most recently use and for what purpose?

Frannky 45 days ago

I tried all the versions, starting from Grok 3. Grok 3 was the one I was getting real work done with. After that, just terrible experiences.

I use LLMs for asking questions and coding. Generated answers are bad. Generated code is bad. Images and videos look super fake. And there is no fixed subscription to use it in CLI terminals.

If it were as good as other LLMs, as you say, why is usage at 11%, and why sell compute to Claude after all the badmouthing? I prefer DeepSeek 10x.

dlcarrier 49 days ago

That's a problem that any general-purpose design has. It's something Dojo would have fixed, but it went too far in the other direction and only supported training. Rumor has it the new version will support inference too.

thatguysaguy 49 days ago

It looks like people in this thread are confusing fleet utilization and MFU. If they're doing a lot of RL, it's really not surprising to see such low numbers.
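For anyone unsure of the difference, here is a rough sketch of the two metrics in Python. The function names and all the numbers are made up for illustration and are not xAI's figures. The MFU part assumes the common rule of thumb of roughly 6 FLOPs per parameter per token for dense transformer training, which ignores attention costs.

```python
def fleet_utilization(busy_gpu_hours: float, total_gpu_hours: float) -> float:
    """Fraction of available GPU-hours during which GPUs were running any job."""
    return busy_gpu_hours / total_gpu_hours


def model_flops_utilization(
    tokens_per_second: float,
    n_params: float,
    n_gpus: int,
    peak_flops_per_gpu: float,
) -> float:
    """Achieved model FLOP/s divided by the theoretical peak of the GPUs in the job.

    Assumes ~6 * n_params FLOPs per training token (forward + backward),
    a rule-of-thumb approximation for dense transformers.
    """
    achieved_flops_per_second = 6 * n_params * tokens_per_second
    peak_flops_per_second = n_gpus * peak_flops_per_gpu
    return achieved_flops_per_second / peak_flops_per_second


if __name__ == "__main__":
    # Hypothetical numbers, chosen only to show that the two metrics are independent.
    fleet = fleet_utilization(busy_gpu_hours=900.0, total_gpu_hours=1000.0)
    mfu = model_flops_utilization(
        tokens_per_second=1.0e6,
        n_params=1.0e9,
        n_gpus=100,
        peak_flops_per_gpu=1.0e15,
    )
    print(f"fleet utilization: {fleet:.0%}, MFU: {mfu:.0%}")
```

The point is that a fleet can be almost fully booked while each job achieves a low MFU, or the reverse, so a single "11%" figure means very different things depending on which metric it is.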

londons_explore 49 days ago

Part of this is a human problem. The company wants better utilisation, so it hires resourcing experts tasked with allocating resources between projects and teams.

These experts set up quota systems, priority allocation, month-ahead plans, burst and idle quotas, etc, all with a goal to get the resource better used.

However, it ends up having the reverse effect: teams now waste the resource deliberately to make it appear they have better utilisation, and run pointless jobs because "use it or lose it" quota systems discourage being thrifty.

These problems are compounded by there being hundreds of resource types - "I've got plenty of CPU and GPU TFlops for my project, but I've run out of disk spindle hours so can't run the training job".

End result is that the company as a whole doesn't even know real utilisation, and makes exceptionally poor use of resources.

alexdumny 50 days ago

This is the exact information I am looking for.

downrightmike 50 days ago

Soon the market will flood with liquidations of everything from these

blourvim 50 days ago

The article says this is a software issue, where the GPUs can't be fully utilized due to scaling problems. I don't know how hardware at that scale works, but it could very well be that they still need all of their hardware to get their current compute.

brazukadev 50 days ago

If they had the demand, this problem would be fixed. Even giving away free credits, xAI would not get the users; nobody wants to use Elon's LLM.

That's why he bought Cursor: to get customers, so there is an audience to give free credits to.

kelseyfrog 50 days ago

Where does one go (virtually or physically) to participate as a buyer in these markets?

grosswait 49 days ago

“Elon Musk's xAI, the software firm behind Gorq” - this is not an autocorrect error.
