Hacker News new | ask | show | jobs
by Aurornis 477 days ago
It’s a $600 card with 16GB of RAM. That’s a good deal.
2 comments

It does seem like there is room for an $700-800 9070AI with 32 gb vram.
There's room but neither GPU vendor is willing to sell 32gb at that price
I get their point, but at the end of the day it's politics and marketing having it their own way.

With a 32GB card well below 1000$ it would sell like candies for anybody doing anything AI-related that's not training (you can easily run inference and fine tuning on such a card).

But it would massively eat in their data center sales which is what executives and investors want to see.

It's a tragedy because such a card would get a lot of love and support from amateurs to make it work great in the ML/AI context and thus improve their data center offerings long term.

So this is gonna end up in the same fashion AMD turns: it will disappoint or be ignored by most gamers cuz it has less brand power and no DLSS, and AMD will still disappoint at the data center level.

I think it could work out with a weak gpu (or high TDP). You want to make the card have higher TCO for datacenter, but if you make it a 3 slot card with 400W TDP that's 2x slower than your server GPUS, I think it works out. Once you have $10k of server (cpu+ram+networking) if your options are adding 2 9070AIs or 3 MI-300whatevers, the server GPUS would win for a server.
If you created a 32GB card that was great at AI workloads and cheap, it doesn't matter what you set the MSRP to. Street price would rise to the same level as other 32GB cards with similar performance.
The 4060 16GB was only about $440 a couple of months ago.