I'm really intrigued to get one, but where is the 32GB version for all the LLM g...

Aurornis · 2025-02-28T14:07:36 1740751656

It’s a $600 card with 16GB of RAM. That’s a good deal.

adgjlsfhk1 · 2025-02-28T14:19:18 1740752358

It does seem like there is room for an $700-800 9070AI with 32 gb vram.

Numerlor · 2025-02-28T15:45:31 1740757531

There's room but neither GPU vendor is willing to sell 32gb at that price

epolanski · 2025-02-28T15:59:50 1740758390

I get their point, but at the end of the day it's politics and marketing having it their own way.

With a 32GB card well below 1000$ it would sell like candies for anybody doing anything AI-related that's not training (you can easily run inference and fine tuning on such a card).

But it would massively eat in their data center sales which is what executives and investors want to see.

It's a tragedy because such a card would get a lot of love and support from amateurs to make it work great in the ML/AI context and thus improve their data center offerings long term.

So this is gonna end up in the same fashion AMD turns: it will disappoint or be ignored by most gamers cuz it has less brand power and no DLSS, and AMD will still disappoint at the data center level.

adgjlsfhk1 · 2025-02-28T16:51:10 1740761470

I think it could work out with a weak gpu (or high TDP). You want to make the card have higher TCO for datacenter, but if you make it a 3 slot card with 400W TDP that's 2x slower than your server GPUS, I think it works out. Once you have $10k of server (cpu+ram+networking) if your options are adding 2 9070AIs or 3 MI-300whatevers, the server GPUS would win for a server.

Aurornis · 2025-02-28T17:52:16 1740765136

If you created a 32GB card that was great at AI workloads and cheap, it doesn't matter what you set the MSRP to. Street price would rise to the same level as other 32GB cards with similar performance.

kube-system · 2025-02-28T14:37:50 1740753470

The 4060 16GB was only about $440 a couple of months ago.

ryao · 2025-02-28T14:55:24 1740754524

That is likely to be AMD’s workstation variant. Here is the workstation variant of the 7900 XTX that had double memory:

https://www.techpowerup.com/gpu-specs/radeon-pro-w7900.c4147

BoredPositron · 2025-02-28T14:20:19 1740752419

There were rumors that a 32GB skew is coming.

nosebear · 2025-02-28T14:31:18 1740753078

And AMD already said there is not a 9070XT 32GB coming. Which I understand as "we're building a 32GB card with this chip, but it's not coming before christmas and will cost you a kidney".

I really would like to upgrade from my 2070 Super but I'm not getting a 16GB card now just to buy another one with 32GB later on.

ryao · 2025-02-28T14:56:54 1740754614

I don’t believe that considering that there was a 48GB 7900 for the workstation market:

https://www.techpowerup.com/gpu-specs/radeon-pro-w7900.c4147

The denials are probably more saying that there will be no consumer targeted 32GB version than that there will be no 32GB version at all.

nosebear · 2025-02-28T14:59:10 1740754750

Yes, I agree. It will probably be a 3000$ unit.

lostmsu · 2025-02-28T16:42:42 1740760962

I'd rather have a $3000 one with 80GB

dwood_dev · 2025-03-01T15:25:59 1740842759

If I'm spending $3k, I'm probably getting a Nvidia project digits box with 128GB.

Then again, if the tokens/s of digits is comparable to a M4 Max 128GB, then I'm getting a MBP instead.