Hacker News new | ask | show | jobs
by p1esk 854 days ago
H200 has 141GB, B100 (out next month) will probably have even more. How much memory do you need?
1 comments

We need 128gb with a 4070 chip for about 2000 dollars. Thats what we want.
I've never tried it, but in Windows you can have CUDA apps fall back to system ram when GPU vram is exhausted. You could slap 128gb in your rig with a 4070. I'm sure performance falls off a cliff, but if it's the difference between possible and impossible that might be acceptable.

https://nvidia.custhelp.com/app/answers/detail/a_id/5490/~/s...

Nvidia will not build that any time soon. RAM is the dividing line between charging $40,000 vs $2500…
Please give me some DIMM slots on the GPU so that I can choose my own memory like I'm used to from the CPU-world and which I can re-use when I upgrade my GPU.
An M1 Mac Studio with that much RAM can be had for around $3K if you look for good deals, and will give you ~8 tok/s on a 70B model, or ~5 tok/s for a 120B one.
Unfortunately production capacity for that is limited, and with sufficient demand, all pricing is an auction. Therefore, we aren't going to be seeing that card in years
Yes please.