Y
Hacker News
new
|
ask
|
show
|
jobs
by
hoss1474489
206 days ago
GPUs in 16x slots is still important for LLM stuff, especially multi-GPU, where lots of data needs to move between cards during computation.
2 comments
__alexs
205 days ago
A 16x PCIE 6.0 setup has more bandwidth than any dual channel DDR5 memory kit.
link
Dylan16807
206 days ago
Depends on what you're doing. I'm pretty sure the bandwidth for inference isn't much.
link
eurekin
205 days ago
Depends, if it's tensor parallel or pipeline parallel. Only PP doesn't pass too much. TP does
link