Hacker News new | ask | show | jobs
by AnthonyMouse 844 days ago
NVLink is most needed for training. For inference a lot of the popular models can usefully be run on multiple GPUs without it:

https://www.reddit.com/r/LocalLLaMA/comments/142rm0m/llamacp...