by ipsum2 1033 days ago
This is wrong. NVLink is crucial for tensor parallelism, both when training models and when running inference on large (>40B parameter) models.
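
A rough back-of-envelope sketch of why the interconnect matters here. It assumes Megatron-style tensor parallelism (two all-reduces per transformer layer in the forward pass) and a ring all-reduce; the function names and parameters are illustrative, not from any library.

```python
def allreduce_bytes_per_gpu(tensor_bytes: float, n_gpus: int) -> float:
    """Bytes each GPU sends in a ring all-reduce of a tensor of the given size."""
    return 2 * (n_gpus - 1) / n_gpus * tensor_bytes


def tp_forward_comm_bytes(
    n_layers: int,
    hidden: int,
    tokens: int,
    n_gpus: int,
    bytes_per_elem: int = 2,        # assumption: fp16/bf16 activations
    allreduces_per_layer: int = 2,  # assumption: one after attention, one after the MLP
) -> float:
    """Per-GPU bytes sent over the interconnect for one forward pass."""
    activation_bytes = tokens * hidden * bytes_per_elem
    per_allreduce = allreduce_bytes_per_gpu(activation_bytes, n_gpus)
    return n_layers * allreduces_per_layer * per_allreduce


def comm_seconds(total_bytes: float, link_bytes_per_s: float) -> float:
    """Bandwidth-only transfer time; ignores latency and compute overlap."""
    return total_bytes / link_bytes_per_s
```

Plug in the per-GPU bandwidth of whichever links you are comparing. Because these all-reduces sit on the critical path of every layer, the ratio between the two results is roughly the ratio of the link bandwidths.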