|
|
|
|
|
by p1esk
1452 days ago
|
|
Can it measure internode traffic for distributed training runs? This is something I needed recently and couldn’t achieve using nccl-test utilities like mpirun. ib_write_bw also didn’t work, I suspect because of multiple virtual links. |
|