|
|
|
|
|
by llm_trw
582 days ago
|
|
At this scale cuda is quite useless. You need to develop your own in house solution to distributing workloads. The difference to regular clusters is that all the memory is globally visible, so machine 0023 can access and modify address 0x0123456789abcdef0123456789abcdef which happens to be on machine 0999. |
|