Y
Hacker News
new
|
ask
|
show
|
jobs
by
meehai
265 days ago
Yeah, but if you can do topologies based on latencies you may get some decent tradeoffs. For example with N=1M nodes each doing batch updates in a tree manner, i.e the all reduce is actually layered by latency between nodes.