Y
Hacker News
new
|
ask
|
show
|
jobs
by
greenavocado
65 days ago
So distribute copies of the model in RAM to multiple machines, have each machine update different parts of the model weights, and sync updates over the network
1 comments
olliepro
65 days ago
decentralized training makes a lot more sense when the required hardware isn't a $40K GPU...
link