| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by fridder 2 hours ago
	We need a SETI@Home but for model training

5 comments

Azantys 1 hour ago

I think model training is pretty hard to do efficiently on a vastly distributed network. If the model cant fit into the VRAM of the node your performance becomes so bad its useless, so a distributed model could only be properly trained if the size of the model doesnt exceed the majority of the nodes VRAM sizes. Maybe there is a different way of doing training but this would be the only way I can see. And it would still be much worse than just using a big datacenter where everything is fully interconnected. BOINC projects work great because its usually just a lot of small compute and memory required so every old desktop and laptop can contribute. Training a model which can compete and is not tiny requires neither low compute or low memory amount. BOINC tasks take minutes usually or sometimes hours but not weeks or months like training a model from scratch. But something like 7B or lower could maybe be trained like this. Im not sure but I think someone is already working on something like this but I dont remember the name of the project.

link

wuschel 1 hour ago

My understanding is that in addition to your comment and the development of a method to separate the training data for distributed learning, the latency/bandwidth of systems connected on the internet is a challenge, too. Information has to be sent around before and after any hypothetical number crunching.

link

calebkaiser 1 hour ago

This has been a (noble) goal of lots of different projects in the community for a long time. Federated learning projects like Flower have been chipping away at it for a long time. There are many many hurdles to be cleared before anything in this area is super feasible as an alternative, but I applaud everyone who works on it.

link

0x3f 1 hour ago

Consumer hardware over the internet is not really suitable for this, AFAIK.

link

baby_souffle 1 hour ago

There's some really early days work on making training loops robust to failure but they all have trade-offs right now.

I remain hopeful that we'll be able to democratize the entire tech stack for this tech.

link

ainka-ainka 1 hour ago

Here's a project trying that - https://nousresearch.com/nous-psyche

link

kamranjon 2 hours ago

Have been thinking about this a lot lately.

link