| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by ftxbro 1180 days ago
	Yes there is petals/bloom https://github.com/bigscience-workshop/petals but it's not so great. Maybe it will improve or a better one will come.

3 comments

riedel 1180 days ago

I read that it is only scoring the model collaboratively but it allows some fine-tuning I guess.

Getting the actual gradient descent to parallelize is more difficult because one needs to average the gradient when using data/batch parallelism. It becomes more a network speed than GPU speed problem. Or are LLMs somehow different?

link

whalesalad 1180 days ago

Really interesting live monitor of the network: http://health.petals.ml

link

polishdude20 1180 days ago

I wonder how they handle illegal content. Like, if you're running training data on your computer, what's to stop someone else's data that is illegal, from being uploaded to your computer as part of training?

link