| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by andoando 82 days ago
	Why isnt LLM training itself open sourced? With all the compute in the world, something like Folding@home here would be killer

4 comments

DesaiAshu 82 days ago

data bandwidth limits distributed training under current architectures. really interesting implications if we can make progress on that

link

dogcomplex 82 days ago

Limits but doesn't prohibit. See https://www.primeintellect.ai/blog/intellect-3 - still useful and can scale enormously. Takes a particular shape and relies heavily on RL, but still big.

link

andoando 81 days ago

What bandwith limits? Im assuming the forward and backward passes have to be done sequentially?

link

DesaiAshu 80 days ago

Yes also passing data within each layer

link

mike_hearn 81 days ago

It is in some cases. NVIDIA's models are open source, in the truest sense that you can download the training set and training scripts and make your own.

link

throwaway27448 81 days ago

It's either illegal or extremely expensive to source quality training material.

link

m4rtink 81 days ago

Yeah, turns out if you want to train a model without scrapping and overloading the whole of Internet while ignoring all the licenses and basic decency is actually hard & expensive!

link

doctorwho42 81 days ago

Well it is, it's in the name "OpenAI". /S

link