| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by zdwolfe 255 days ago
	I find it odd that any LLM could be considered open source. Sure the weights are available to download and use, but you can't reasonably reconstruct the output model as it's impractical for an individual to gather a useful dataset or spend $5,000,000+ of GPU time training.

1 comments

_ea1k 255 days ago

Distillation can extract the knowledge from an existing model into a newly trained one. That doesn't solve the cost problem, but costs are steadily coming down.

link

goku12 254 days ago

That's still a crude repurposement of an inscrutable artifact. Open source requires you to share the source data from which that artifact (the model parameters) was created.

link