Hacker News new | ask | show | jobs
by bangkoksbest 940 days ago
The model is the instance because the instance is what is being used at inference time. The training simply produces some set of weights.

Open source would mean someone could see and run the model code locally/independently to create an instance. For an LLM, this is much less insurmountable of an issue in terms of resources needed.

2 comments

We should probably be calling for an open model or open data approach if that's what we think is best. Calling it open source leaves plenty of gray area in the definition.

For example, I run a few instances of open source software though my instances' databases are private.

That is questionable.

We are still answering questions on LLM/AI scaling. Ya, you might have your cottage industry AI giving out answers on the trickle of information you feed it, but will that even be comparable to one being fed megawatts of power with terabytes of data per second flowing into its databanks?