Hacker News new | ask | show | jobs
by orochimaaru 940 days ago
This was a view that was initially taken by the government for encryption as well. But everyone can agree open sourcing the algorithms and libraries has been the best move.

It is the same with AI/AGI. Anything closed source and having regulatory oversight is useless, decreases innovation, increases bureaucracy and will only serve those who wish to build a “moat” to further their hold on the technology.

2 comments

Sure, open source worked well for encryption. It wouldn't work well for bioweapons or nuclear weapons technology. The question is which category does AGI best fit? We shouldn't elide analysis for obtuse comparisons to the past.
> Anything closed source and having regulatory oversight is useless

Source isn't even the problem, unless you're a billionaire, you can't afford to train the model.

This isn't the wheel, the printing press, the PC, or the public internet, which created opportunities for everyone.

This is pay-to-play that is only affordable to the nation-state the mega-corporation, and the latter might let you play around as a digital sharecropper on their platform, until they cut you off, because they can make more money by having first-party ownership of whatever you built.

The model is the instance because the instance is what is being used at inference time. The training simply produces some set of weights.

Open source would mean someone could see and run the model code locally/independently to create an instance. For an LLM, this is much less insurmountable of an issue in terms of resources needed.

We should probably be calling for an open model or open data approach if that's what we think is best. Calling it open source leaves plenty of gray area in the definition.

For example, I run a few instances of open source software though my instances' databases are private.

That is questionable.

We are still answering questions on LLM/AI scaling. Ya, you might have your cottage industry AI giving out answers on the trickle of information you feed it, but will that even be comparable to one being fed megawatts of power with terabytes of data per second flowing into its databanks?