Regulating open-source AI will only help giants like Google and OpenAI, stifling innovation. It creates barriers only they can afford, limiting competition and diversity. Open-source fosters transparency and rapid progress. We do not need government regulations, or we'll end up like Europe with China leading.
I think all of the hand-wringing about open weights vs. open source, and which term to use, is an ideological point people are getting stuck on. I don’t think it’s a real problem.
Small models can be fine-tuned to perform specific tasks with similar accuracy to large models. Small models can be served for internal use with a VERY modest hardware outlay.
There are also providers now that will let you upload low-rank adapters you have trained in top of open foundational models, so that you can use their efficient serverless infrastructure with your fine-tuned models. This requires even less capital.
None of this would exist had OpenAI’s vision of centralized, locked-down API access become the reality.
That's true, but "White House" in this headline was actually referring to "Alan Davidson, an assistant secretary of the U.S. Commerce Department". I'm actually not sure where the article got the term "open source" from either, it says it's from a report but I don't know what report or who wrote it. It seems like they are using "White House" as a term for people in the executive branch working at the direction of the White House, which is common.
When I say "the White House has quite competent people working on this" it's because the NTIA RFC I linked was based on a direct executive order from President Biden.
On October 30, 2023, President Biden issued an Executive Order on “Safe, Secure, and Trustworthy Development and Use of Artificial Intelligence,” which directed the Secretary of Commerce, acting through the Assistant Secretary of Commerce for Communications and Information, and in consultation with the Secretary of State, to conduct a public consultation process and issue a report on the potential risks, benefits, other implications, and appropriate policy and regulatory approaches to dual-use foundation models for which the model weights are widely available. Pursuant to that Executive Order, the National Telecommunications and Information Administration (NTIA) hereby issues this Request for Comment on these issues. Responses received will be used to submit a report to the President on the potential benefits, risks, and implications of dual-use foundation models for which the model weights are widely available, as well as policy and regulatory recommendations pertaining to those models.
Better question: Is it possible to restrict open source AI?
If it is open then everyone has access. Any restrictions would be akin to demanding that linux not use strong encryption or firefox implement content censorship.
Its possible to restrict DIY building of pretty much anything if your end goal was to stop people from doing something outside of their basement with it. I can't build my own open source coal fired power plant and except to sell power without the EPA coming to kill me. Same would be if i used a open source AI that violated some new consumer protection / anti fraud law if i choose to use it over the public internet / build it into a product. Hell you could probably go after the devs for being accessory if you really wanted to.
The license really does nothing to protect your project from regulation its just that the government doesn't care about open source yet.
It hasn't proven possible yet but who knows what will be developed the future?
A lot of open source efforts depend on big companies training million-dollar models and giving them away. These companies will often apply some censoring adjustment to the weights, which the open source community then undoes through fine tuning.
But perhaps in the future new methods of censorship will be developed, which are radically harder to undo?
And of course, there's always heavy-handed options available - if we can require hairdressers to hold professional licenses, we could require the same of anyone who wants to upload to huggingface or civitai.
Why? All three of those companies have an obvious and outspoken commitment to releasing models for free, they aren't trying to manipulate fear to sell a product like OpenAI.
Microsoft is invested in OpenAI. Have they released any open weight models themselves?
Apple is similarly partnering with OpenAI.
Google has released nerfed versions of its Gemini models (Gemma 2B and Gemma 7B).
It seems the only company truly setting an example for open weights is Meta. Hopefully the other companies realize that it’s in their best interest to do the same (as it seems Google is begrudgingly realizing with its nerfed releases).
Eventually open weights will win, and if OpenAI (the most ironically named company) continues to rely on closed models as its moat, it will lose.
Not only have all three of these companies released LLM model weights, Google and Microsoft especially have released nearly countless model architectures, weights, evaluation/training/inference/etc frameworks, toolkits, etc across an extremely wide spectrum.
They mainly offer open tooling to help you adopt their closed-source tech (e.g. RAG frameworks) but you don't get the freedom to do what you want with the model (which is the core).
At some point, these big companies may even want to put more regulation or restrictions on their open-weights / open-source competitors. "Hey this is not validated for safety, you can use only Gemini"
They don't? Microsoft and Google both promote Open Source inferencing frameworks, and while both also have proprietary products I think it's completely dishonest to say they "mainly" support them. ONNX and Tensorflow have been supported longer and more productively than any of the closed-source AI frameworks Google or Microsoft offer.
Well then yes, I would definitely say Phi. And on top of that Microsoft has been releasing a ton of the tooling surrounding LLMs and AI as well, including Semantic Kernel.
Really makes me wonder what the angle here is for them. With open source, I can somewhat understand given market standards become easier to hire for and you get feedback on your tooling. It seems unlikely that either will be true for open-weight models. It also seems unlikely they would be able to establish market domination and then increase prices if everything remains open.
In all three cases, they own hardware (Apple in a slightly different sense) that they'd love for you to pay to run your favorite models on, open or closed.
fascinating how successful the bullshit rebranding of "sometimes we upload a TB file of weights" as "open source" has gone for them politically. I can't really imagine they thought it would go this well. will be interesting to see how this unexpected boon of a loophole will change their strategy, particularly """OpenAI""", who didn't think to do give themselves enough cover.
Say the government actually wanted to regulate open source AI, how exactly would they do that?
If the govt wants to regulate commercial AI, they can do that by going to the companies responsible for building that AI and say "do X, Y and Z or else we will punish you with A, B or C". But what do you say to a collective of developers that could be all around the globe? What stops that group from just saying "F* You" and continuing on?
Good. Effective Austruism is trying to destroy democratic institutions and it's likely a bigger threat to society than fascism. The pro-progress people needs to organize themselves to stop types like SBF from giving AI to China.