Hacker News new | ask | show | jobs
by tyler33 954 days ago
we just have to download the weights and use piratebay, or even emule, just like old times
1 comments

Yes, but companies will have no incentive to publish open source models anymore. Or, it could be so difficult/beaurocratic no one will bother and keep it close source
What will actually happen is that innovation there will just move somewhere else (and it has partly done so).

This proposal is the US doing that bicycle/stick meme, it will backfire spectacularly.

The innovation is largely happening within the megacorps anyway, this is solely intended to make sure the innovation cannot move somewhere else.
Mistral and Falcon is not from megacorps and not even US.and many other opensource chinese models . And both are based models that means they are totally organic outside of US.
That’s what they told us. Turns out Google stopped innovating a long time ago. They could say stuff like this when Bard wasn’t out but now we have Mistral and friends to compare to Llama.

Now it turns out they were just bullshitting at Google.

> Now it turns out they were just bullshitting at Google.

I don't think Google was bullshitting when they wrote, documented and released Tensorflow, BERT and flan-t5 to the public. Their failure to beat OpenAI in a money-pissing competition really doesn't feel like it reflects on their capability (or intentions) as a company. It certainly doesn't feel like they were "bullshitting" anyone.

Everyone told us they had secret tech that they were keeping inside. But then Bard came out and it was like GPT-3. I don’t know man. The proof of the pudding is in the eating.

> The innovation is largely happening within the megacorps anyway

That was the part I was replying to. Whichever megacorp this is, it’s not Google.

Still, one of those corporations wants to capture the market and has monopolistic attitude. Meta clearly chose the other direction, when publishing their models and allowing us all to participate in.
Then we'll create a distributed infrastructure for the creation of models. Run some program and donate spare GPU cycles to generate public AI tools that will be made available to all.
I really really would like to believe this could work in practice.

Given current data volume used during the training phase (tb/s), I highly doubt it's possible without two, magnitude changing, breakthroughs at once