Hacker News new | ask | show | jobs
by DanyWin 1196 days ago
To me OpenChatKit is just a first step towards better and better open-source models. Other actors like AWS and Hugging Face are also working on that and Hugging Face has already proved its ability to train and make available LLMs on a huge scale like Bloom.

I think it's just the beginning and the open-source community will provide very competitive LLMs.

1 comments

> I think it's just the beginning and the open-source community will provide very competitive LLMs.

It is indeed. The way to challenge OpenAI's offerings is with open-source AI models. Even better when they achieve and surpass GPT-4 level capabilities.

They (OpenAI) cannot win the race to the bottom or $0. Stable Diffusion (and even the leaked Facebook LLaMa) is already at the finish line and more alternatives will also be there to surpass GPT-3 and 4 and will release them for free in the open.

Eventually, Open source AI models will eventually disrupt closed ones. Just like how DALLE-2 has been disrupted quickly by Stable Diffusion.

I think one caveat is access to training data. If proprietary models can be trained on useful data from private sources, or worse, if there are successful legal challenges against using public but copyrighted data for training, then it will be difficult for open-source models to compete with proprietary models.
1) In a lot of western countries (EU, UK too I think) the hammer has already come down in favor of using public but copyrighted data.

2) Wouldn’t that cause open source models to be favored? A big company has lawyers that ensure that the internal practices comply with the law while on the other hand, good luck suing some random guy from 4chan who made a model that may or may not incorporate copyrighted data.

Sure there will be "bootleg" models available on BitTorrent or whatever, but generally "open source" refers to legitimately licensed code a big company with lawyers would be ok with incorporating into their own business.
Your incorrect assertions about the current state of play was what turned this from a bit of a hypothesis to “open source always wins” delusion.