Hacker News new | ask | show | jobs
by brianjking 1153 days ago
I'll never understand why everyone is spending so much time on a model you cannot use commercially (at all).

Secondly, most of us can't even use the model for research or personal use, given the license.

7 comments

The notion that model weights are copyrightable is absurd on its face. In the US you cannot gain a copyright though sweat of the brow, there must be substantial creative work. Nor does mere collection (e.g. feist v rural) create a copyright. Feeding common crawl to a standard network structure and letting an optimizer do its thing isn't creative, it's just expensive. It requires expertise and skill, sure but so does creating a phone book.

The companies working on AI would be foolish to argue for more copyrightability are because it would be hard to conclude the models were copyrightable works without also concluding that the models are unlawful derivatives of the material they were trained on. "Congrats, models can be owned, but regrets: you're bankrupt now because you just committed 4.6 billion acts of copyright infringement carrying statutory damages of $250k each."

You might argue that this is far from sure, OKAY-- but parties that take this view will out-compete ones that don't. If it does turn out to be problematic, the people that had something to work from now will pivot to backing their work on something else and will still be ahead of people sitting on their hands.

You could see it as a calculated risk, but it seems at least as safe as the one behind the underlying authors of the model weights training on material they're not licensed to distribute.

If you can use it for research/purpose purposes, you just do, and if you manage to build something good enough, you can get funding to train your own model then.

Also, with business there are few "can do / can't do" - it's about managing risks. If a penalty for doing is negligible (FB cannot catch you abusing license in private), from a business standpoint there is no issue in doing so - especially with things that are ethically kind-of-ok.

Because none of the interesting things I want to do with LLMs have anything to do with commercial uses. And because I don't care what a license says when I'm doing stuff in private. I just case whether I am practically able to do it.
For me it's because most of what I am learning and trying to do is applicable to LLM's in general. One day the right model will come along, until then I want to play.
There are efforts to provide an open source replica of the training dataset and independently trained models. So far the dataset has been recreated following the original paper (allowing for some vagueness that Meta researchers didn't specify):

https://github.com/togethercomputer/RedPajama-Data/

https://twitter.com/togethercompute/status/16479179892645191...

Why can’t you use it for personal use?

I doubt the Facebook Police are going to bust down your door at 3am.

…or are they? peeks through curtains

Because it's fun :-) And effort to bring up a could-be-commercial version is on going.

https://www.together.xyz/blog/redpajama