|
|
|
|
|
by api
825 days ago
|
|
The entire AI industry is powered by piracy at a massive scale. Very little training data is properly licensed or compensated. It's just more obvious with open models because we can investigate them. Closed models are sausages and we don't know what went in. Download a movie and you can get sued or your Internet connection terminated, but pirate the entire collective output of humanity and sell it back to us from behind a paywall and that's fine. I have more sympathy for Stability here because at least they opened the models. IMHO models trained on not-properly-licensed (pirated) data should at the very least not be copyrightable and should be public domain. (These piracy enterprises are aware of this as a possible legal outcome in some jurisdictions, so the whole AI safety bullshit performance is an attempt to scare people about open models to head off the potential of questionably-trained models being declared uncopyrightable and forced to be released.) |
|
My understanding is that ML model weights cannot be copyrighted as an original creative work. They are trade-secrets and protected through contracts but once leaked to third parties it’s not a copyright violation to use/distribute.
Whether the model is actually a derivative work of the training data is another interesting question.
Or is my theory off here?