|
|
|
|
|
by nickpsecurity
416 days ago
|
|
The FairTrained models claim to train with only public domain and legal works. Companies are also licensing works. This company has a lawful, foundation model: https://273ventures.com/kl3m-the-first-legal-large-language-... So, it's really the majority of companies breaking the law who will be affected. Companies using permissible and licensed works will be fine. The other companies would finally have to buy large collections of content, too. Their billions will have go to something other than GPU's. |
|
Not really sure a claim is good enough. I don't know that you can just go into court and say, "Trust me, I don't use copyrighted material."
And I also can't see any way, other than providing training data and training an identically structured model on that data, that a company can conclusively show that they got the weights in an allegedly copyright free model from the copyright free training data a company provides.