Hacker News
by Hizonner 689 days ago
> that can't happen with the vast majority of these models because they're trained on unlicensed data

Tough beans? There's lots of actual software that can't be open source because it embeds stuff with incompatible restrictions, but nobody tries to redefine "open source" because of that.

... and, on a vaguely related note, you'd better hope that the models you're using end up being found noninfringing, or fair use, or something, with respect to those "unlicensed data", because otherwise you're in a world of hurt. It's actually a lot easier to argue that the models aren't copyrightable than it is to argue that they're not derivative of the input.

> I've decided to draw my personal line at Open Source Initiative compliance for the license they release the model itself under.

You're allowed to draw your personal line about what you'll use anywhere you want, but that doesn't mean that you should try to redefine "open source" or support anybody who does.