by CoastalCoder
598 days ago
I'm curious: Suppose I upload some code to GitHub, but I didn't have the authority to share it with anyone at all. And then it was used to train DL models. How would various jurisdictions handle that? Would any of them force the deletion of all resulting model weights? And how might the remedies differ based on the kind of data? E.g., copyright vs. trade secret vs. protected medical info vs. military secrets?

Microsoft wouldn't be able to pull that code out of already trained models and, given that MS didn't do anything illegal when they used code that you said was yours to share, I wouldn't expect them to be liable at all. That means MS wouldn't likely be fined, nor would they have to eat the costs of removing the models entirely.
If it were that easy to ruin anyone's model after it was trained, no one would be able to make one at all. The training sets used to date almost certainly contain legally questionable content, and anyone interested in stopping GitHub (for example) would just pepper repos with content that violates licenses.