| > AI models are intellectual property If companies train on data they don't own and expect to own their model weights, that's hypocritical. Model weights shouldn't be copyrightable if the training data was pilfered. But this hasn't been tested because models are locked away in data centers as trade secrets. There's no opportunity to observe or copy them outside of using their outputs as synthetic data. On that subject, training on model outputs should be fair use, and an area we should use legislation to defend access to (similar to web scraping provisions). |
Its not hypocritical to follow a line of legal analysis whoch holds that copying material in the course of training AI on it is outside the scope of copyright protection (as, e.g., fair use in the US), but that the model weights resulting from the training are protected by copyright.
It maybe wrong, and it may be convenient for the interests of the firms involved, but it is not self-inconsistent in the way required for it to be hypocrisy.