Can you summarize? I'm reading https://deepdive.opensource.org/wp-content/uploads/2023/02/D... but it seems to tackle too many questions when I'm really only interested on what criteria to use when deciding whether (for example) Stable Diffusion is open source or not.
Anyway, to go on a tangent, some day maybe with zero knowledge proofs we will be able to prove that a given pretrained model was indeed the result of training using a given dataset, in a way that can be verified vastly cheaper than training the model itself from scratch. (This same technique could also be applied to other things like verifying if a binary was compiled from a given source with a given compiler, hopefully verified in a cheaper way than compiling and applying all optimizations from scratch).
If this ever materialize, then we can just demand proofs.