I feel to be consistent the output of that model will also be under that same open license.
I can see this being extremely limiting in training data, as only "compatible" licensed data would be possible to package together to train each model.
So what? Figure it out. They have billions in investor funding and we’re supposed to just let them keep behaving this way at our expense?
Facebook was busted torrenting all sorts of things in violation of laws/regulations that would lead to my internet being cut off by my ISP. They did it at scale and faced no consequences. Scraping sites, taking down public libraries, torrenting, they just do whatever they want with impunity. You should be angry!
I can see this being extremely limiting in training data, as only "compatible" licensed data would be possible to package together to train each model.