Yes, I agree and this annoys me. From a privacy perspective, it's still better to have fifty siloed data stores that aren't allowed to mix than for your data to be in a single large store that can do whatever it wants.
And when you consider the comparison, it's even worse: the data they're refusing to allow Apple to train on is _public_ data. There were several court cases in the early 1900's where owners of big buildings tried to sue photographers for selling pictures of their buildings (or pictures that included their buildings) and it was finally agreed that since the building was in public, pictures of it were fair game. If you're putting an article on the internet for anybody to read, "anybody" seems like it would include an AI as well.
Interesting! There's a difference though to me: if the building has a giant poster of a popular IP on it, afaik it doesn't give the photographer any rights to use that IP in their creations. I see enough series shot in Vancouver that avoid including things like that or keep it blurred out if it can't be out of frame...
Content on websites has copyright in some form, just because it isn't behind a paywall/accountwall doesn't make it public domain.
But the AI didn't reproduce it, the AI "read" it and incorporated it into its knowledge base. I can read & cite all the academic papers I want - I can even reproduce their text verbatim, as long as I properly cite them.