Hacker News new | ask | show | jobs
by jwells89 671 days ago
It has to do with what the resulting model is used for. It gets particularly dodgy if its commercial usage, because most if not all of the data used for training wasn’t licensed for that, making for a “laundering” effect.

Though I also think there’s an argument to be made that images need to be properly licensed to even be “analyzed” in this way, because it’s ultimately an unauthorized copy even if it involves picking the image apart and obfuscation. They were published with the intent of being viewed by the public, not for being reproduced in any shape or form.