Hacker News new | ask | show | jobs
by millimeterman 1168 days ago
My uninformed speculation is that this might be an artifact of the embedding layer (e.g. CLIP), not the image training data.

Presumably, the dataset for training the embedding layer (which is trained separately and then fixed) is not stripped of copyrighted content. So it will have learned that "Pikachu" is related to words such as "yellow" and "rat".

Therefore, even if the image dataset didn't have a single picture of Pikachu, the image generator will still likely produce a yellow rat. Just one that doesn't actually resemble Pikachu.

1 comments

That seems plausible to me. I assume if Adobe felt like even these results were too legally risky they could create a new embedding layer with fewer copyrighted concepts (so Deadpool would always output a spooky looking pool). I don't think that's likely though.