Hacker News new | ask | show | jobs
by alxhill 1169 days ago
This is false and represents a poor understanding of how these models work - they do abstract concepts and no you can't trivially get training images out.
2 comments

it's exactly the argument that the court cases against training on copyrighted works without permission are using

if the courts agree and this is ruled to be infringement then we'll left with products with level of quality we see here

meaning the technology will be an economic dead end

here's to hoping

Copyright is about copying. It is not about observing. Reading a copyrighted book isn't infringement. Writing out copies of it is, even if you don't use a computer or anything.
>they do abstract concepts

I don't think, given that we don't even have a particularly full understanding of how human conceptual logic functions, that we can claim that even AIs are using it as well. It's only "abstracting" in the sense that it has labeled one million objects in its training set with the word "tree," and fuses many of those images together to form a general picture dependent on specific parameters made to limit its set (oak tree, winter tree, etc.)

But that is different from me or you using the word tree, which is just a signifier among signifiers, it stands for nothing but a negation of the very thing it points to in a certain set of symbolic relations. Humans communicate in the order of symbolic structures, our minds function much more like LLMs, creating multitudinous pattern relations. What you call "abstract concepts" are of a secondary order imposed to create rigorous exactitude overtop the riddled mess that we call the human psyche.