| > For the first answer I got zero results on Google, so it's quite unlikely that it was part of the training data Sure, not literally part of the training data. Statistical AI operates in a transformed space derived from the training data, points in that space will not, in general, exist in the original. So imagine generating 1000 circles and putting their radii on a line: 0.1, 0.2, 0.3, ... The circles are the training data, and the "implied line" is the transformed space. Now, AI here is capable of generating a circle with radius 0.15 and hence that circle is "not in the original dataset". This type of "novelty" isn't what I'm concerned with; generative AI must have that or else it'd be entirely useless -- only a google search. Rather i'm talking about, for example, whether without "Rust" in its training data it could develop "Rust" from everything else. Is there enough data on lifetimes/borrowing/etc. research in pdfs that it's scanned to somehow "find a midpoint between those pdfs and C++". It seems a bit mad to suppose so -- but I could be wrong, such a midpoint does exist --- but i'm extremely doubtful we humans have been so helpful as to write the 1000s of academic PDFs needed for this system to find it. The novelty I'm talking about is dimensions in the transformed space. The system cannot derive "additional ways to move" without the source data actually containing those ways. This is, roughly, equivalent to saying that it's biased towards the on-average ways we have conceptualised our problems as represented by the on-average distribution of academic articles, github repos, webpages, etc. *that we happened to have created*. This is a serious "intellectually conservative" bias. For sure it can find circles it hasnt seen; but could it find spheres from circles alone? No. |
Or working in the opposite direction: we can think of AIs as processing concepts in some dimensional space, sure. But we have no conception at all of what that space is like, so there's no reason to expect that a midpoint in that space between two objects we're familiar with would also be familiar to us. I mean, I have no idea what the midpoint between Rust and C++ is, or how I'd go about describing it. Surely an AI that thinks in tensors is more capable than we are to explore the space between known concepts, so why couldn't we expect to learn something novel from one?