|
|
|
|
|
by toss1
1249 days ago
|
|
Yup. What seems to be largely missed is that these models have zero understanding, and are actually destroyers of information, not creators. In classic Information Theory, information is basically surprise value — how much unexpected info is in the message? — yet these "AI" systems put out the most expected subset in each instance. This highly averaged output is very recognizable and so very striking, but it is not actually very informative (perhaps except in cases where it is specifically used as a verbose search engine, where the query takes advantage of the breadth of the AI's training). |
|
Forgive me, but isn't this kind of moving-the-goalposts? Information is the surprise value from the recipient's point of view, which meas the recipient's bayesian prior probability is "expected". Saying "these "AI" systems put out the most expected subset in each instance" assumes that the recipient's priors exactly equal those of the model which would only be the case when the model is talking to itself. (or I suppose to an even more complex model with perfect knowledge of ChatGPT's weights)
The fact that no information is transferred when the model talks to itself should not be surprising and would apply to any AI. (even including a superhuman post-singularity god-like AI)