|
|
|
|
|
by ta8645
496 days ago
|
|
> no models of reality involved. It literally has a mathematical model that maps what would, colloquially at least, be known as reality. What exactly do you think those math pipelines represent? They're not arbitrary numbers; they are generated from actual data that is generated by reality. There's no anthropomorphizing at all. |
|
a corpus of training data from the internet is finite.
any finite number divided by infinity ends up tending towards zero.
so, mathematically at least, the training data is not a sufficient sample of reality because the proportion of reality being sampled is basically always zero!
fun with maths ;)
> What exactly do you think those math pipelines represent?
probability distributions of human language, in the case of text only LLMs.
which is a very small subset of stuff in reality.
-
also, training data scraped from the public internet is a woeful representation of “reality” if you ask me.
that’s why LLMs i think are bullshit machines. the systems are built on other people’s bullshit posted on the public internet. we get bullshit out because we made a bunch of bullshit. it’s just a feedback loop.
(some of the training data is not bullshit. but there is a lot of bullshit in there).