|
When you say "predicting facts" you imply "predicting true future events." Delphi is no longer operational, so it simply can't be done. (At least, not past a certain -- very, very low -- complexity threshold in the macroscopic non-quantum world.) "Science" is coming up with, and testing, theories -- they may be true, they may be false, and you can't know, and shouldn't hold a very strong position, until you test them. It's true that a more intelligent person will come up with better hypotheses and more inventive ways to put them to the test, but that's not what you seemed to be talking about, nor are we in any disagreement on that point. A more intelligent cat will also catch mice more effectively -- it'll have a more accurate mental model of the mouse and of its own physical capabilities in time and space. Still, the outcome of the hunt is never perfectly predictable. Some outcomes are statistical -- and, intriguingly, LLMs mirror this in how they predict tokens. > LLMs seem to be dramatically bad at "procedures". How do you figure, and how did you reach this conclusion? |
And Michelson and Morley did through Einstein's theory. And Jack did when he said "if my theory is correct, that falling brick will break my skull more probably than not". And it's a matter in which LLMs tend to fail, when they go "surely your operating system will have a `scratchmyback` command to allow you to work more hours sitting in front of it, it just makes sense".
> How do you figure [that «LLMs seem to be dramatically bad at "procedures"»], and how did you reach this conclusion?
I just tried with a main widespread engine, and it failed. And it showed that it still seemed to be guessing an output instead of actually checking to build the output (as if remembering that very often "2+2=4" instead of checking "1 and 1, and 1 and 1: 1, 2, 3, 4").