Hacker News new | ask | show | jobs
by lossolo 660 days ago
> When an LLM solves a novel problem, it's also reasoning, unless you use some contrived definition of the word "reasoning" that doesn't match how the word is actually used in normal conversation. Also I fully expect the human brain to be encoded in a mathematical model.

Depends on what your definition of a novel problem is. If it's some variation of a problem that has already been seen in some form in the training data, then yes. But if you mean a truly novel problem—one that humans haven't solved in any form before (like the Millennium Problems, a cancer cure, new physics theories, etc.)—then no, LLMs haven't solved a single problem.

> And if it wasn't obvious, an LLM can string together two words that it had never seen together in the training dataset, it really shows how people tend to simplify the extremely complex dynamics by which these models operate.

Well, for anyone who knows how latent space and attention work in transformer models, it's pretty obvious that they can be used together. But I guess for someone who doesn't know the internals, this could seem like magic or reasoning.

1 comments

>then no, LLMs haven't solved a single problem.

Using your definition of a novel problem, do most people solve novel problems? If so, give me an example of a novel problem you have solved.

Sure, I did—a lot of them. These are the ones that were not in my training dataset in any form before I solved them, as it's impossible for a human to hold all scientific papers, historical facts, and in general, the entirety of human knowledge and experiences from the entire internet in their brain.
you haven't given me a concrete example.