Hacker News new | ask | show | jobs
by nprateem 757 days ago
But I don't want tricks. I want to know that it knows so I don't have to continually guess whether it's right or not.
2 comments

That's simply not possible. Human understanding is still unreliable, even for geniuses.
That’s why I am asking a computer.
I'm with you. I get that this is akin to asking a human, because we're trying to reason, so we will bring along (assumedly) unavoidable deficiencies of human reasoning. But if I were to ask a human genius this question, ne would grab a calculator and employ it as ne did the rest of ner reasoning.

So it seems like we should probably teach LLMs to "use a calculator", rather than try to get them to be more right when doing math 'in their head'.

Indeed, "use a calculator" is "just a trick"!
Solving that will be a much bigger deal but it's at odds with producing a highly accurate emulation of human thought and language. Language models can serve as tools to understand and experiment with logic formulated as natural language but it isn't their primary purpose. What you're asking is equivalent to creating an auditable trace of everything that goes into making a statement which is pretty much impossible even for the person making a statement. We can get close by limiting ourselves to narrow domains like mathematics but even then someone can come along and question the premises on which we construct such a system. I'm not saying it isn't worth pursuing, it just isn't the standard that we should hold a model to when we ourselves are incapable of it. The goal here is to create a system capable of doing the things that a human can do. If you prefer to have a system that behaves within the confines of a mathematical formalism with well defined rules then build that model instead.
It's entirely possible. Don't use LLMs for math. Use the computers we already have that have been capable of doing math accurately for a century. Right tool, right job.
OP said they didn't want tricks from their LLM. Using a calculator, like we do, is technically a trick.
My calculator manages
Your calculator is deterministic. Humans and AI are not.
LLMs are deterministic. We just sample the results, no? Also, no reason AI needs not be deterministic.
> LLMs are deterministic.

In theory, yes. In practice, parallelism combined with floating point math make current implementations fundamentally non-deterministic.

Not fundamentally, but extremely costly to keep track of everything. Still deterministic though.
Can you elaborate more on the parallelism aspect?
Temperature cannot ever reach 0 (this causes a division error), so they are not deterministic.
Exactly
The point is don't ask an LLM to do tasks that a calculator can do. Ask if to use the calculator, just like most humans would.
The point is that you shouldn't need to ask. If it's actually a general-purpose system it will do it automatically.
I know. That was my point.
A basic transformer architecture performs only a bounded amount of computation per generated token, so it can never emulate a machine computing sufficiently hard problems.
Yes, because it's feed forward. It must have loops to be a Turing machine.
It does. The output is fed back in.
It indeed does, but it must generate a token per loop, and can thereby solve some linearly complex problems, but it cannot solve harder problems.