Hacker News new | ask | show | jobs
by naasking 758 days ago
That's simply not possible. Human understanding is still unreliable, even for geniuses.
3 comments

That’s why I am asking a computer.
I'm with you. I get that this is akin to asking a human, because we're trying to reason, so we will bring along (assumedly) unavoidable deficiencies of human reasoning. But if I were to ask a human genius this question, ne would grab a calculator and employ it as ne did the rest of ner reasoning.

So it seems like we should probably teach LLMs to "use a calculator", rather than try to get them to be more right when doing math 'in their head'.

Indeed, "use a calculator" is "just a trick"!
Solving that will be a much bigger deal but it's at odds with producing a highly accurate emulation of human thought and language. Language models can serve as tools to understand and experiment with logic formulated as natural language but it isn't their primary purpose. What you're asking is equivalent to creating an auditable trace of everything that goes into making a statement which is pretty much impossible even for the person making a statement. We can get close by limiting ourselves to narrow domains like mathematics but even then someone can come along and question the premises on which we construct such a system. I'm not saying it isn't worth pursuing, it just isn't the standard that we should hold a model to when we ourselves are incapable of it. The goal here is to create a system capable of doing the things that a human can do. If you prefer to have a system that behaves within the confines of a mathematical formalism with well defined rules then build that model instead.
It's entirely possible. Don't use LLMs for math. Use the computers we already have that have been capable of doing math accurately for a century. Right tool, right job.
OP said they didn't want tricks from their LLM. Using a calculator, like we do, is technically a trick.
My calculator manages
Your calculator is deterministic. Humans and AI are not.
LLMs are deterministic. We just sample the results, no? Also, no reason AI needs not be deterministic.
> LLMs are deterministic.

In theory, yes. In practice, parallelism combined with floating point math make current implementations fundamentally non-deterministic.

Not fundamentally, but extremely costly to keep track of everything. Still deterministic though.
Can you elaborate more on the parallelism aspect?
Temperature cannot ever reach 0 (this causes a division error), so they are not deterministic.
Exactly
The point is don't ask an LLM to do tasks that a calculator can do. Ask if to use the calculator, just like most humans would.
The point is that you shouldn't need to ask. If it's actually a general-purpose system it will do it automatically.
An LLM in isolation is not a general purpose system, but with ChatGPT at least, most of the time you don't need to ask. In fact, it's increasingly difficult to force it to do "manual" maths, as it's strongly predisposed to do things like write and evaluate Python code instead of doing it "manually".

E.g. I verified just now and asked it to multiply two huge numbers, and it immediately spat out Python code, then evaluated it and gave me the result, rather than try to add the numbers itself.

It still needs to be trained on what calculators are, on how to use them, and on when it is appropriate to do so. Again, same as humans.
I know. That was my point.