Hacker News new | ask | show | jobs
by RandomLensman 985 days ago
Get it to explain something, lets say dynamic hedging for derivatives, and then ask it to explain how to exactly hedge something specific. Or describe some physical situation with a quirk and then let to derive the implications. Someone on HN had an example of asking to imagine entropy working in reverse in a cup of coffee with sugar dissolved. While it discussed sugar spontaneously forming crystals and other things, it never considered what the water would do, for example, let alone bigger issues such as if even the existence of water works etc.

Again, humans are often poor at these things, too, but if it had "mechanized" reasoning capabilities instead of "replicative" ones (i.e., just repeating stuff), I would expect it to do generally better.

2 comments

Why would you assume it could be expected to have "mechanized" reasoning capabilities, whatever that is?

I find these questions generally poor at gauging anything when people haven't given them to a representative sample of people first as a benchmark. Consider that not long ago there was a tedious trend of people posting "difficult" questions of orders of operations involving basic arithmetic, and a significant proportion of people in the threads would continue to belabour and argue for the wrong result even after having been told in excruciating detail how to apply the rules. In other words: I think people here tend to massively overestimate the reasoning ability of the average person.

E.g. to the example questions here, I'd bet the average person can't give a satisfactory definition of entropy, much less be able to tell what it does "forwards" before even considering "reverse". So why would we treat this as a benchmark of whether or not an LLM can reason?

Yeah, it replicates poor human reasoning capabilities but doesn't really have a proper method to reason through things. The later is what I expect from a true machine intelligence.

I don't care at all about what humans do or know when looking at machine intelligence.

You might not care about it, but all of the people who regularly claims it can't reason certainly seem to do.

Defining "true machine intelligence" without referencing the only intelligence most people would agree is "true" intelligence seems like a bizarre attempt at setting the bar unreasonably high, and defining "replicating poor human reasoning capabilities" to me is an admission from you that they do reason whether or not you think their ability to do so is "proper".

Replication someone else's reasoning isn't reasoning. Otherwise, a book would "reason".

And, yes, most humans fail to reason properly a lot of the time. Any simple probability puzzle shows that.

> Replication someone else's reasoning isn't reasoning. Otherwise, a book would "reason".

This isn't reasoning. This is a meaningless platitude. Matching the level of reasoning of someone else would inherently be reasoning. Replicating the level of reasoning would be.

> And, yes, most humans fail to reason properly a lot of the time. Any simple probability puzzle shows that.

That is an argument for lowering the bar for assessing whether an entity has the ability to reason, not raising it. Using this as an argument to me is another illustration of poor reasoning. Should I argue that you don't have the ability to reason because I don't think this meets the bar of proper reasoning?

Ad hominems always signal the end of a discussion. Good luck!
Reasoning and intelligence aren’t binary yes/no things, it can have some capability for reasoning and intelligence while still being below what a human can do
That is a point more orthogonal to mine. I am saying it has no general "method" to go from concept to application. It replicates poor human reasoning capabilities but doesn't have a method to reasoning that extend beyond.
That’s mostly true for LLMs due to their training goal but not for all kinds of machine learning