| > If you've not looked at it I really recommend othello gpt. I skimmed it and read the conclusion, and it looks interesting, will take a closer look when I have time. The prompt covers a subject that goes completely over my head so I can't tell how well it reasons. I don't know what 1 proton to 100 neutrons means, but I gather it's radioactive. I don't think it's far fetched that it draws the same conclusion from the training set because to you it seems obvious, and is probably well known to anyone who knows the subject. Kind of like it would understand that "hotter than the sun" is super hot, can correlate to different melting points. But I wouldn't say it understands the concept of temperature. Given the right prompt it might give you the impression it does. The feelings of the scenario reads like any PR comment after a tragedy. "We feel shock and disbelief" and so on. The scenario being hypothetical doesn't change that since it's probabilities. It acts just like you'd think it would. The earlier example with the helium balloon is similar, it assumes a human context and not the form, and environment the helium is in. True intelligence might not even consider the presence of atmosphere as the norm. "It has no weight outside of your human constraints" would be novel. Lets say it has odd numbers between 1-9 in the database. Given the prompt 2 and 8 you will get back 1,3,7,9, sprinkled with some natural language and we get the impression it's intelligent. Are you saying it understands the effect the neutron to proton ratio has, as opposed to just comparing the vectors closest to your prompt that it builds the answer from? Being tested on new and hypothetical examples only means it will be further from the vectors but still close enough to give us the impression it understands the subject. If the training data didn't include the words neutron or proton it would have no idea where to begin. In my first comment that started this chain I said: > I don't see why not. It's not taking a single answer from a database no, it's taking several based on probability and merging them into what it thinks we're looking for. I don't think even this latest answer is any proof of anything other than that. Are you claiming there is? And what are you claiming is happening? |
I'm claiming that it's reasoning through the problem.
> True intelligence might not even consider the presence of atmosphere as the norm
I hugely disagree, that's not how an intelligent human would answer the question, and if it did this people would be complaining that it clearly doesn't understand the human context in which the question was likely asked.
> I don't know what 1 proton to 100 neutrons means, but I gather it's radioactive.
It would be hydrogen, but a type of hydrogen that doesn't appear in nature. It's a deliberately absurd example so that it's not in the training set. Its answers are different if the question involves tritium which while radioactive has a moderate half-life and wouldn't immediately pop the balloon.
> I don't think it's far fetched that it draws the same conclusion from the training set because to you it seems obvious,
Only because I can reason through what would happen, not because it's something I've seen talked about before.
To figure out what it would do, it cannot rely on an explanation elsewhere, it needs to first identify that the ratio of protons to neutrons is extreme. Then it needs to understand that this typically results in particular kinds of radiation.
It has to then use that information to consider how that would interact with the material of the balloon (and that this is important).
It has to use that information to consider how it would affect people, and what their reactions would be both before and after it explodes/pops.
This is multi-step reasoning through an issue that involves pulling together common expectations, physics and how humans react.
Here's a statement in it that shows to me more than just pulling a few answers together
> Balloon Behavior: Instead of floating up like a helium-filled balloon, this balloon would drop to the ground because the gas inside is denser than air. This might surprise the attendees, and curious children might approach or pick up the balloon, further exposing themselves to radiation.
-
> The feelings of the scenario reads like any PR comment after a tragedy. "We feel shock and disbelief" and so on.
Those are typical things, which is not surprising, but it is also clearly linked with the question. You have to understand how out of context this would be.
> If the training data didn't include the words neutron or proton it would have no idea where to begin.
Fully rediscovering what took humans many years to do off-the-cuff is an outrageously high bar.
What features of a question would you look for to identify whether it's "taking several answers from a database and merging them together" or performing some reasoning? I've asked a few times but don't understand what you're expecting.