| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by raincole 94 days ago

AI's mistakes are sometimes so subtle.

Just yesterday I asked Gemini Pro 3.0 this question:

> Find such colors A and B:

> A and B are both valid sRGB color.

> Interpolating between them in CIELAB space like this

> C_cielab = (A_cielab + B_cielab) / 2

> results in a color C that can't be represented in sRGB

It gave me a correct answer, great!

...and then it proceeded to tell me to use Oklab, claiming it doesn't have this problem because the sRGB gamut is convex in Oklab.

If I didn't know Oklab does have the exact same problem I would have been fooled. It just sounds too reasonable.

1 comments

zozbot234 94 days ago

You can sometime run a quick second check by taking the AI's claim and asking it for an evaluation within a fresh context. It won't be misled by the surrounding text and its answer will at least be somewhat unbiased (though it might still be quite wrong).

It helps if you phrase the question openly, not obviously fishing for a yes-or-no answer. Or, if you have to ask for a yes-or-no question, make it sound like you're obviously expecting the answer that's actually less likely, so the AI will (1) either be more willing to argue against it, or (2) provide good arguments for it you might not have considered, because it "knows" the answer is unexpected and it wants to flatter your judgment.

link

GCUMstlyHarmls 94 days ago

> It helps if you phrase the question openly, not obviously fishing for a yes-or-no answer. Or, if you have to ask for a yes-or-no question, make it sound like you're obviously expecting the answer that's actually less likely,

I do this all the time and hate that I have to do it, with the additional "do not yes-man me, be critical."

link

MadxX79 94 days ago

Great, now I have two answers and still no clue which one is the right one.

link

nxpnsv 94 days ago

Now get a third opinion, and marvel at all the thinking that you have accomplished

link

MadxX79 94 days ago

In my experience the last answer it gives is usually the right one

link

nxpnsv 94 days ago

Ah, so the trick is to figure out which one will be the last answer. The halting problem....

link

paulhebert 93 days ago

It’s always the last place you look

link

signatoremo 94 days ago

Any difference that asking two humans?

link

MadxX79 94 days ago

Yeah, enormously. People will hedge depending on how sure they are about something. They might also have credentials in whatever you ask them, if you get legal advice from a lawyer, that can be judged to be more reliable than from a lay person.

Relationships with real people are pretty cool actually. If you talk to people that you have a longer relationship with, you might also be able to judge their areas of expertise and how prone to bullshitting they are.

link