| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by cglan 1178 days ago

Not exactly seeing great performance with this model. It gets tripped up by ridiculously silly questions.

For example "if you have a spoon, a knife, a fork, a carrot and a calculator, which one would you use for math?"

> None. The question is grammatically incorrect as it implies that the person has all of those items in their possession at once

"The bus was going so fast it passed the racecar". What vehicle was going the fastest?"

> It's impossible to tell which one went faster, as they were both traveling at high speeds

2 comments

pumanoir 1178 days ago

ChatGPT and Claude also get tripped by both questions

link

refulgentis 1178 days ago

I think Claude is correct on the merits (Claude’s awesome in general)

Without more context about the speeds of the bus and racecar, I cannot determine which vehicle was going fastest based on the given statement. Simply saying that the bus passed the racecar does not provide enough information to compare their speeds.

link

CuriouslyC 1177 days ago

This is true, the question isn't inconsistent with the bus and the racecar travelling in opposite directions, it's just implied that they're going in the same direction.

link

cubefox 1178 days ago

GPT-4 (Bing) solves them correctly.

link

andriym 1178 days ago

yeah, gonna need a few more M's of data work for that unfortunately.

link