|
|
|
|
|
by ben_w
197 days ago
|
|
> Out of interest, was your intended answer "where you started, facing east"? Or anything close to it so long as the logic is right, yes. I care about the reasoning failure, not the small difference between the exact quarter-circumferences of these great circles and 10,000km; (Not that it really matters, but now you've said the answer, this test becomes even less reliable than it already was). > FWIW, Claude Opus 4.5 gets this right for me, assuming that is the intended answer. Like I said, now the best ones sometimes don't [always get it wrong]. For me yesterday, Claude (albeit Sonnet 4.5, because my testing is cheap) avoided the south pole issue, but then got the third leg wrong and ended up at the north pole. A while back ChatGPT 5 (I looked the result up) got the answer right, yesterday GPT-5-thinking-mini (auto-selected by the system) got it wrong same way as you report on the south pole but then also got the equator wrong and ended up near the north pole. "Never" to "unreliable success" is still an improvement. |
|