|
This is what I got on ChatGPT today. I assume it used GPT4: Prompt ChatGPT Actual Match
397,356 * 930,547 369,685,207,932 369,758,433,732 FALSE
36,330 * 26,951 979,458,630 979,129,830 FALSE
8,681 * 9,330 80,911,430 80,993,730 FALSE
278 * 903 250,734 251,034 FALSE
82 * 77 6,314 6,314 TRUE
Edit: # of correct digits (counting from leftmost) only exceeds 3 on the smallest pair. It drops to two, as well, on the 3x3 set. |
I have a theory that it does arithmetic badly because the logic goes right-to-left, when LLMs write left-to-right. If the digits were to be reversed, it might not make as many mistakes. I ran out of attempts before I could test this properly.