by CamperBob2
888 days ago
As soon as ChatGPT got released, I tried to make it solve IMO-style problems. It failed. Have you tried the same questions with ChatGPT 4? It is a transformational change (no pun intended) over the earlier releases, and over all open-source models. Just today, I needed to interpret some awkwardly-timestamped log data. I asked it a few questions along the lines of "What time was it 10,000 seconds before xx:yy:zz?" It didn't give me the answer directly, but it wrote and executed a Python program that did.
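
For illustration, here is a minimal sketch of the kind of program that answers a question like that. It is not the code ChatGPT actually produced; the function name `time_before` and the sample timestamp are assumptions, and it treats the input as a bare clock time that wraps around midnight.

```python
from datetime import datetime, timedelta


def time_before(clock: str, seconds: int) -> str:
    """Return the HH:MM:SS clock time `seconds` before `clock`.

    Hypothetical helper: the date is ignored, so results wrap past midnight.
    """
    # Parse the clock time and pin it to an arbitrary date so that
    # subtracting across midnight is well defined.
    t = datetime.strptime(clock, "%H:%M:%S").replace(year=2000)
    return (t - timedelta(seconds=seconds)).strftime("%H:%M:%S")


# 10,000 seconds is 2 h 46 min 40 s.
print(time_before("12:00:00", 10_000))  # prints 09:13:20
```

If the log lines also carry a date, parsing the full timestamp instead of pinning an arbitrary date would keep track of the day change.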
> When producing full natural-language proofs on IMO-AG-30, however, GPT-4 has a success rate of 0% ...