|
Ask a local AI, or a chatbot that lets you disable tool calling, to multiply two large numbers, for example. This is what Mistral outputs: "The result of multiplying 63,157,997,633 by 63,114,90,009 is: 3,965,689,999,999,999,999,999 (approximately 3.966 × 10²⁴)." That's something like 5 orders of magnitude off, the scientific notation doesn't even match the full integer, and the mantissa is also slightly wrong. |
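For anyone who wants to check an answer like this themselves, here is a rough Python sketch. It only tests whether the quoted integer and the quoted scientific notation agree with each other. The helper name `order_of_magnitude` is made up for illustration, and the two values are copied from the Mistral output above.

```python
def order_of_magnitude(n: int) -> int:
    """Base-10 exponent of a nonzero integer, i.e. its digit count minus one."""
    return len(str(abs(n))) - 1


# Integer exactly as quoted in the model output, with thousands separators stripped.
quoted_integer = int("3,965,689,999,999,999,999,999".replace(",", ""))

# Exponent from the "3.966 × 10²⁴" part of the same output.
claimed_exponent = 24

actual_exponent = order_of_magnitude(quoted_integer)
print(actual_exponent)                     # exponent implied by the written-out integer
print(claimed_exponent - actual_exponent)  # gap between the two forms of the same answer
```

Python integers are arbitrary precision, so the same approach extends to checking a full product exactly with `a * b == claimed`, once you have both operands written out cleanly.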
GPT-5 Pro without tools can easily solve your question, and much harder ones.
A better way to test this claim is to ask whether any model exists that can perform these calculations reliably.
Otherwise, we can always pick the worst 1B-parameter model to falsify any claim made about LLMs.