|
|
|
|
|
by simianwords
118 days ago
|
|
I don't think finding the worst model and asking it to do things is the best way to go. GPT-5 pro without tools can easily solve your question and much harder ones. Rather: does there exist a model that can perform these calculations reliably is a better way to falsify this claim. Else we can always find the worst model with 1B parameters to falsify any claim made on LLM's. |
|
The only way to be sure a model calls no tool is to run it locally and control the network.