|
|
|
|
|
by Kuinox
302 days ago
|
|
Most of the failures for theses simple logic question come from the inability to simply copy data accuratly.
Logic is too abstract to be measured, but this single bench show something getting in it's way.
I got another bench that show that the LLMs do basic mistakes that can be easily avoided with minimum logic and observation. |
|