Hacker News
by margalabargala 563 days ago
Nah, for complex problems maybe, but not for single-digit addition, which should be in the training corpus directly.

Regardless, the intention here is to highlight a difference between Gemini and ChatGPT/Claude, neither of which will agree to simple math errors.