Hacker News
by laurent_du 472 days ago
There's a very simple math question I have asked every "thinking" model, and every one of them not only failed to solve it but also gave me logically incorrect answers and tried to gaslight me into accepting them as correct. QwQ spent a lot of time in a loop, repeating the same arguments over and over without getting anywhere, but eventually it found a correct argument and solved it.

So, as far as I am concerned, this model is smarter than o1, at least in this instance.