| o3 has been the worst of the three new models for me. Ask it to create a TypeScript server-side hello world. It produces a JS example. Telling it that's incorrect (but giving no more detail) results in it iterating through all sorts of mistakes. In 20 iterations it never once asked me what was incorrect. In contrast, o4-mini asked me after 5, and o4-mini-high asked me after 1, but narrowed the question to "is it incorrect due to choice of runtime?" rather than "what's incorrect?" I told it to "ask the right question" based on my statement ("it is incorrect") and it correctly asked "what is wrong with it?" before I pointed out that there were no TypeScript types. This is the critical thinking we need, not just (incorrect) reasoning. |
Well, TS is a strict superset of JS, so it's technically correct (which is the best kind of correct) to produce JS when asked for a TS version. So you're the one who's wrong.