Hacker News
by floam 455 days ago
ChatGPT definitely noticed: o1, o3-mini, o3-mini-high.

Maybe 4o will get it wrong? I wouldn’t try it for math.

1 comment

I tried 4.5, which I thought was the best model. Seems like the reasoning models do get it.