Hacker News new | ask | show | jobs
by rumblefrog 558 days ago
I generally find o1, or the previous o1-preview to perform better than Claude 3.5 Sonnet in complex reasonings, new Sonnet is more on-par with o1-mini in my experience.

Would expect o1-pro to perform even better.