Y
Hacker News
new
|
ask
|
show
|
jobs
by
rumblefrog
558 days ago
I generally find o1, or the previous o1-preview to perform better than Claude 3.5 Sonnet in complex reasonings, new Sonnet is more on-par with o1-mini in my experience.
Would expect o1-pro to perform even better.