by bongodongobob 517 days ago
I played it with 3.5 and it was great. This isn't something o1 just picked up on.
1 comment

Yep, agreed. One big part of the experiment was to see how well it reasons by asking it to output its reasoning.