Hacker News
by atleastoptimal 43 days ago
If it's only pretending to reason, then how is it that the CoT output improves performance on every single benchmark/test?