|
|
|
|
|
by Eisenstein
50 days ago
|
|
> Right, and then look at any number of research papers showing that CoT output has limited impact on the end result. Which research papers? Do I have to find them? > We've trained these models to pretend to reason. I have no idea why that matters. Can you tell me what the difference is if it looks exactly the same and has the same result? |
|
https://arxiv.org/html/2506.02878v1
https://arxiv.org/pdf/2508.01191
Anthropic themselves: https://www.anthropic.com/research/reasoning-models-dont-say...
They were approaching this from an interpretability standpoint, but the more interesting finding in there is that models come up with an answer that fits their training and context provided. CoT is generated to fit the anticipated answer.
In these studies, there are examples of CoT that directly contradicts the response these models ultimately settle on.
This is not reasoning. This is pretense.