|
|
|
|
|
by grey-area
74 days ago
|
|
So like many of the promises from AI companies, reported chain of thought is not actually true (see results below). I suppose this is unsurprising given how they function. Is chain of thought even added to the context or is it extraneous babble providing a plausible post-hoc justification? People certainly seem to treat it as it is presented, as a series of logical steps leading to an answer. ‘After checking that the models really did use the hints to aid in their answers, we tested how often they mentioned them in their Chain-of-Thought. The overall answer: not often. On average across all the different hint types, Claude 3.7 Sonnet mentioned the hint 25% of the time, and DeepSeek R1 mentioned it 39% of the time. A substantial majority of answers, then, were unfaithful.‘ |
|