Hacker News new | ask | show | jobs
by andrepd 63 days ago
CoT is basically bullshit, entirely confabulated and not related to any "thought process"...
2 comments

But still CoT distillation WORKS. See the DeepSeek R1 paper.
Tokens relate to each other. More tokens more compute