Hacker News
by
andrepd
63 days ago
CoT is basically bullshit, entirely confabulated and not related to any "thought process"...
2 comments
clbrmbr
63 days ago
But still CoT distillation WORKS. See the DeepSeek R1 paper.
whattheheckheck
63 days ago
Tokens relate to each other. More tokens, more compute.