|
|
|
|
|
by seanhunter
77 days ago
|
|
I don't know what you mean by that. We know what's going on under the hood always: linear algebra, the attention mechanism etc. To my first approximation all "Chain of thought" means is that instead of having to prompt the model to discuss everything in text and then decide at the end[1], now it sort of automatically does that so you don't need to prompt it. [1] Which used to bring about very substantial improvements in performance on some tasks |
|
You can easily see this for yourself by carefully walking through a given trace with a critical eye. Here's an example from myself a few days ago. https://news.ycombinator.com/item?id=47623324