by dartos
466 days ago
> while we literally watch reasoning models say things like "oh that's not right, let me try a different approach".

Not saying I disagree with your premise that errors can’t be corrected by using more and more tokens, but this argument is weird to me.

The model isn’t intentionally generating text. The kinds of “oh let me try a different approach” lines I see are often followed by the same approach it just took. I wouldn’t say most of the time, but often enough that I notice.

Just because a model generates text doesn’t mean that the text actually represents anything at all, let alone a reflection of an internal process.
What does it represent then? What are all these billions of weights for? It's not a bag full of NULLs that just pulls next words from a look-up table. Obviously there is some kind of internal process.
Also, I don't get why people ignore the temporal aspect. Humans too generate thoughts in sequence, and can't arbitrarily mutate what came before. Time and memory are what force sequential order: we too just keep piling on more thoughts to correct previous thoughts while they are still in working memory (context).