|
|
|
|
|
by famouswaffles
537 days ago
|
|
>We know why these models work. We know precisely how, physically, they come to their conclusions (it’s just processor instructions as with all software) I don't know why you would classify this as knowing much of anything. Processor instructions ? Really? If the average user is given unfettered access to the entire source code of his/her favorite app, does he suddenly understand it ? That seems like a ridiculous assertion. In reality, it's even worse. We can't pinpoint what weights, how and in what ways and instances are contributing exactly to basic things like whether a word should be preceded by 'the' or 'a' and it only gets more intractable as models get bigger and bigger. Sure, you could probably say we understand these NNs better than brains but it's not by much at all. |
|
And one that I didn’t make.
I don’t think when we say “we understand” we’re talking about your average Joe.
I mean “we” as in all of human knowledge.
> We can't pinpoint what weights, how and in what ways and instances are contributing exactly to basic things like whether a word should be preceded by 'the' or 'a' and it only gets more intractable as models get bigger and bigger.
There is research coming out on this subject. I read a paper recently about how llama’s weights seemed to be grouped by concept like “president” or “actors.”
But just the fact that we know that information encoded in weights affects outcomes and we know the underlying mechanisms involved in the creation of those weights and the execution of the model shows that we know much more about how they work than an organic brain.
The whole organic brain thing is kind of a tangent anyway.
My point is that it’s not correct to say that we don’t know how these systems work. We do. It’s not voodoo.
We just don’t have a high level understanding of the form in which information is encoded in the weights of any given model.