|
|
|
|
|
by _heimdall
466 days ago
|
|
> We literally know exactly what is going on with every layer. Unless I missed a huge break in the observability problem, this isn't correct. We know exactly how every layer is designed and we know how we functionally expect that to work. We don't know what actually happens in the model at time of inference. I.e. we know what pieces were used to build the thing but when we actually use it its a black box - we only know inputs and outputs. |
|
How could we not know? Every processor instruction is observable.
What we specifically don’t have a good view is the causal relationship between input tokens, a model’s weights, and the output.
We don’t know specifically what weights matter or why.
That’s very different than not understanding what processes are taking place.