|
|
|
|
|
by levocardia
491 days ago
|
|
It would be much more interesting to see PCA (or t-SNE or whatever) on the internal representation within the model itself. As in the activations of a certain number of layers or neurons, as they change from token to token. I don't think the OpenAI embeddings are necessarily an appropriate "map" of the model's internal thoughts. I suppose that raises another questions: Do LLMs "think" in language? Or do they think in a more abstract space, then translate it to language later? My money is on the latter. |
|
The processing happens in latent space and then is converted to tokens/token space. There is research into reasoning models which can spend extra compute in latent space instead of in token space: https://arxiv.org/abs/2412.06769