by 5555watch
75 days ago
It's not "13 parameters to reason"; they just rotated the full 8B-parameter space in 13 dimensions and found a rotation that was still able to reason. Depending on the latent structure, there may be a nice rotation that is perfect for one specific problem, but you still have to search for it, and it's not guaranteed to exist. Still, it's a nice step towards LLM parameter-space interpretability.
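To make the distinction concrete, here is a toy sketch of what "13 trainable numbers moving every weight" could look like. It assumes the update has the form theta = theta0 + P z with a fixed random matrix P, which is my reading and not something confirmed here; `make_projection`, `apply_subspace_update`, and the sizes are all hypothetical, and 1000 stands in for 8B.

```python
import random

def make_projection(full_dim, sub_dim, seed=0):
    """Fixed random full_dim x sub_dim matrix (assumed form, never trained)."""
    rng = random.Random(seed)
    scale = 1.0 / full_dim ** 0.5
    return [[rng.gauss(0.0, scale) for _ in range(sub_dim)]
            for _ in range(full_dim)]

def apply_subspace_update(theta0, projection, z):
    """Return theta0 + P z: every weight shifts, but only z is free."""
    return [w + sum(p * c for p, c in zip(row, z))
            for w, row in zip(theta0, projection)]

FULL_DIM = 1000  # toy stand-in for the 8B weights
SUB_DIM = 13     # the only trainable coordinates

theta0 = [0.0] * FULL_DIM            # hypothetical pretrained weights
P = make_projection(FULL_DIM, SUB_DIM)
z = [0.1] * SUB_DIM                  # what a search or optimizer would tune

theta = apply_subspace_update(theta0, P, z)
changed = sum(1 for a, b in zip(theta0, theta) if a != b)
print(changed, "of", FULL_DIM, "weights moved using", len(z), "trainable numbers")
```

The model still uses every weight at inference time; the 13 numbers only pick a direction inside the full space, which is why a good direction has to be searched for and may not exist for a given problem.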