by 5555watch 75 days ago
It's not "13 parameters to reason". They just rotated the full 8B parameter space in 13 dimensions and found a rotation that was still able to reason.
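
To make the distinction concrete, here's a toy sketch of how I read it. This is my own illustration, not the paper's actual method: the additive form `theta0 + P @ z`, the random orthonormal directions `P`, and the tiny `D` standing in for 8B are all assumptions. The point is only that 13 trainable numbers can still move every weight in the model.

```python
import numpy as np

def make_reparam(theta0, k=13, seed=0):
    """Return fixed directions P (D x k) and a map from k knobs to all D weights."""
    rng = np.random.default_rng(seed)
    # Hypothetical choice: k random orthonormal directions in the full weight space.
    P, _ = np.linalg.qr(rng.standard_normal((theta0.size, k)))

    def full_params(z):
        # The only trainable quantity is z (k numbers); theta0 and P stay frozen.
        return theta0 + P @ z

    return P, full_params

D = 10_000  # toy stand-in for the 8B-dimensional parameter space
theta0 = np.random.default_rng(1).standard_normal(D)
P, full_params = make_reparam(theta0)

z = np.zeros(13)
z[0] = 0.5                      # nudge a single knob
theta = full_params(z)
changed = int(np.count_nonzero(theta != theta0))
print(f"trainable: {z.size}, weights that moved: {changed} of {D}")
```

So the count of trainable numbers is 13, but the model doing the reasoning is still the full-size one.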

Depending on the latent structure, there may be a nice rotation that is perfect for one specific problem, but you still have to search for it, and it's not guaranteed to exist.

But it's a nice step towards LLM parameter-space interpretability.