Hacker News new | ask | show | jobs
by s314 32 days ago
Using a logit lens (prior art: https://www.lesswrong.com/posts/AcKRB8wDpdaN6v6ru/interpreti...)