Hacker News new | ask | show | jobs
When Models Examine Themselves: Vocabulary-Activation Correspondence (arxiv.org)
1 points by tcbrah 94 days ago