Hacker News new | ask | show | jobs
Training and Evaluating LLMs as General-Purpose Activation Explainers (alignment.anthropic.com)
1 points by not4uffin 185 days ago