Hacker News
by rbranson 187 days ago
Bricken isn’t just making this up. He’s one of the leading researchers in model interpretability. See: https://arxiv.org/abs/2411.14257