| Thanks for the feedback! I'm one of the authors. I just wanted to make sure you noticed that this is linking to an accessible blog post that's trying to communicate a research result to a non-technical audience? The actual research result is covered in two papers which you can find here: - Methods paper: https://transformer-circuits.pub/2025/attribution-graphs/met... - Paper applying this method to case studies in Claude 3.5 Haiku: https://transformer-circuits.pub/2025/attribution-graphs/bio... These papers are jointly 150 pages and are quite technically dense, so it's very understandable that most commenters here are focusing on the non-technical blog post. But I just wanted to make sure that you were aware of the papers, given your feedback. |
Considering the statements made in the reply, the onus of clarifying the article's assertions, as they pertain to anthropomorphizing an algorithm (a.k.a. stating it "thinks"), is on the author(s).