Hacker News
Thinking Sparks: Emergent Attention Heads in Reasoning Models (arxiv.org)
1 point by diwank 235 days ago