Hacker News
Why do LLMs attend to the first token? (arxiv.org)
2 points by adhi01 393 days ago
1 comment

Curious if the authors had a chance to look at the Softpick paper? https://arxiv.org/abs/2504.20966