Hacker News new | ask | show | jobs
by natnat 255 days ago
I think this has a lot to do with the mechanism of RoPE attention, where physical closeness in the code is a signal of relevance.