Y
Hacker News
new
|
ask
|
show
|
jobs
by
natnat
255 days ago
I think this has a lot to do with the mechanism of RoPE attention, where physical closeness in the code is a signal of relevance.