by mhartz 979 days ago
Thanks, so does that mean position within the buffer is irrelevant?

It does feel that way. Position seems to lose its meaning as the training process crunches more and more data; eventually it feels like all that's left is a context of the past 4 tokens.