Y
Hacker News
new
|
ask
|
show
|
jobs
by
nivekkevin
979 days ago
it's a rolling buffer, so it just upsert index % 4 in this case
1 comments
mhartz
979 days ago
Thanks, so does that mean position within the buffer is irrelevant?
link
nivekkevin
979 days ago
it does feel like so, the position eventually loses its meaning as more and more data gets crunched by the training process, eventually it's just a context of the past 4 tokens it feels like
link