Y
Hacker News
new
|
ask
|
show
|
jobs
by
hansonw
842 days ago
Indeed:
https://arxiv.org/pdf/2402.01032.pdf
Perhaps future iterations of SSMs will accommodate dynamically sized (but still non-linearly-growing) hidden states / memories!