Y
Hacker News
new
|
ask
|
show
|
jobs
by
daavidhauser
747 days ago
xLSTM has a working memory and seems to outperform transformer architectures:
https://arxiv.org/abs/2405.04517
1 comments
jawon
746 days ago
Thanks for that. It looks like the kind of thing I'm looking for. I'll give it a read.
link