Hacker News new | ask | show | jobs
by daavidhauser 747 days ago
xLSTM has a working memory and seems to outperform transformer architectures: https://arxiv.org/abs/2405.04517
1 comments

Thanks for that. It looks like the kind of thing I'm looking for. I'll give it a read.