Hacker News new | ask | show | jobs
by logicchains 768 days ago
To clarify, is the sLSTM strictly necessary (to achieve better accuracy than those other architectures), or is the mLSTM good enough? The [1/0] model in the paper seemed to do quite well.
1 comments

For language in general it seems fine. But there might be specific tasks where it is necessary indeed.