Hacker News new | ask | show | jobs
by theGnuMe 940 days ago
There's another paper replacing attention with FF networks so just combine the two and you've got something.
1 comments

Link? Sounds like a good read! :)
Not op but might be this: https://arxiv.org/pdf/2311.10642.pdf