Hacker News new | ask | show | jobs
Pretraining Without Attention (arxiv.org)
2 points by SongofEarth 776 days ago