Hacker News new | ask | show | jobs
Self-attention Does Not Need O(n^2) Memory (arxiv.org)
3 points by latentdeepspace 1648 days ago