Hacker News new | ask | show | jobs
Faster Causal Attention over Long Sequences (arxiv.org)
1 points by dpstart01 1109 days ago