CoLT5: Faster Long-Range Transformers with Conditional Computation (arxiv.org)
4 points by 1xdevloper 1189 days ago