Hacker News new | ask | show | jobs
How Minimax-01 Achieves 1M Token Context Length with Linear Attention (MIT) (yacinemahdid.com)
2 points by research_pie 443 days ago