Hacker News
by esafak 198 days ago
Yeah, DeepSeek Sparse Attention. Section 2: https://arxiv.org/abs/2512.02556