Y
Hacker News
new
|
ask
|
show
|
jobs
by
esafak
198 days ago
Yeah,
DeepSeek Sparse Attention
. Section 2:
https://arxiv.org/abs/2512.02556