Hacker News new | ask | show | jobs
TriAttention: Efficient Long Reasoning with Trigonometric KV Compression (weianmao.github.io)
2 points by tamnd 74 days ago