by thw20 58 days ago
Good work! This is very interesting. Here's a related paper that constructs a low-rank approximation for attention: https://arxiv.org/abs/2505.12942.

The idea of the query calibration matrix Rxx might be of interest to the author!

1 comment

Thanks, really appreciate the pointer. Will dig into it.