Hacker News
by miven 754 days ago
I don't get your point. How is what you're suggesting here different from the few papers we already have on KV cache pruning methods, like [1]?

[1] https://arxiv.org/abs/2305.15805